The batch client takes a yaml configuration file that specifies the on-premises container endpoints. The following example can be written to `/mnt/my_nfs/config.yaml`, which is used in the examples below.
```yaml
MyContainer1:
  concurrency: 5
  host: 192.168.0.100
  port: 5000
  rtf: 3
MyContainer2:
  concurrency: 5
  host: localhost
  port: 6001
  rtf: 2
MyContainer3:
  concurrency: 10
  host: contoso-ai-vm
  port: 6001
  rtf: 4
```
This yaml example specifies three Speech containers on three hosts. The first host is specified by an IPv4 address, the second is running on the same VM as the batch client, and the third container is specified by the DNS hostname of another VM. The `concurrency` value specifies the maximum concurrent file transcriptions that can run on the same container. The `rtf` (Real-Time Factor) value is optional and can be used to tune performance.
The batch client can dynamically detect if an endpoint becomes unavailable (for example, due to a container restart or networking issue), and when it becomes available again. Transcription requests will not be sent to containers that are unavailable, and the client will continue using other available containers. You can add, remove, or edit endpoints at any time without interrupting the progress of your batch.
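For example, a new endpoint can be appended to the configuration file while a batch is running. The container name, host, and port below are illustrative assumptions, not values from this article.

```bash
# Illustrative sketch: append a fourth endpoint to the shared configuration file
# while a batch is in progress. The batch client picks up the change and starts
# dispatching work to the new container without interrupting the batch.
cat >> /mnt/my_nfs/config.yaml <<'EOF'
MyContainer4:
  concurrency: 5
  host: 192.168.0.101
  port: 5000
EOF
```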
## Run the batch processing container
> [!NOTE]
Use the Docker `run` command to start the container. This will start an interactive session.
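A minimal sketch of such a command follows. The image name, the `--configuration` and `-input_folder` flags, and the `ONESHOT` mode value are assumptions for illustration; only `--run-mode` and `-output_folder` are referenced elsewhere in this article.

```bash
# Sketch only: the image name and most flags are assumptions, not the documented command.
# Mount the NFS share that holds the audio files and config.yaml, then point the
# batch client at the configuration, input, and output locations.
docker run --rm -ti \
  -v /mnt/my_nfs:/my_nfs \
  docker.io/batchkit/speech-batch-kit:latest \
  --configuration /my_nfs/config.yaml \
  -input_folder /my_nfs/audio_files \
  -output_folder /my_nfs/transcriptions \
  --run-mode ONESHOT
```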
The client will start running. If an audio file has already been transcribed in a previous run, the client will automatically skip the file. Files are sent with an automatic retry if transient errors occur, and you can differentiate between which errors you want the client to retry on. On a transcription error, the client will continue transcription, and can retry without losing progress.
## Run modes
The batch processing kit offers three modes, using the `--run-mode` parameter.
#### [REST](#tab/rest)
`REST` mode is an API server mode that provides a basic set of HTTP endpoints for audio file batch submission, status checking, and long polling. It also enables programmatic consumption using a Python module extension, or importing it as a submodule.
:::image type="content" source="media/containers/batch-rest-api-mode.png" alt-text="A diagram showing the batch-kit container processing files in REST mode.":::
1. Define the Speech container endpoints that the batch client will use in the `config.yaml` file.
2. Send an HTTP request to one of the API server's endpoints (see the example requests after this list).
|Endpoint |Description |
|---------|---------|
|`/submit` | Endpoint for creating new batch requests. |
|`/status` | Endpoint for checking the status of a batch request. The connection will stay open until the batch completes. |
|`/watch` | Endpoint for using HTTP long polling until the batch completes. |
3. Audio files are uploaded from the input directory. If the audio file has already been transcribed in a previous run with the same output directory (same file name and checksum), the client will skip the file.
4. If a request is sent to the `/submit` endpoint, the files are dispatched to the container endpoints from step 1.
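For illustration, the endpoints above might be exercised as follows. The host, port, request body, and `batch_id` parameter are assumptions, not the documented request schema; check the batch-kit API reference for the exact format.

```bash
# Hypothetical requests against an API server assumed to listen on localhost:5000.
# The JSON body for /submit and the batch_id parameter are illustrative assumptions.

# Create a new batch request.
curl -X POST "http://localhost:5000/submit" \
  -H "Content-Type: application/json" \
  -d '{"files": ["/my_nfs/audio_files/audio1.wav", "/my_nfs/audio_files/audio2.wav"]}'

# Check the status of a batch; the connection stays open until the batch completes.
curl "http://localhost:5000/status?batch_id=1"

# Long poll until the batch completes.
curl "http://localhost:5000/watch?batch_id=1"
```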
The output directory specified by `-output_folder` will contain a *run_summary.json* file.
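As a quick check once a run finishes, that summary can be inspected directly; the path below assumes the output folder used in the earlier `docker run` sketch.

```bash
# Assumes -output_folder was mapped to /mnt/my_nfs/transcriptions on the host.
cat /mnt/my_nfs/transcriptions/run_summary.json
```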
* [How to install and run containers](speech-container-howto.md)