You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/speech-service/speech-container-batch-processing.md
+7-6Lines changed: 7 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,8 +6,9 @@ author: eric-urban
6
6
manager: nitinme
7
7
ms.service: azure-ai-speech
8
8
ms.topic: how-to
9
-
ms.date: 1/21/2024
9
+
ms.date: 9/24/2024
10
10
ms.author: eur
11
+
#Customer intent: As a developer, I want to learn how to use the batch processing kit to scale Speech container requests.
11
12
---
12
13
13
14
# Batch processing kit for Speech containers
@@ -24,7 +25,7 @@ The batch kit container is available for free on [GitHub](https://github.com/mic
24
25
| Speech SDK integration | Pass common flags to the Speech SDK, including: n-best hypotheses, diarization, language, profanity masking. |
25
26
|Run modes | Run the batch client once, continuously in the background, or create HTTP endpoints for audio files. |
26
27
| Fault tolerance | Automatically retry and continue transcription without losing progress, and differentiate between which errors can, and can't be retried on. |
27
-
| Endpoint availability detection | If an endpoint becomes unavailable, the batch client continues transcribing, using other container endpoints. When the client is available it automatically begins using the endpoint. |
28
+
| Endpoint availability detection | If an endpoint becomes unavailable, the batch client continues transcribing, using other container endpoints. When the client is available, it automatically begins using the endpoint. |
28
29
| Endpoint hot-swapping | Add, remove, or modify Speech container endpoints during runtime without interrupting the batch progress. Updates are immediate. |
29
30
| Real-time logging | Real-time logging of attempted requests, timestamps, and failure reasons, with Speech SDK log files for each audio file. |
The batch client takes a yaml configuration file that specifies the on-premises container endpoints. The following example can be written to `/mnt/my_nfs/config.yaml`, which is used in the examples below.
44
+
The batch client takes a yaml configuration file that specifies the on-premises container endpoints. The following example can be written to `/mnt/my_nfs/config.yaml`, which is used in the following examples.
44
45
45
46
```yaml
46
47
MyContainer1:
@@ -62,7 +63,7 @@ MyContainer3:
62
63
63
64
This yaml example specifies three speech containers on three hosts. The first host is specified by a IPv4 address, the second is running on the same VM as the batch-client, and the third container is specified by the DNS hostname of another VM. The `concurrency` value specifies the maximum concurrent file transcriptions that can run on the same container. The `rtf` (Real-Time Factor) value is optional and can be used to tune performance.
64
65
65
-
The batch client can dynamically detect if an endpoint becomes unavailable (for example, due to a container restart or networking issue), and when it becomes available again. Transcription requests won't be sent to containers that are unavailable, and the client continues using other available containers. You can add, remove, or edit endpoints at any time without interrupting the progress of your batch.
66
+
The batch client can dynamically detect if an endpoint becomes unavailable (for example, due to a container restart or networking issue), and when it becomes available again. Transcription requests aren't sent to containers that are unavailable, and the client continues using other available containers. You can add, remove, or edit endpoints at any time without interrupting the progress of your batch.
66
67
67
68
## Run the batch processing container
68
69
@@ -95,11 +96,11 @@ The client starts running. If an audio file was transcribed in a previous run, t
95
96
96
97
The batch processing kit offers three modes, using the `--run-mode` parameter.
97
98
98
-
#### [Oneshot](#tab/oneshot)
99
+
#### [One-shot](#tab/oneshot)
99
100
100
101
`ONESHOT`mode transcribes a single batch of audio files (from an input directory and optional file list) to an output folder.
101
102
102
-
:::image type="content" source="media/containers/batch-oneshot-mode.png" alt-text="A diagram showing the batch-kit container processing files in oneshot mode.":::
103
+
:::image type="content" source="media/containers/batch-oneshot-mode.png" alt-text="A diagram showing the batch-kit container processing files in one-shot mode.":::
103
104
104
105
1. Define the Speech container endpoints that the batch client uses in the `config.yaml` file.
105
106
2. Place audio files for transcription in an input directory.
0 commit comments