You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/speech-service/sovereign-clouds.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@ manager: nitinme
7
7
ms.service: azure-ai-speech
8
8
ms.topic: how-to
9
9
ms.custom: references_regions
10
-
ms.date: 11/17/2023
10
+
ms.date: 1/21/2024
11
11
ms.author: alexeyo
12
12
---
13
13
@@ -64,7 +64,7 @@ Replace `<REGION_IDENTIFIER>` with the identifier matching the region of your su
64
64
65
65
#### Speech SDK
66
66
67
-
For [Speech SDK](speech-sdk.md) in sovereign clouds you need to use "from host / with host" instantiation of `SpeechConfig` class or `--host` option of [Speech CLI](spx-overview.md). (You may also use "from endpoint / with endpoint" instantiation and `--endpoint` Speech CLI option).
67
+
For [Speech SDK](speech-sdk.md) in sovereign clouds, you need to use "from host / with host" instantiation of `SpeechConfig` class or `--host` option of [Speech CLI](spx-overview.md). (You can also use "from endpoint / with endpoint" instantiation and `--endpoint` Speech CLI option).
68
68
69
69
`SpeechConfig` class should be instantiated like this:
70
70
@@ -161,7 +161,7 @@ Replace `<REGION_IDENTIFIER>` with the identifier matching the region of your su
161
161
162
162
#### Speech SDK
163
163
164
-
For [Speech SDK](speech-sdk.md) in sovereign clouds you need to use "from host / with host" instantiation of `SpeechConfig` class or `--host` option of [Speech CLI](spx-overview.md). (You may also use "from endpoint / with endpoint" instantiation and `--endpoint` Speech CLI option).
164
+
For [Speech SDK](speech-sdk.md) in sovereign clouds you need to use "from host / with host" instantiation of `SpeechConfig` class or `--host` option of [Speech CLI](spx-overview.md). (You can also use "from endpoint / with endpoint" instantiation and `--endpoint` Speech CLI option).
165
165
166
166
`SpeechConfig` class should be instantiated like this:
@@ -54,9 +54,9 @@ Enrollment for speaker identification is text-independent. There are no restrict
54
54
55
55
Speaker enrollment data is stored in a secured system, including the speech audio for enrollment and the voice signature features. The speech audio for enrollment is only used when the algorithm is upgraded, and the features need to be extracted again. The service doesn't retain the speech recording or the extracted voice features that are sent to the service during the recognition phase.
56
56
57
-
You control how long data should be retained. You can create, update, and delete enrollment data for individual speakers through API calls. When the subscription is deleted, all the speaker enrollment data associated with the subscription will also be deleted.
57
+
You control how long data should be retained. You can create, update, and delete enrollment data for individual speakers through API calls. When the subscription is deleted, all the speaker enrollment data associated with the subscription is also deleted.
58
58
59
-
As with all of the Azure AI services resources, developers who use the speaker recognition feature must be aware of Microsoft policies on customer data. You should ensure that you have received the appropriate permissions from the users. You can find more details in [Data and privacy for speaker recognition](/legal/cognitive-services/speech-service/speaker-recognition/data-privacy-speaker-recognition). For more information, see the [Azure AI services page](https://azure.microsoft.com/support/legal/cognitive-services-compliance-and-privacy/) on the Microsoft Trust Center.
59
+
As with all of the Azure AI services resources, developers who use the speaker recognition feature must be aware of Microsoft policies on customer data. You should ensure that you received the appropriate permissions from the users. You can find more details in [Data and privacy for speaker recognition](/legal/cognitive-services/speech-service/speaker-recognition/data-privacy-speaker-recognition). For more information, see the [Azure AI services page](https://azure.microsoft.com/support/legal/cognitive-services-compliance-and-privacy/) on the Microsoft Trust Center.
60
60
61
61
## Common questions and solutions
62
62
@@ -72,7 +72,7 @@ As with all of the Azure AI services resources, developers who use the speaker r
72
72
73
73
## Responsible AI
74
74
75
-
An AI system includes not only the technology, but also the people who will use it, the people who will be affected by it, and the environment in which it is deployed. Read the transparency notes to learn about responsible AI use and deployment in your systems.
75
+
An AI system includes not only the technology, but also the people who use it, the people who are affected by it, and the environment in which it's deployed. Read the transparency notes to learn about responsible AI use and deployment in your systems.
76
76
77
77
*[Transparency note and use cases](/legal/cognitive-services/speech-service/speaker-recognition/transparency-note-speaker-recognition?context=/azure/ai-services/speech-service/context/context)
78
78
*[Characteristics and limitations](/legal/cognitive-services/speech-service/speaker-recognition/characteristics-and-limitations-speaker-recognition?context=/azure/ai-services/speech-service/context/context)
Copy file name to clipboardExpand all lines: articles/ai-services/speech-service/speech-container-batch-processing.md
+17-17Lines changed: 17 additions & 17 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,7 +6,7 @@ author: eric-urban
6
6
manager: nitinme
7
7
ms.service: azure-ai-speech
8
8
ms.topic: how-to
9
-
ms.date: 10/22/2020
9
+
ms.date: 1/21/2024
10
10
ms.author: eur
11
11
---
12
12
@@ -16,15 +16,15 @@ Use the batch processing kit to complement and scale out workloads on Speech con
16
16
17
17
:::image type="content" source="media/containers/general-diagram.png" alt-text="A diagram showing an example batch-kit container workflow.":::
18
18
19
-
The batch kit container is available for free on [GitHub](https://github.com/microsoft/batch-processing-kit) and [Docker hub](https://hub.docker.com/r/batchkit/speech-batch-kit/tags). You are only [billed](speech-container-overview.md#billing) for the Speech containers you use.
19
+
The batch kit container is available for free on [GitHub](https://github.com/microsoft/batch-processing-kit) and [Docker hub](https://hub.docker.com/r/batchkit/speech-batch-kit/tags). You're only [billed](speech-container-overview.md#billing) for the Speech containers you use.
20
20
21
21
| Feature | Description |
22
22
|---------|---------|
23
23
| Batch audio file distribution | Automatically dispatch large numbers of files to on-premises or cloud-based Speech container endpoints. Files can be on any POSIX-compliant volume, including network filesystems. |
24
24
| Speech SDK integration | Pass common flags to the Speech SDK, including: n-best hypotheses, diarization, language, profanity masking. |
25
25
|Run modes | Run the batch client once, continuously in the background, or create HTTP endpoints for audio files. |
26
26
| Fault tolerance | Automatically retry and continue transcription without losing progress, and differentiate between which errors can, and can't be retried on. |
27
-
| Endpoint availability detection | If an endpoint becomes unavailable, the batch client will continue transcribing, using other container endpoints. After becoming available again, the client will automatically begin using the endpoint. |
27
+
| Endpoint availability detection | If an endpoint becomes unavailable, the batch client continues transcribing, using other container endpoints. When the client is available it automatically begins using the endpoint. |
28
28
| Endpoint hot-swapping | Add, remove, or modify Speech container endpoints during runtime without interrupting the batch progress. Updates are immediate. |
29
29
| Real-time logging | Real-time logging of attempted requests, timestamps, and failure reasons, with Speech SDK log files for each audio file. |
30
30
@@ -72,7 +72,7 @@ MyContainer3:
72
72
73
73
This yaml example specifies three speech containers on three hosts. The first host is specified by a IPv4 address, the second is running on the same VM as the batch-client, and the third container is specified by the DNS hostname of another VM. The `concurrency` value specifies the maximum concurrent file transcriptions that can run on the same container. The `rtf` (Real-Time Factor) value is optional and can be used to tune performance.
74
74
75
-
The batch client can dynamically detect if an endpoint becomes unavailable (for example, due to a container restart or networking issue), and when it becomes available again. Transcription requests will not be sent to containers that are unavailable, and the client will continue using other available containers. You can add, remove, or edit endpoints at any time without interrupting the progress of your batch.
75
+
The batch client can dynamically detect if an endpoint becomes unavailable (for example, due to a container restart or networking issue), and when it becomes available again. Transcription requests won't be sent to containers that are unavailable, and the client continues using other available containers. You can add, remove, or edit endpoints at any time without interrupting the progress of your batch.
76
76
77
77
78
78
@@ -538,10 +538,10 @@ The batch client can dynamically detect if an endpoint becomes unavailable (for
538
538
539
539
> [!NOTE]
540
540
> * This example uses the same directory (`/my_nfs`) for the configuration file and the inputs, outputs, and logs directories. You can use hosted or NFS-mounted directories for these folders.
541
-
> * Running the client with `–h` will list the available command-line parameters, and their default values.
541
+
> * Running the client with the `–h` flag lists the available command-line parameters, and their default values.
542
542
> * The batch processing container is only supported on Linux.
543
543
544
-
Use the Docker `run` command to start the container. This will start an interactive shell inside the container.
544
+
Use the Docker `run` command to start the container. This command starts an interactive shell inside the container.
The client will start running. If an audio file has already been transcribed in a previous run, the client will automatically skip the file. Files are sent with an automatic retry if transient errors occur, and you can differentiate between which errors you want to the client to retry on. On a transcription error, the client will continue transcription, and can retry without losing progress.
946
+
The client starts running. If an audio file was transcribed in a previous run, the client automatically skips the file. Files are sent with an automatic retry if transient errors occur, and you can differentiate between which errors you want to the client to retry on. On a transcription error, the client continues transcription, and can retry without losing progress.
947
947
948
948
## Run modes
949
949
@@ -955,9 +955,9 @@ The batch processing kit offers three modes, using the `--run-mode` parameter.
955
955
956
956
:::image type="content" source="media/containers/batch-oneshot-mode.png" alt-text="A diagram showing the batch-kit container processing files in oneshot mode.":::
957
957
958
-
1. Define the Speech container endpoints that the batch client will use in the `config.yaml` file.
958
+
1. Define the Speech container endpoints that the batch client uses in the `config.yaml` file.
959
959
2. Place audio files for transcription in an input directory.
960
-
3. Invoke the container on the directory, which will begin processing the files. If the audio file has already been transcribed in a previous run with the same output directory (same file name and checksum), the client will skip the file.
960
+
3. Invoke the container on the directory to begin processing the files. If the audio file is already transcribed in a previous run with the same output directory (same file name and checksum), the client skips the file.
961
961
4. The files are dispatched to the container endpoints from step 1.
962
962
5. Logs and the Speech container output are returned to the specified output directory.
963
963
@@ -966,13 +966,13 @@ The batch processing kit offers three modes, using the `--run-mode` parameter.
966
966
> [!TIP]
967
967
> If multiple files are added to the input directory at the same time, you can improve performance by instead adding them in a regular interval.
968
968
969
-
`DAEMON`mode transcribes existing files in a given folder, and continuously transcribes new audio files as they are added.
969
+
`DAEMON`mode transcribes existing files in a given folder, and continuously transcribes new audio files as they're added.
1. Define the Speech container endpoints that the batch client will use in the `config.yaml` file.
974
-
2. Invoke the container on an input directory. The batch client will begin monitoring the directory for incoming files.
975
-
3. Set up continuous audio file delivery to the input directory. If the audio file has already been transcribed in a previous run with the same output directory (same file name and checksum), the client will skip the file.
973
+
1. Define the Speech container endpoints that the batch client uses in the `config.yaml` file.
974
+
2. Invoke the container on an input directory. The batch client begins monitoring the directory for incoming files.
975
+
3. Set up continuous audio file delivery to the input directory. If the audio file was transcribed in a previous run with the same output directory (same file name and checksum), the client skips the file.
976
976
4. Once a file write or POSIX signal is detected, the container is triggered to respond.
977
977
5. The files are dispatched to the container endpoints from step 1.
978
978
6. Logs and the Speech container output are returned to the specified output directory.
@@ -983,16 +983,16 @@ The batch processing kit offers three modes, using the `--run-mode` parameter.
983
983
984
984
:::image type="content" source="media/containers/batch-rest-api-mode.png" alt-text="A diagram showing the batch-kit container processing files in REST mode.":::
985
985
986
-
1. Define the Speech container endpoints that the batch client will use in the `config.yaml` file.
986
+
1. Define the Speech container endpoints that the batch client uses in the `config.yaml` file.
987
987
2. Send an HTTP request to one of the API server's endpoints.
988
988
989
989
|Endpoint |Description |
990
990
|---------|---------|
991
991
|`/submit` | Endpoint for creating new batch requests. |
992
-
|`/status` | Endpoint for checking the status of a batch request. The connection will stay open until the batch completes. |
992
+
|`/status` | Endpoint for checking the status of a batch request. The connection stays open until the batch completes. |
993
993
|`/watch` | Endpoint for using HTTP long polling until the batch completes. |
994
994
995
-
3. Audio files are uploaded from the input directory. If the audio file has already been transcribed in a previous run with the same output directory (same file name and checksum), the client will skip the file.
995
+
3. Audio files are uploaded from the input directory. If the audio file was transcribed in a previous run with the same output directory (same file name and checksum), the client skips the file.
996
996
4. If a request is sent to the `/submit` endpoint, the files are dispatched to the container endpoints from step
997
997
5. Logs and the Speech container output are returned to the specified output directory.
998
998
@@ -1004,7 +1004,7 @@ The batch processing kit offers three modes, using the `--run-mode` parameter.
1004
1004
1005
1005
The client creates a *run.log* file in the directory specified by the `-log_folder` argument in the docker `run` command. Logs are captured at the DEBUG level by default. The same logs are sent to the `stdout/stderr`, and filtered depending on the `-file_log_level` or `console_log_level` arguments. This log is only necessary for debugging, or if you need to send a trace for support. The logging folder also contains the Speech SDK logs for each audio file.
1006
1006
1007
-
The output directory specified by `-output_folder` will contain a *run_summary.json* file, which is periodically rewritten every 30 seconds or whenever new transcriptions are finished. You can use this file to check on progress as the batch proceeds. It will also contain the final run statistics and final status of every file when the batch is completed. The batch is completed when the process has a clean exit.
1007
+
The output directory specified by `-output_folder` contains a *run_summary.json* file, which is periodically rewritten every 30 seconds or whenever new transcriptions are finished. You can use this file to check on progress as the batch proceeds. It also contains the final run statistics and final status of every file when the batch is completed. The batch is completed when the process has a clean exit.
0 commit comments