MicrosoftDocs
diff --git a/‎articles/cognitive-services/Speech-Service/media/containers/mcr-tags-custom-speech-to-text.png
113 KB b/‎articles/cognitive-services/Speech-Service/media/containers/mcr-tags-custom-speech-to-text.png
113 KB
diff --git a/‎articles/cognitive-services/Speech-Service/media/containers/mcr-tags-language-detection.png
120 KB b/‎articles/cognitive-services/Speech-Service/media/containers/mcr-tags-language-detection.png
120 KB
diff --git a/‎articles/cognitive-services/Speech-Service/media/containers/mcr-tags-neural-text-to-speech.png
136 KB b/‎articles/cognitive-services/Speech-Service/media/containers/mcr-tags-neural-text-to-speech.png
136 KB
diff --git a/‎articles/cognitive-services/Speech-Service/media/containers/mcr-tags-speech-to-text.png
107 KB b/‎articles/cognitive-services/Speech-Service/media/containers/mcr-tags-speech-to-text.png
107 KB
diff --git a/‎articles/cognitive-services/Speech-Service/speech-container-batch-processing.md
Lines changed: 3 additions & 3 deletions b/‎articles/cognitive-services/Speech-Service/speech-container-batch-processing.md
Lines changed: 3 additions & 3 deletions
diff --git a/‎articles/cognitive-services/Speech-Service/speech-container-cstt.md
Lines changed: 58 additions & 39 deletions b/‎articles/cognitive-services/Speech-Service/speech-container-cstt.md
Lines changed: 58 additions & 39 deletions
diff --git a/‎articles/cognitive-services/Speech-Service/speech-container-howto.md
Lines changed: 4 additions & 1 deletion b/‎articles/cognitive-services/Speech-Service/speech-container-howto.md
Lines changed: 4 additions & 1 deletion
diff --git a/‎articles/cognitive-services/Speech-Service/speech-container-lid.md
Lines changed: 4 additions & 3 deletions b/‎articles/cognitive-services/Speech-Service/speech-container-lid.md
Lines changed: 4 additions & 3 deletions
diff --git a/‎articles/cognitive-services/Speech-Service/speech-container-ntts.md
Lines changed: 3 additions & 2 deletions b/‎articles/cognitive-services/Speech-Service/speech-container-ntts.md
Lines changed: 3 additions & 2 deletions
diff --git a/‎articles/cognitive-services/Speech-Service/speech-container-overview.md
Lines changed: 1 addition & 1 deletion b/‎articles/cognitive-services/Speech-Service/speech-container-overview.md
Lines changed: 1 addition & 1 deletion
@@ -712,7 +712,7 @@ Use the Docker `run` command to start the container. This will start an interact
 
 
 
-```Docker
+```bash
 docker run --network host --rm -ti -v /mnt/my_nfs:/my_nfs --entrypoint /bin/bash /mnt/my_nfs:/my_nfs docker.io/batchkit/speech-batch-kit:latest
 ```
 
@@ -758,7 +758,7 @@ To run the batch client:
 
 
 
-```Docker
+```bash
 run-batch-client -config /my_nfs/config.yaml -input_folder /my_nfs/audio_files -output_folder /my_nfs/transcriptions -log_folder /my_nfs/logs -file_log_level DEBUG -nbest 1 -m ONESHOT -diarization None -language en-US -strict_config
 ```
 
@@ -931,7 +931,7 @@ To run the batch client and container in a single command:
 
 
 
-```Docker
+```bash
 docker run --network host --rm -ti -v /mnt/my_nfs:/my_nfs docker.io/batchkit/speech-batch-kit:latest -config /my_nfs/config.yaml -input_folder /my_nfs/audio_files -output_folder /my_nfs/transcriptions -log_folder /my_nfs/logs
 ```
 
 
@@ -16,57 +16,66 @@ keywords: on-premises, Docker, container
 
 # Custom speech-to-text containers with Docker
 
-By using containers, you can run _some_ of the Azure Cognitive Services Speech service APIs in your own environment. Containers are great for specific security and data governance requirements. In this article, you'll learn how to download, install, and run a Speech container.
+The Custom speech-to-text container transcribes real-time speech or batch audio recordings with intermediate results. You can use a custom model that you created in the [Custom Speech portal](https://speech.microsoft.com/customspeech). In this article, you'll learn how to download, install, and run a Custom speech-to-text container.
 
-With Speech containers, you can build a speech application architecture that's optimized for both robust cloud capabilities and edge locality. Several containers are available, which use the same [pricing](https://azure.microsoft.com/pricing/details/cognitive-services/speech-services/) as the cloud-based Azure Speech services.
-
-## Available Speech containers
+> [!NOTE]
+> You must [request and get approval](speech-container-overview.md#request-approval-to-run-the-container) to use a Speech container. 
 
+For more information about prerequisites, validating that a container is running, running multiple containers on the same host, and running disconnected containers, see [Install and run Speech containers with Docker](speech-container-howto.md).
 
-Using a custom model from the [Custom Speech portal](https://speech.microsoft.com/customspeech), transcribes continuous real-time speech or batch audio recordings into text with intermediate results.
+## Container images
 
-The latest supported version  is 3.12.0. For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/custom-speech-to-text/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/speech-to-text/tags/list).
+The Custom speech-to-text container image for all supported versions and locales can be found on the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/custom-speech-to-text/tags) syndicate. It resides within the `azure-cognitive-services/speechservices/` repository and is named `custom-speech-to-text`. 
 
-You need the [prerequisites](speech-container-howto.md#prerequisites).
+:::image type="content" source="./media/containers/mcr-tags-custom-speech-to-text.png" alt-text="A screenshot of the search connectors and triggers dialog." lightbox="./media/containers/mcr-tags-custom-speech-to-text.png":::
 
-## Speech container images
+The fully qualified container image name is, `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text`. Either append a specific version or append `:latest` to get the most recent version.
 
-The Custom Speech-to-text container image can be found on the `mcr.microsoft.com` container registry syndicate. It resides within the `azure-cognitive-services/speechservices/` repository and is named `custom-speech-to-text`. The fully qualified container image name is `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text`. 
+| Version | Path |
+|-----------|------------|
+| Latest | `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text:latest` |
+| 3.12.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text:3.12.0-amd64` |
 
-To use the latest version of the container, you can use the `latest` tag. You can also find a full list of [tags on the MCR](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/custom-speech-to-text/tags).
+All tags, except for `latest`, are in the following format and are case sensitive:
 
-| Container | Repository |
-|-----------|------------|
-| Custom speech-to-text | `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text:latest` |
+```
+<major>.<minor>.<patch>-<platform>
+```
 
+> [!NOTE]
+> The `locale` and `voice` for custom speech-to-text containers is determined by the custom model ingested by the container.
+
+The tags are also available [in JSON format](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/custom-speech-to-text/tags/list) for your convenience. The body includes the container path and list of tags. The tags aren't sorted by version, but `"latest"` is always included at the end of the list as shown in this snippet:
+
+```json
+{
+  "name": "azure-cognitive-services/speechservices/custom-speech-to-text",
+  "tags": [
+    "2.10.0-amd64",
+    "2.11.0-amd64",
+    "2.12.0-amd64",
+    "2.12.1-amd64",
+    <--redacted for brevity-->
+    "latest"
+  ]
+}
+```
 
 ### Get the container image with docker pull
 
+You need the [prerequisites](speech-container-howto.md#prerequisites).
 
 Use the [docker pull](https://docs.docker.com/engine/reference/commandline/pull/) command to download a container image from Microsoft Container Registry:
 
-```Docker
+```bash
 docker pull mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text:latest
 ```
 
 > [!NOTE]
 > The `locale` and `voice` for custom Speech containers is determined by the custom model ingested by the container.
 
 
-## Use the container
-
-After the container is on the [host computer](speech-container-howto.md#host-computer-requirements-and-recommendations), use the following process to work with the container.
-
-1. [Run the container](#run-the-container-with-docker-run) with the required billing settings. More [examples](speech-container-configuration.md#example-docker-run-commands) of the `docker run` command are available.
-1. [Query the container's prediction endpoint](#query-the-containers-prediction-endpoint).
-
-## Run the container with docker run
-
-Use the [docker run](https://docs.docker.com/engine/reference/commandline/run/) command to run the container. For more information on how to get the `{Endpoint_URI}` and `{API_Key}` values, see [Gather required parameters](speech-container-howto.md#gather-required-parameters). More [examples](speech-container-configuration.md#example-docker-run-commands) of the `docker run` command are also available.
-
-> [!NOTE]
-> For general container requirements, see [Container requirements and recommendations](speech-container-howto.md#container-requirements-and-recommendations).
-
+## Get the custom model ID
 
 The custom speech-to-text container relies on a Custom Speech model. The custom model has to have been [trained](how-to-custom-speech-train-model.md) by using the [Speech Studio](https://aka.ms/speechstudio/customspeech).
 
@@ -78,16 +87,22 @@ Obtain the **Model ID** to use as the argument to the `ModelId` parameter of the
 
 ![Screenshot that shows Custom Speech model details.](media/custom-speech/custom-speech-model-details.png)
 
+## Run the container with docker run
+
+Use the [docker run](https://docs.docker.com/engine/reference/commandline/run/) command to run the container. 
+
 The following table represents the various `docker run` parameters and their corresponding descriptions:
 
 | Parameter | Description |
 |---------|---------|
-| `{VOLUME_MOUNT}` | The host computer [volume mount](https://docs.docker.com/storage/volumes/), which Docker uses to persist the custom model. An example is *C:\CustomSpeech* where the C drive is located on the host machine. |
-| `{MODEL_ID}` | The custom speech model ID. For more information, see [Custom Speech model lifecycle](how-to-custom-speech-model-and-endpoint-lifecycle.md). |
-| `{ENDPOINT_URI}` | The endpoint is required for metering and billing. For more information, see [Gather required parameters](#gather-required-parameters). |
-| `{API_KEY}` | The API key is required. For more information, see [Gather required parameters](#gather-required-parameters). |
+| `{VOLUME_MOUNT}` | The host computer [volume mount](https://docs.docker.com/storage/volumes/), which Docker uses to persist the custom model. An example is `c:\CustomSpeech` where the `c:\` drive is located on the host machine. |
+| `{MODEL_ID}` | The custom speech model ID. For more information, see [Get the custom model ID](#get-the-custom-model-id). |
+| `{ENDPOINT_URI}` | The endpoint is required for metering and billing. For more information, see [billing arguments](speech-container-howto.md#billing-arguments). |
+| `{API_KEY}` | The API key is required. For more information, see [billing arguments](speech-container-howto.md#billing-arguments). |
 
-To run the custom speech-to-text container, execute the following `docker run` command:
+When you run the custom speech-to-text container, configure the port, memory, and CPU according to the custom speech-to-text container [requirements and recommendations](speech-container-howto.md#container-requirements-and-recommendations).
+
+Here's an example `docker run` command with placeholder values. You must specify the `VOLUME_MOUNT`, `MODEL_ID`, `ENDPOINT_URI`, and `API_KEY` values:
 
 ```bash
 docker run --rm -it -p 5000:5000 --memory 8g --cpus 4 \
@@ -109,9 +124,9 @@ This command:
 * If the custom model was previously downloaded, the `ModelId` is ignored.
 * Automatically removes the container after it exits. The container image is still available on the host computer.
 
-#### Base model download on the custom speech-to-text container
+### Base model download on the custom speech-to-text container
 
-Starting in v2.6.0 of the custom-speech-to-text container, you can get the available base model information by using option `BaseModelLocale={LOCALE}`. This option gives you a list of available base models on that locale under your billing account. For example:
+You can get the available base model information by using option `BaseModelLocale={LOCALE}`. This option gives you a list of available base models on that locale under your billing account. For example:
 
 ```bash
 docker run --rm -it \
@@ -145,8 +160,12 @@ Checking available base model for en-us
 2020/10/30 21:54:21 [Fatal] Please run this tool again and assign --modelId '<one above base model id>'. If no model id listed above, it means currently there is no available base model for en-us
 ```
 
-#### Display model download on the custom speech-to-text container
-Starting in v3.1.0 of the custom-speech-to-text container, you can get the available display models information and choose to download those models into your speech-to-text container to get highly improved final display output. 
+### Display model download on the custom speech-to-text container
+
+You can get the available display models information and choose to download those models into your speech-to-text container to get highly improved final display output. 
+
+> [!NOTE] 
+> Display model download is available with custom-speech-to-text container version 3.1.0 and later.
 
 You can query or download any or all of these display model types: Rescoring (`Rescore`), Punctuation (`Punct`), resegmentation (`Resegment`), and wfstitn (`Wfstitn`). Otherwise, you can use the `FullDisplay` option (with or without the other types) to query or download all types of display models. 
 
@@ -190,10 +209,10 @@ ApiKey={API_KEY}
 
 #### Custom pronunciation on the custom speech-to-text container
 
-Starting in v2.5.0 of the custom-speech-to-text container, you can get custom pronunciation results in the output. All you need to do is have your own custom pronunciation rules set up in your custom model and mount the model to a custom-speech-to-text container.
+You can get custom pronunciation results in the output. All you need to do is have your own custom pronunciation rules set up in your custom model and mount the model to a custom-speech-to-text container.
 
 > [!IMPORTANT]
-> The `Eula`, `Billing`, and `ApiKey` options must be specified to run the container. Otherwise, the container won't start. For more information, see [Billing](#billing).
+> The `Eula`, `Billing`, and `ApiKey` options must be specified to run the container. Otherwise, the container won't start. For more information, see [billing arguments](speech-container-howto.md#billing-arguments).
 
 
 ## Use the container
 
@@ -102,6 +102,9 @@ The host is an x64-based computer that runs the Docker container. It can be a co
 * A [Kubernetes](https://kubernetes.io/) cluster deployed to [Azure Stack](/azure-stack/operator). For more information, see [Deploy Kubernetes to Azure Stack](/azure-stack/user/azure-stack-solution-template-kubernetes-deploy).
 
 
+> [!NOTE]
+> Containers support compressed audio input to the Speech SDK by using GStreamer.
+> To install GStreamer in a container, follow Linux instructions for GStreamer in [Use codec compressed audio input with the Speech SDK](how-to-use-codec-compressed-audio-input-streams.md).
 
 ### Advanced Vector Extension support
 
@@ -193,7 +196,7 @@ When you start or run the container, you might experience issues. Use an output
 
 Speech containers come with ASP.NET Core logging support. Here's an example of the `neural-text-to-speech container` started with defaut logging to the console:
 
-```Docker
+```bash
 docker run --rm -it -p 5000:5000 --memory 12g --cpus 6 \
 mcr.microsoft.com/azure-cognitive-services/speechservices/neural-text-to-speech \
 Eula=accept \
 
@@ -18,7 +18,8 @@ keywords: on-premises, Docker, container
 
 By using containers, you can run _some_ of the Azure Cognitive Services Speech service APIs in your own environment. Containers are great for specific security and data governance requirements. In this article, you'll learn how to download, install, and run a Speech container.
 
-With Speech containers, you can build a speech application architecture that's optimized for both robust cloud capabilities and edge locality. Several containers are available, which use the same [pricing](https://azure.microsoft.com/pricing/details/cognitive-services/speech-services/) as the cloud-based Azure Speech services.
+> [!NOTE]
+> You must [request and get approval](speech-container-overview.md#request-approval-to-run-the-container) to use a Speech container. 
 
 ## Available Speech containers
 
@@ -49,7 +50,7 @@ To use the latest version of the container, you can use the `latest` tag. You ca
 
 Use the [docker pull](https://docs.docker.com/engine/reference/commandline/pull/) command to download a container image from Microsoft Container Registry:
 
-```Docker
+```bash
 docker pull mcr.microsoft.com/azure-cognitive-services/speechservices/language-detection:latest
 ```
 
@@ -86,7 +87,7 @@ This command:
 
 If you want to run this container with the speech-to-text container, you can use this [docker image](https://hub.docker.com/r/antsu/on-prem-client). After both containers have been started, use this `docker run` command to execute `speech-to-text-with-languagedetection-client`:
 
-```Docker
+```bash
 docker run --rm -v ${HOME}:/root -ti antsu/on-prem-client:latest ./speech-to-text-with-languagedetection-client ./audio/LanguageDetection_en-us.wav --host localhost --lport 5003 --sport 5000
 ```
 
 
@@ -18,7 +18,8 @@ keywords: on-premises, Docker, container
 
 By using containers, you can run _some_ of the Azure Cognitive Services Speech service APIs in your own environment. Containers are great for specific security and data governance requirements. In this article, you'll learn how to download, install, and run a Speech container.
 
-With Speech containers, you can build a speech application architecture that's optimized for both robust cloud capabilities and edge locality. Several containers are available, which use the same [pricing](https://azure.microsoft.com/pricing/details/cognitive-services/speech-services/) as the cloud-based Azure Speech services.
+> [!NOTE]
+> You must [request and get approval](speech-container-overview.md#request-approval-to-run-the-container) to use a Speech container. 
 
 ## Available Speech containers
 
@@ -46,7 +47,7 @@ To use the latest version of the container, you can use the `latest` tag. You ca
 
 Use the [docker pull](https://docs.docker.com/engine/reference/commandline/pull/) command to download a container image from Microsoft Container Registry:
 
-```Docker
+```bash
 docker pull mcr.microsoft.com/azure-cognitive-services/speechservices/neural-text-to-speech:latest
 ```
 
 
@@ -23,7 +23,7 @@ By using containers, you can use a subset of the Speech service features in your
 
 ## Request approval to run the container
 
-To use the Speech containers, you must submit a request form and wait for approval. Fill out and submit a [request form](https://aka.ms/csgate) to request access to the container. 
+To use the Speech containers, you must submit a request form and wait for approval. Fill out and submit a request form to request access to the container. 
 * For connected containers, you must submit [this request form](https://aka.ms/csgate) and wait for approval.
 * For disconnected containers (not connected to the internet), you must submit [this request form](https://aka.ms/csdisconnectedcontainers) and wait for approval. For more information about applying and purchasing a commitment plan to use containers in disconnected environments, see [Use containers in disconnected environments](../containers/disconnected-containers.md#request-access-to-use-containers-in-disconnected-environments).