update speech containers versions

eric-urban · eric-urban · commit 468847fee229 · 2025-02-24T17:30:40.000-08:00
diff --git a/articles/ai-services/speech-service/includes/release-notes/release-notes-containers.md b/articles/ai-services/speech-service/includes/release-notes/release-notes-containers.md
@@ -2,10 +2,17 @@
 author: eric-urban
 ms.service: azure-ai-speech
 ms.topic: include
-ms.date: 10/21/2024
+ms.date: 2/24/2025
 ms.author: eur
 ---
 
+### 2025-February release
+
+Add support for the latest model versions:
+- Speech language identification 1.18.0
+- Neural text to speech 3.7.0
+- Speech to text 4.12.0
+- Custom speech to text 4.12.0
 
 ### 2024-October release
 
diff --git a/articles/ai-services/speech-service/speech-container-cstt.md b/articles/ai-services/speech-service/speech-container-cstt.md
@@ -30,7 +30,7 @@ The fully qualified container image name is, `mcr.microsoft.com/azure-cognitive-
 | Version | Path |
 |-----------|------------|
 | Latest | `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text:latest` |
-| 4.10.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text:4.10.0-amd64` |
+| 4.12.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text:4.12.0-amd64` |
 
 All tags, except for `latest`, are in the following format and are case sensitive:
 
@@ -55,6 +55,8 @@ The tags are also available [in JSON format](https://mcr.microsoft.com/v2/azure-
     "4.8.0-amd64",
     "4.9.0-amd64",
     "4.10.0-amd64",
+    "4.11.0-amd64",
+    "4.12.0-amd64",
     "latest"
   ]
 }
diff --git a/articles/ai-services/speech-service/speech-container-lid.md b/articles/ai-services/speech-service/speech-container-lid.md
@@ -37,7 +37,7 @@ The fully qualified container image name is, `mcr.microsoft.com/azure-cognitive-
 | Version | Path |
 |-----------|------------|
 | Latest | `mcr.microsoft.com/azure-cognitive-services/speechservices/language-detection:latest` |
-| 1.16.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/language-detection:1.16.0-amd64-preview` |
+| 1.18.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/language-detection:1.18.0-amd64-preview` |
 
 All tags, except for `latest`, are in the following format and are case sensitive:
 
@@ -58,6 +58,8 @@ The tags are also available [in JSON format](https://mcr.microsoft.com/v2/azure-
     "1.14.0-amd64-preview",
     "1.15.0-amd64-preview",
     "1.16.0-amd64-preview",
+    "1.17.0-amd64-preview",
+    "1.18.0-amd64-preview",
     "1.3.0-amd64-preview",
     "1.5.0-amd64-preview",
     "1.6.1-amd64-preview",
diff --git a/articles/ai-services/speech-service/speech-container-ntts.md b/articles/ai-services/speech-service/speech-container-ntts.md
@@ -31,7 +31,7 @@ The fully qualified container image name is, `mcr.microsoft.com/azure-cognitive-
 | Version | Path |
 |-----------|------------|
 | Latest | `mcr.microsoft.com/azure-cognitive-services/speechservices/neural-text-to-speech:latest`<br/><br/>The `latest` tag pulls the `en-US` locale and `en-us-arianeural` voice. |
-| 3.5.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/neural-text-to-speech:3.5.0-amd64-en-us-arianeural` |
+| 3.7.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/neural-text-to-speech:3.7.0-amd64-en-us-arianeural` |
 
 All tags, except for `latest`, are in the following format and are case sensitive:
 
@@ -46,19 +46,19 @@ The tags are also available [in JSON format](https://mcr.microsoft.com/v2/azure-
   "name": "azure-cognitive-services/speechservices/neural-text-to-speech",
   "tags": [
     <--redacted for brevity-->
-    "3.5.0-amd64-uk-ua-ostapneural",
-    "3.5.0-amd64-zh-cn-xiaochenneural-preview",
-    "3.5.0-amd64-zh-cn-xiaohanneural",
-    "3.5.0-amd64-zh-cn-xiaomoneural",
-    "3.5.0-amd64-zh-cn-xiaoqiuneural-preview",
-    "3.5.0-amd64-zh-cn-xiaoruineural",
-    "3.5.0-amd64-zh-cn-xiaoshuangneural-preview",
-    "3.5.0-amd64-zh-cn-xiaoxiaoneural",
-    "3.5.0-amd64-zh-cn-xiaoyanneural-preview",
-    "3.5.0-amd64-zh-cn-xiaoyouneural",
-    "3.5.0-amd64-zh-cn-yunxineural",
-    "3.5.0-amd64-zh-cn-yunyangneural",
-    "3.5.0-amd64-zh-cn-yunyeneural",
+    "3.7.0-amd64-uk-ua-ostapneural",
+    "3.7.0-amd64-zh-cn-xiaochenneural-preview",
+    "3.7.0-amd64-zh-cn-xiaohanneural",
+    "3.7.0-amd64-zh-cn-xiaomoneural",
+    "3.7.0-amd64-zh-cn-xiaoqiuneural-preview",
+    "3.7.0-amd64-zh-cn-xiaoruineural",
+    "3.7.0-amd64-zh-cn-xiaoshuangneural-preview",
+    "3.7.0-amd64-zh-cn-xiaoxiaoneural",
+    "3.7.0-amd64-zh-cn-xiaoyanneural-preview",
+    "3.7.0-amd64-zh-cn-xiaoyouneural",
+    "3.7.0-amd64-zh-cn-yunxineural",
+    "3.7.0-amd64-zh-cn-yunyangneural",
+    "3.7.0-amd64-zh-cn-yunyeneural",
     "latest"
   ]
 }
diff --git a/articles/ai-services/speech-service/speech-container-overview.md b/articles/ai-services/speech-service/speech-container-overview.md
@@ -22,10 +22,10 @@ The following table lists the Speech containers available in the Microsoft Conta
 
 | Container | Features | Supported versions and locales |
 |--|--|--|
-| [Speech to text](speech-container-stt.md) | Transcribes continuous real-time speech or batch audio recordings with intermediate results.  | Latest: 4.10.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/speech-to-text/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/speech-to-text/tags/list).|
-| [Custom speech to text](speech-container-cstt.md) | Using a custom model from the [custom speech portal](https://speech.microsoft.com/customspeech), transcribes continuous real-time speech or batch audio recordings into text with intermediate results. | Latest: 4.8.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/custom-speech-to-text/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/speech-to-text/tags/list). |
-| [Speech language identification](speech-container-lid.md)<sup>1, 2</sup> | Detects the language spoken in audio files. | Latest: 1.16.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/language-detection/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/language-detection/tags/list). |
-| [Neural text to speech](speech-container-ntts.md) | Converts text to natural-sounding speech by using deep neural network technology, which allows for more natural synthesized speech. | Latest: 3.5.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/neural-text-to-speech/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/neural-text-to-speech/tags/list). |
+| [Speech to text](speech-container-stt.md) | Transcribes continuous real-time speech or batch audio recordings with intermediate results.  | Latest: 4.12.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/speech-to-text/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/speech-to-text/tags/list).|
+| [Custom speech to text](speech-container-cstt.md) | Using a custom model from the [custom speech portal](https://speech.microsoft.com/customspeech), transcribes continuous real-time speech or batch audio recordings into text with intermediate results. | Latest: 4.12.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/custom-speech-to-text/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/speech-to-text/tags/list). |
+| [Speech language identification](speech-container-lid.md)<sup>1, 2</sup> | Detects the language spoken in audio files. | Latest: 1.18.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/language-detection/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/language-detection/tags/list). |
+| [Neural text to speech](speech-container-ntts.md) | Converts text to natural-sounding speech by using deep neural network technology, which allows for more natural synthesized speech. | Latest: 3.7.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/neural-text-to-speech/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/neural-text-to-speech/tags/list). |
 
 <sup>1</sup> The container is available in public preview. Containers in preview are still under development and don't meet Microsoft's stability and support requirements.
 <sup>2</sup> Not available as a disconnected container.
diff --git a/articles/ai-services/speech-service/speech-container-stt.md b/articles/ai-services/speech-service/speech-container-stt.md
@@ -31,7 +31,7 @@ The fully qualified container image name is, `mcr.microsoft.com/azure-cognitive-
 | Version | Path |
 |-----------|------------|
 | Latest | `mcr.microsoft.com/azure-cognitive-services/speechservices/speech-to-text:latest`<br/><br/>The `latest` tag pulls the latest image for the `en-US` locale. |
-| 4.10.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/speech-to-text:4.10.0-amd64-mr-in` |
+| 4.12.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/speech-to-text:4.12.0-amd64-mr-in` |
 
 All tags, except for `latest`, are in the following format and are case sensitive:
 
@@ -46,18 +46,18 @@ The tags are also available [in JSON format](https://mcr.microsoft.com/v2/azure-
   "name": "azure-cognitive-services/speechservices/speech-to-text",
   "tags": [
     <--redacted for brevity-->    
-    "4.10.0-amd64-sw-tz",
-    "4.10.0-amd64-ta-in",
-    "4.10.0-amd64-th-th",
-    "4.10.0-amd64-tr-tr",
-    "4.10.0-amd64-vi-vn",
-    "4.10.0-amd64-wuu-cn",
-    "4.10.0-amd64-yue-cn",
-    "4.10.0-amd64-zh-cn",
-    "4.10.0-amd64-zh-cn-sichuan",
-    "4.10.0-amd64-zh-hk",
-    "4.10.0-amd64-zh-tw",
-    "4.10.0-amd64-zu-za",
+    "4.12.0-amd64-sw-tz",
+    "4.12.0-amd64-ta-in",
+    "4.12.0-amd64-th-th",
+    "4.12.0-amd64-tr-tr",
+    "4.12.0-amd64-vi-vn",
+    "4.12.0-amd64-wuu-cn",
+    "4.12.0-amd64-yue-cn",
+    "4.12.0-amd64-zh-cn",
+    "4.12.0-amd64-zh-cn-sichuan",
+    "4.12.0-amd64-zh-hk",
+    "4.12.0-amd64-zh-tw",
+    "4.12.0-amd64-zu-za",
     "latest"
   ]
 }