Merge pull request #936 from eric-urban/eur/containers-rn

prmerger-automator[bot] · web-flow · commit cc0749019fba · 2024-10-21T19:24:06.000Z
speech containers release notes
diff --git a/articles/ai-services/speech-service/includes/release-notes/release-notes-containers.md b/articles/ai-services/speech-service/includes/release-notes/release-notes-containers.md
@@ -2,17 +2,31 @@
 author: eric-urban
 ms.service: azure-ai-speech
 ms.topic: include
-ms.date: 9/17/2024
+ms.date: 10/21/2024
 ms.author: eur
 ---
 
+
+### 2024-October release
+
+Add support for the latest model versions:
+- Speech language identification 1.16.0
+- Neural text to speech 3.5.0
+    - Make `en-us-ariacpuneural` an alias to `en-us-jessacpuneural`
+    - Update the text to speech backend engine version
+- Speech to text 4.10.0
+    - Restore support for locale `uk-UA`
+    - Fix silence settings to work with long periods of silence in the audio
+    - Replace deprecated models: `cs-CZ`, `da-DK`, `en-GB`, `fr-CA`, `hu-HU`, `it-CH`, `tr-TR`, `zh-CN-sichuan`
+- Custom speech to text 4.10.0
+
 ### 2024-September release
 
 Add support for the latest model versions:
 - Speech language identification 1.15.0
     - Mitigate Vulnerabilities
 - Neural text to speech 3.4.0
-    -  New voices: `en-us-andrewmultilingualneural`, `en-us-jessaneural`, `es-us-alonsoneural`, `es-us-palomaneural`, `it-it-isabellamultilingualneural`
+    - New voices: `en-us-andrewmultilingualneural`, `en-us-jessaneural`, `es-us-alonsoneural`, `es-us-palomaneural`, `it-it-isabellamultilingualneural`
     - Mitigate Vulnerabilities
 - Speech to text 4.9.0
     - New Locales: `ar-YE`, `af-ZA`, `am-ET`, `ar-MA`, `ar-TN`, `sw-KE`, `sw-TZ`, `zu-ZA`
diff --git a/articles/ai-services/speech-service/speech-container-cstt.md b/articles/ai-services/speech-service/speech-container-cstt.md
@@ -30,7 +30,7 @@ The fully qualified container image name is, `mcr.microsoft.com/azure-cognitive-
 | Version | Path |
 |-----------|------------|
 | Latest | `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text:latest` |
-| 4.9.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text:4.9.0-amd64` |
+| 4.10.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text:4.10.0-amd64` |
 
 All tags, except for `latest`, are in the following format and are case sensitive:
 
@@ -54,6 +54,7 @@ The tags are also available [in JSON format](https://mcr.microsoft.com/v2/azure-
     "4.7.0-amd64",
     "4.8.0-amd64",
     "4.9.0-amd64",
+    "4.10.0-amd64",
     "latest"
   ]
 }
diff --git a/articles/ai-services/speech-service/speech-container-lid.md b/articles/ai-services/speech-service/speech-container-lid.md
@@ -37,7 +37,7 @@ The fully qualified container image name is, `mcr.microsoft.com/azure-cognitive-
 | Version | Path |
 |-----------|------------|
 | Latest | `mcr.microsoft.com/azure-cognitive-services/speechservices/language-detection:latest` |
-| 1.15.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/language-detection:1.15.0-amd64-preview` |
+| 1.16.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/language-detection:1.16.0-amd64-preview` |
 
 All tags, except for `latest`, are in the following format and are case sensitive:
 
@@ -57,6 +57,7 @@ The tags are also available [in JSON format](https://mcr.microsoft.com/v2/azure-
     "1.13.0-amd64-preview",
     "1.14.0-amd64-preview",
     "1.15.0-amd64-preview",
+    "1.16.0-amd64-preview",
     "1.3.0-amd64-preview",
     "1.5.0-amd64-preview",
     "1.6.1-amd64-preview",
diff --git a/articles/ai-services/speech-service/speech-container-ntts.md b/articles/ai-services/speech-service/speech-container-ntts.md
@@ -31,7 +31,7 @@ The fully qualified container image name is, `mcr.microsoft.com/azure-cognitive-
 | Version | Path |
 |-----------|------------|
 | Latest | `mcr.microsoft.com/azure-cognitive-services/speechservices/neural-text-to-speech:latest`<br/><br/>The `latest` tag pulls the `en-US` locale and `en-us-arianeural` voice. |
-| 3.4.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/neural-text-to-speech:3.4.0-amd64-en-us-arianeural` |
+| 3.5.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/neural-text-to-speech:3.5.0-amd64-en-us-arianeural` |
 
 All tags, except for `latest`, are in the following format and are case sensitive:
 
@@ -46,19 +46,19 @@ The tags are also available [in JSON format](https://mcr.microsoft.com/v2/azure-
   "name": "azure-cognitive-services/speechservices/neural-text-to-speech",
   "tags": [
     <--redacted for brevity-->
-    "3.4.0-amd64-uk-ua-ostapneural",
-    "3.4.0-amd64-zh-cn-xiaochenneural-preview",
-    "3.4.0-amd64-zh-cn-xiaohanneural",
-    "3.4.0-amd64-zh-cn-xiaomoneural",
-    "3.4.0-amd64-zh-cn-xiaoqiuneural-preview",
-    "3.4.0-amd64-zh-cn-xiaoruineural",
-    "3.4.0-amd64-zh-cn-xiaoshuangneural-preview",
-    "3.4.0-amd64-zh-cn-xiaoxiaoneural",
-    "3.4.0-amd64-zh-cn-xiaoyanneural-preview",
-    "3.4.0-amd64-zh-cn-xiaoyouneural",
-    "3.4.0-amd64-zh-cn-yunxineural",
-    "3.4.0-amd64-zh-cn-yunyangneural",
-    "3.4.0-amd64-zh-cn-yunyeneural",
+    "3.5.0-amd64-uk-ua-ostapneural",
+    "3.5.0-amd64-zh-cn-xiaochenneural-preview",
+    "3.5.0-amd64-zh-cn-xiaohanneural",
+    "3.5.0-amd64-zh-cn-xiaomoneural",
+    "3.5.0-amd64-zh-cn-xiaoqiuneural-preview",
+    "3.5.0-amd64-zh-cn-xiaoruineural",
+    "3.5.0-amd64-zh-cn-xiaoshuangneural-preview",
+    "3.5.0-amd64-zh-cn-xiaoxiaoneural",
+    "3.5.0-amd64-zh-cn-xiaoyanneural-preview",
+    "3.5.0-amd64-zh-cn-xiaoyouneural",
+    "3.5.0-amd64-zh-cn-yunxineural",
+    "3.5.0-amd64-zh-cn-yunyangneural",
+    "3.5.0-amd64-zh-cn-yunyeneural",
     "latest"
   ]
 }
diff --git a/articles/ai-services/speech-service/speech-container-overview.md b/articles/ai-services/speech-service/speech-container-overview.md
@@ -22,10 +22,10 @@ The following table lists the Speech containers available in the Microsoft Conta
 
 | Container | Features | Supported versions and locales |
 |--|--|--|
-| [Speech to text](speech-container-stt.md) | Transcribes continuous real-time speech or batch audio recordings with intermediate results.  | Latest: 4.9.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/speech-to-text/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/speech-to-text/tags/list).|
+| [Speech to text](speech-container-stt.md) | Transcribes continuous real-time speech or batch audio recordings with intermediate results.  | Latest: 4.10.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/speech-to-text/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/speech-to-text/tags/list).|
 | [Custom speech to text](speech-container-cstt.md) | Using a custom model from the [custom speech portal](https://speech.microsoft.com/customspeech), transcribes continuous real-time speech or batch audio recordings into text with intermediate results. | Latest: 4.8.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/custom-speech-to-text/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/speech-to-text/tags/list). |
-| [Speech language identification](speech-container-lid.md)<sup>1, 2</sup> | Detects the language spoken in audio files. | Latest: 1.15.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/language-detection/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/language-detection/tags/list). |
-| [Neural text to speech](speech-container-ntts.md) | Converts text to natural-sounding speech by using deep neural network technology, which allows for more natural synthesized speech. | Latest: 3.4.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/neural-text-to-speech/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/neural-text-to-speech/tags/list). |
+| [Speech language identification](speech-container-lid.md)<sup>1, 2</sup> | Detects the language spoken in audio files. | Latest: 1.16.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/language-detection/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/language-detection/tags/list). |
+| [Neural text to speech](speech-container-ntts.md) | Converts text to natural-sounding speech by using deep neural network technology, which allows for more natural synthesized speech. | Latest: 3.5.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/neural-text-to-speech/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/neural-text-to-speech/tags/list). |
 
 <sup>1</sup> The container is available in public preview. Containers in preview are still under development and don't meet Microsoft's stability and support requirements.
 <sup>2</sup> Not available as a disconnected container.
diff --git a/articles/ai-services/speech-service/speech-container-stt.md b/articles/ai-services/speech-service/speech-container-stt.md
@@ -31,7 +31,7 @@ The fully qualified container image name is, `mcr.microsoft.com/azure-cognitive-
 | Version | Path |
 |-----------|------------|
 | Latest | `mcr.microsoft.com/azure-cognitive-services/speechservices/speech-to-text:latest`<br/><br/>The `latest` tag pulls the latest image for the `en-US` locale. |
-| 4.9.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/speech-to-text:4.9.0-amd64-mr-in` |
+| 4.10.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/speech-to-text:4.10.0-amd64-mr-in` |
 
 All tags, except for `latest`, are in the following format and are case sensitive:
 
@@ -46,18 +46,18 @@ The tags are also available [in JSON format](https://mcr.microsoft.com/v2/azure-
   "name": "azure-cognitive-services/speechservices/speech-to-text",
   "tags": [
     <--redacted for brevity-->    
-    "4.9.0-amd64-sw-tz",
-    "4.9.0-amd64-ta-in",
-    "4.9.0-amd64-th-th",
-    "4.9.0-amd64-tr-tr",
-    "4.9.0-amd64-vi-vn",
-    "4.9.0-amd64-wuu-cn",
-    "4.9.0-amd64-yue-cn",
-    "4.9.0-amd64-zh-cn",
-    "4.9.0-amd64-zh-cn-sichuan",
-    "4.9.0-amd64-zh-hk",
-    "4.9.0-amd64-zh-tw",
-    "4.9.0-amd64-zu-za",
+    "4.10.0-amd64-sw-tz",
+    "4.10.0-amd64-ta-in",
+    "4.10.0-amd64-th-th",
+    "4.10.0-amd64-tr-tr",
+    "4.10.0-amd64-vi-vn",
+    "4.10.0-amd64-wuu-cn",
+    "4.10.0-amd64-yue-cn",
+    "4.10.0-amd64-zh-cn",
+    "4.10.0-amd64-zh-cn-sichuan",
+    "4.10.0-amd64-zh-hk",
+    "4.10.0-amd64-zh-tw",
+    "4.10.0-amd64-zu-za",
     "latest"
   ]
 }

Original file line number	Diff line number	Diff line change
@@ -30,7 +30,7 @@ The fully qualified container image name is, `mcr.microsoft.com/azure-cognitive-
`30`	`30`	`\| Version \| Path \|`
`31`	`31`	`\|-----------\|------------\|`
`32`	`32`	\| Latest \| `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text:latest` \|
`33`		-\| 4.9.0 \| `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text:4.9.0-amd64` \|
	`33`	+\| 4.10.0 \| `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text:4.10.0-amd64` \|
`34`	`34`
`35`	`35`	All tags, except for `latest`, are in the following format and are case sensitive:
`36`	`36`
`@@ -54,6 +54,7 @@ The tags are also available [in JSON format](https://mcr.microsoft.com/v2/azure-`
`54`	`54`	`"4.7.0-amd64",`
`55`	`55`	`"4.8.0-amd64",`
`56`	`56`	`"4.9.0-amd64",`
	`57`	`+ "4.10.0-amd64",`
`57`	`58`	`"latest"`
`58`	`59`	`]`
`59`	`60`	`}`