Skip to content

Commit 468847f

Browse files
committed
update speech containers versions
1 parent 5ca33aa commit 468847f

File tree

6 files changed

+45
-34
lines changed

6 files changed

+45
-34
lines changed

articles/ai-services/speech-service/includes/release-notes/release-notes-containers.md

Lines changed: 8 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,10 +2,17 @@
22
author: eric-urban
33
ms.service: azure-ai-speech
44
ms.topic: include
5-
ms.date: 10/21/2024
5+
ms.date: 2/24/2025
66
ms.author: eur
77
---
88

9+
### 2025-February release
10+
11+
Add support for the latest model versions:
12+
- Speech language identification 1.18.0
13+
- Neural text to speech 3.7.0
14+
- Speech to text 4.12.0
15+
- Custom speech to text 4.12.0
916

1017
### 2024-October release
1118

articles/ai-services/speech-service/speech-container-cstt.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -30,7 +30,7 @@ The fully qualified container image name is, `mcr.microsoft.com/azure-cognitive-
3030
| Version | Path |
3131
|-----------|------------|
3232
| Latest | `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text:latest` |
33-
| 4.10.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text:4.10.0-amd64` |
33+
| 4.12.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/custom-speech-to-text:4.12.0-amd64` |
3434

3535
All tags, except for `latest`, are in the following format and are case sensitive:
3636

@@ -55,6 +55,8 @@ The tags are also available [in JSON format](https://mcr.microsoft.com/v2/azure-
5555
"4.8.0-amd64",
5656
"4.9.0-amd64",
5757
"4.10.0-amd64",
58+
"4.11.0-amd64",
59+
"4.12.0-amd64",
5860
"latest"
5961
]
6062
}

articles/ai-services/speech-service/speech-container-lid.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -37,7 +37,7 @@ The fully qualified container image name is, `mcr.microsoft.com/azure-cognitive-
3737
| Version | Path |
3838
|-----------|------------|
3939
| Latest | `mcr.microsoft.com/azure-cognitive-services/speechservices/language-detection:latest` |
40-
| 1.16.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/language-detection:1.16.0-amd64-preview` |
40+
| 1.18.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/language-detection:1.18.0-amd64-preview` |
4141

4242
All tags, except for `latest`, are in the following format and are case sensitive:
4343

@@ -58,6 +58,8 @@ The tags are also available [in JSON format](https://mcr.microsoft.com/v2/azure-
5858
"1.14.0-amd64-preview",
5959
"1.15.0-amd64-preview",
6060
"1.16.0-amd64-preview",
61+
"1.17.0-amd64-preview",
62+
"1.18.0-amd64-preview",
6163
"1.3.0-amd64-preview",
6264
"1.5.0-amd64-preview",
6365
"1.6.1-amd64-preview",

articles/ai-services/speech-service/speech-container-ntts.md

Lines changed: 14 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,7 @@ The fully qualified container image name is, `mcr.microsoft.com/azure-cognitive-
3131
| Version | Path |
3232
|-----------|------------|
3333
| Latest | `mcr.microsoft.com/azure-cognitive-services/speechservices/neural-text-to-speech:latest`<br/><br/>The `latest` tag pulls the `en-US` locale and `en-us-arianeural` voice. |
34-
| 3.5.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/neural-text-to-speech:3.5.0-amd64-en-us-arianeural` |
34+
| 3.7.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/neural-text-to-speech:3.7.0-amd64-en-us-arianeural` |
3535

3636
All tags, except for `latest`, are in the following format and are case sensitive:
3737

@@ -46,19 +46,19 @@ The tags are also available [in JSON format](https://mcr.microsoft.com/v2/azure-
4646
"name": "azure-cognitive-services/speechservices/neural-text-to-speech",
4747
"tags": [
4848
<--redacted for brevity-->
49-
"3.5.0-amd64-uk-ua-ostapneural",
50-
"3.5.0-amd64-zh-cn-xiaochenneural-preview",
51-
"3.5.0-amd64-zh-cn-xiaohanneural",
52-
"3.5.0-amd64-zh-cn-xiaomoneural",
53-
"3.5.0-amd64-zh-cn-xiaoqiuneural-preview",
54-
"3.5.0-amd64-zh-cn-xiaoruineural",
55-
"3.5.0-amd64-zh-cn-xiaoshuangneural-preview",
56-
"3.5.0-amd64-zh-cn-xiaoxiaoneural",
57-
"3.5.0-amd64-zh-cn-xiaoyanneural-preview",
58-
"3.5.0-amd64-zh-cn-xiaoyouneural",
59-
"3.5.0-amd64-zh-cn-yunxineural",
60-
"3.5.0-amd64-zh-cn-yunyangneural",
61-
"3.5.0-amd64-zh-cn-yunyeneural",
49+
"3.7.0-amd64-uk-ua-ostapneural",
50+
"3.7.0-amd64-zh-cn-xiaochenneural-preview",
51+
"3.7.0-amd64-zh-cn-xiaohanneural",
52+
"3.7.0-amd64-zh-cn-xiaomoneural",
53+
"3.7.0-amd64-zh-cn-xiaoqiuneural-preview",
54+
"3.7.0-amd64-zh-cn-xiaoruineural",
55+
"3.7.0-amd64-zh-cn-xiaoshuangneural-preview",
56+
"3.7.0-amd64-zh-cn-xiaoxiaoneural",
57+
"3.7.0-amd64-zh-cn-xiaoyanneural-preview",
58+
"3.7.0-amd64-zh-cn-xiaoyouneural",
59+
"3.7.0-amd64-zh-cn-yunxineural",
60+
"3.7.0-amd64-zh-cn-yunyangneural",
61+
"3.7.0-amd64-zh-cn-yunyeneural",
6262
"latest"
6363
]
6464
}

articles/ai-services/speech-service/speech-container-overview.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -22,10 +22,10 @@ The following table lists the Speech containers available in the Microsoft Conta
2222

2323
| Container | Features | Supported versions and locales |
2424
|--|--|--|
25-
| [Speech to text](speech-container-stt.md) | Transcribes continuous real-time speech or batch audio recordings with intermediate results. | Latest: 4.10.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/speech-to-text/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/speech-to-text/tags/list).|
26-
| [Custom speech to text](speech-container-cstt.md) | Using a custom model from the [custom speech portal](https://speech.microsoft.com/customspeech), transcribes continuous real-time speech or batch audio recordings into text with intermediate results. | Latest: 4.8.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/custom-speech-to-text/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/speech-to-text/tags/list). |
27-
| [Speech language identification](speech-container-lid.md)<sup>1, 2</sup> | Detects the language spoken in audio files. | Latest: 1.16.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/language-detection/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/language-detection/tags/list). |
28-
| [Neural text to speech](speech-container-ntts.md) | Converts text to natural-sounding speech by using deep neural network technology, which allows for more natural synthesized speech. | Latest: 3.5.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/neural-text-to-speech/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/neural-text-to-speech/tags/list). |
25+
| [Speech to text](speech-container-stt.md) | Transcribes continuous real-time speech or batch audio recordings with intermediate results. | Latest: 4.12.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/speech-to-text/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/speech-to-text/tags/list).|
26+
| [Custom speech to text](speech-container-cstt.md) | Using a custom model from the [custom speech portal](https://speech.microsoft.com/customspeech), transcribes continuous real-time speech or batch audio recordings into text with intermediate results. | Latest: 4.12.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/custom-speech-to-text/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/speech-to-text/tags/list). |
27+
| [Speech language identification](speech-container-lid.md)<sup>1, 2</sup> | Detects the language spoken in audio files. | Latest: 1.18.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/language-detection/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/language-detection/tags/list). |
28+
| [Neural text to speech](speech-container-ntts.md) | Converts text to natural-sounding speech by using deep neural network technology, which allows for more natural synthesized speech. | Latest: 3.7.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/neural-text-to-speech/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/neural-text-to-speech/tags/list). |
2929

3030
<sup>1</sup> The container is available in public preview. Containers in preview are still under development and don't meet Microsoft's stability and support requirements.
3131
<sup>2</sup> Not available as a disconnected container.

articles/ai-services/speech-service/speech-container-stt.md

Lines changed: 13 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,7 @@ The fully qualified container image name is, `mcr.microsoft.com/azure-cognitive-
3131
| Version | Path |
3232
|-----------|------------|
3333
| Latest | `mcr.microsoft.com/azure-cognitive-services/speechservices/speech-to-text:latest`<br/><br/>The `latest` tag pulls the latest image for the `en-US` locale. |
34-
| 4.10.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/speech-to-text:4.10.0-amd64-mr-in` |
34+
| 4.12.0 | `mcr.microsoft.com/azure-cognitive-services/speechservices/speech-to-text:4.12.0-amd64-mr-in` |
3535

3636
All tags, except for `latest`, are in the following format and are case sensitive:
3737

@@ -46,18 +46,18 @@ The tags are also available [in JSON format](https://mcr.microsoft.com/v2/azure-
4646
"name": "azure-cognitive-services/speechservices/speech-to-text",
4747
"tags": [
4848
<--redacted for brevity-->
49-
"4.10.0-amd64-sw-tz",
50-
"4.10.0-amd64-ta-in",
51-
"4.10.0-amd64-th-th",
52-
"4.10.0-amd64-tr-tr",
53-
"4.10.0-amd64-vi-vn",
54-
"4.10.0-amd64-wuu-cn",
55-
"4.10.0-amd64-yue-cn",
56-
"4.10.0-amd64-zh-cn",
57-
"4.10.0-amd64-zh-cn-sichuan",
58-
"4.10.0-amd64-zh-hk",
59-
"4.10.0-amd64-zh-tw",
60-
"4.10.0-amd64-zu-za",
49+
"4.12.0-amd64-sw-tz",
50+
"4.12.0-amd64-ta-in",
51+
"4.12.0-amd64-th-th",
52+
"4.12.0-amd64-tr-tr",
53+
"4.12.0-amd64-vi-vn",
54+
"4.12.0-amd64-wuu-cn",
55+
"4.12.0-amd64-yue-cn",
56+
"4.12.0-amd64-zh-cn",
57+
"4.12.0-amd64-zh-cn-sichuan",
58+
"4.12.0-amd64-zh-hk",
59+
"4.12.0-amd64-zh-tw",
60+
"4.12.0-amd64-zu-za",
6161
"latest"
6262
]
6363
}

0 commit comments

Comments
 (0)