You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/includes/release-notes/release-notes-stt.md
+17-10Lines changed: 17 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,17 +2,24 @@
2
2
author: eric-urban
3
3
ms.service: cognitive-services
4
4
ms.topic: include
5
-
ms.date: 01/08/2022
5
+
ms.date: 12/08/2022
6
6
ms.author: eur
7
7
---
8
-
### 2022-Oct release
9
8
10
-
#### New Speech-to-text-locale
9
+
### December 2022 release
10
+
11
+
#### Speech-to-text REST API
12
+
13
+
The speech-to-text REST API version 3.1 is generally available. Version 3.0 of the [speech-to-text REST API](../../rest-speech-to-text.md) will be retired. For more information about how to migrate, see the [guide](../../migrate-v3-0-to-v3-1.md).
14
+
15
+
### October 2022 release
16
+
17
+
#### New speech-to-text locale
11
18
12
19
Added support for Malayalam (India) with the `ml-IN` locale. See the complete language list [here](../../language-support.md?tabs=stt-tts).
13
20
14
21
15
-
### 2022-July release
22
+
### July 2022 release
16
23
17
24
#### New Speech-to-text-locales:
18
25
@@ -29,7 +36,7 @@ Added 7 new locales as shown in the following table. See the complete language l
29
36
|`cy-GB`| Welsh (United Kingdom) |
30
37
31
38
32
-
### 2022-Jun release
39
+
### June 2022 release
33
40
34
41
#### New Speech-to-text-locales:
35
42
@@ -49,7 +56,7 @@ Added 10 new locales as shown in the following table. See the complete language
49
56
|`ne-NP`| Nepali (Nepal) |
50
57
51
58
52
-
### 2022-April release
59
+
### April 2022 release
53
60
54
61
#### New Speech-to-text-locales:
55
62
@@ -60,7 +67,7 @@ Below is a list of the new locales. See the complete language list [here](../../
60
67
|`bn-IN`| Bengali (India) |
61
68
62
69
63
-
### 2022-January release
70
+
### January 2022 release
64
71
65
72
#### New Speech-to-text-locales:
66
73
@@ -88,7 +95,7 @@ Below is a list of the new locales. See the complete language list [here](../../
88
95
|`zu-ZA`| Zulu (South Africa) |
89
96
90
97
91
-
### 2021-July release
98
+
### July 2021 release
92
99
93
100
#### New Speech-to-text-locales:
94
101
@@ -116,7 +123,7 @@ Below is a list of the new locales. See the complete language list [here](../../
116
123
|`sw-KE`| Swahili (Kenya) |
117
124
118
125
119
-
### 2021-January release
126
+
### January 2021 release
120
127
121
128
#### New Speech-to-text-locales:
122
129
@@ -142,7 +149,7 @@ Below is a list of the new locales. See the complete language list [here](../../
142
149
|`ms-MY`| Malay (Malaysia) |
143
150
|`vi-VN`| Vietnamese (Vietnam) |
144
151
145
-
### 2020-August Release
152
+
### August 2020 Release
146
153
147
154
#### New speech-to-text locales:
148
155
Speech-to-text released 26 new locales in August: 2 European languages `cs-CZ` and `hu-HU`, 5 English locales and 19 Spanish locales that cover most South American countries. Below is a list of the new locales. See the complete language list [here](../../language-support.md?tabs=stt-tts).
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/includes/release-notes/release-notes-tts.md
+31-25Lines changed: 31 additions & 25 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,7 +6,13 @@ ms.date: 11/14/2022
6
6
ms.author: eur
7
7
---
8
8
9
-
### 2022-November release
9
+
### December 2022 release
10
+
11
+
#### Batch synthesis REST API (Preview)
12
+
13
+
The Batch synthesis API is currently in public preview. Once it's generally available, the Long Audio API will be deprecated. For more information, see [Migrate to batch synthesis API](../../migrate-to-batch-synthesis.md).
14
+
15
+
### November 2022 release
10
16
11
17
#### Prebuilt Neural TTS Voices (GA)
12
18
@@ -25,7 +31,7 @@ The following locale support is added for [Custom Neural Voice](../../custom-neu
25
31
- Added support for the `fr-BE` locale with Custom Neural Voice Pro.
26
32
- Added support for the `es-ES` locale with Custom Neural Voice Lite.
27
33
28
-
### 2022-October release
34
+
### October 2022 release
29
35
30
36
#### Prebuilt Neural TTS Voices (GA)
31
37
@@ -61,13 +67,13 @@ The following voices are now available in public preview. See the [full language
61
67
- Added support for the `style="cheerful"` tag with the following voices: `en-GB-RyanNeural`, `en-GB-SoniaNeural`, `es-MX-JorgeNeural`, `fr-FR-DeniseNeural`, `fr-FR-HenriNeural`, and `it-IT-IsabellaNeural`.
62
68
- Added support for the `style="sad"` tag with the following voices: `en-GB-SoniaNeural`, `fr-FR-DeniseNeural` and `fr-FR-HenriNeural`.
63
69
64
-
### 2022-September release
70
+
### September 2022 release
65
71
66
72
#### Prebuilt Neural TTS Voice
67
73
68
74
* All the prebuilt neural voices have been upgraded to high-fidelity voices with 48kHz sample rate.
69
75
70
-
### 2022-August release
76
+
### August 2022 release
71
77
72
78
#### Prebuilt Neural TTS Voice
73
79
@@ -77,7 +83,7 @@ Released new voices in public preview:
77
83
78
84
For more information, see the [language and voice list](../../language-support.md?tabs=stt-tts).
79
85
80
-
### 2022-July release
86
+
### July 2022 release
81
87
82
88
#### Prebuilt Neural TTS Voice
83
89
@@ -107,7 +113,7 @@ For more information, see the [language and voice list](../../language-support.m
107
113
* Added support for blend shapes to drive the facial movements of a 3D character that you designed. Learn more at [how to get facial position with viseme](../../how-to-speech-synthesis-viseme.md).
108
114
* SSML updated to support viseme element. See [speech synthesis markup](../../speech-synthesis-markup-structure.md#viseme-element).
109
115
110
-
### 2022-June release
116
+
### June 2022 release
111
117
112
118
#### Prebuilt Neural TTS Voice
113
119
@@ -234,7 +240,7 @@ For more information, see the [language and voice list](../../language-support.m
234
240
* Supported pagination.
235
241
* Enabled to sort globally by name, file type, and update time on work file page.
236
242
237
-
### 2022-May release
243
+
### May 2022 release
238
244
239
245
#### Prebuilt Neural TTS Voice
240
246
@@ -264,7 +270,7 @@ For more information, see the [language and voice list](../../language-support.m
264
270
* Enhanced performance: Specified the maximum number (200) of files to be uploaded at one time.
265
271
* Enhanced performance: Specified the maximum directory depth level (5 levels).
266
272
267
-
### 2022-March release
273
+
### March 2022 release
268
274
269
275
#### Prebuilt Neural TTS Voice
270
276
@@ -280,7 +286,7 @@ For more information, see the [language and voice list](../../language-support.m
280
286
281
287
* Updated the file size and concurrency limit for free-tier (F0) resources to make the experience consistent with the Speech SDK and APIs. See [speech service quotas and limits](../../speech-services-quotas-and-limits.md#audio-content-creation-tool).
282
288
283
-
### 2022-February release
289
+
### February 2022 release
284
290
285
291
#### Custom Neural Voice
286
292
@@ -292,7 +298,7 @@ For more information, see the [language and voice list](../../language-support.m
292
298
293
299
* Removed the output length limit for downloading audios.
294
300
295
-
### 2022-January release
301
+
### January 2022 release
296
302
297
303
#### New languages and voices
298
304
@@ -379,21 +385,21 @@ For the full list of available voices, see [Language support](../../language-sup
379
385
- Custom Neural Voice: enabled additional model testing using the batch API (long audio API)
380
386
- Audio Content Creation: enabled more output formats
381
387
382
-
### 2021-October release
388
+
### October 2021 release
383
389
384
390
#### New languages and voices
385
391
386
392
Added 49 new languages and 98 voices for Neural text-to-speech:
387
393
388
394
Adri in `af-ZA` Afrikaans (South Africa), Willem in `af-ZA` Afrikaans (South Africa), Mekdes in `am-ET` Amharic (Ethiopia), Ameha in `am-ET` Amharic (Ethiopia), Fatima in `ar-AE` Arabic (United Arab Emirates), Hamdan in `ar-AE` Arabic (United Arab Emirates), Laila in `ar-BH` Arabic (Bahrain), Ali in `ar-BH` Arabic (Bahrain), Amina in `ar-DZ` Arabic (Algeria), Ismael in `ar-DZ` Arabic (Algeria), Rana in `ar-IQ` Arabic (Iraq), Bassel in `ar-IQ` Arabic (Iraq), Sana in `ar-JO` Arabic (Jordan), Taim in `ar-JO` Arabic (Jordan), Noura in `ar-KW` Arabic (Kuwait), Fahed in `ar-KW` Arabic (Kuwait), Iman in `ar-LY` Arabic (Libya), Omar in `ar-LY` Arabic (Libya), Mouna in `ar-MA` Arabic (Morocco), Jamal in `ar-MA` Arabic (Morocco), Amal in `ar-QA` Arabic (Qatar), Moaz in `ar-QA` Arabic (Qatar), Amany in `ar-SY` Arabic (Syria), Laith in `ar-SY` Arabic (Syria), Reem in `ar-TN` Arabic (Tunisia), Hedi in `ar-TN` Arabic (Tunisia), Maryam in `ar-YE` Arabic (Yemen), Saleh in `ar-YE` Arabic (Yemen), Nabanita in `bn-BD` Bangla (Bangladesh), Pradeep in `bn-BD` Bangla (Bangladesh), Asilia in `en-KE` English (Kenya), Chilemba in `en-KE` English (Kenya), Ezinne in `en-NG` English (Nigeria), Abeo in `en-NG` English (Nigeria), Imani in `en-TZ` English (Tanzania), Elimu in `en-TZ` English (Tanzania), Sofia in `es-BO` Spanish (Bolivia), Marcelo in `es-BO` Spanish (Bolivia), Catalina in `es-CL` Spanish (Chile), Lorenzo in `es-CL` Spanish (Chile), Maria in `es-CR` Spanish (Costa Rica), Juan in `es-CR` Spanish (Costa Rica), Belkys in `es-CU` Spanish (Cuba), Manuel in `es-CU` Spanish (Cuba), Ramona in `es-DO` Spanish (Dominican Republic), Emilio in `es-DO` Spanish (Dominican Republic), Andrea in `es-EC` Spanish (Ecuador), Luis in `es-EC` Spanish (Ecuador), Teresa in `es-GQ` Spanish (Equatorial Guinea), Javier in `es-GQ` Spanish (Equatorial Guinea), Marta in `es-GT` Spanish (Guatemala), Andres in `es-GT` Spanish (Guatemala), Karla in `es-HN` Spanish (Honduras), Carlos in `es-HN` Spanish (Honduras), Yolanda in `es-NI` Spanish (Nicaragua), Federico in `es-NI` Spanish (Nicaragua), Margarita in `es-PA` Spanish (Panama), Roberto in `es-PA` Spanish (Panama), Camila in `es-PE` Spanish (Peru), Alex in `es-PE` Spanish (Peru), Karina in `es-PR` Spanish (Puerto Rico), Victor in `es-PR` Spanish (Puerto Rico), Tania in `es-PY` Spanish (Paraguay), Mario in `es-PY` Spanish (Paraguay), Lorena in `es-SV` Spanish (El Salvador), Rodrigo in `es-SV` Spanish (El Salvador), Valentina in `es-UY` Spanish (Uruguay), Mateo in `es-UY` Spanish (Uruguay), Paola in `es-VE` Spanish (Venezuela), Sebastian in `es-VE` Spanish (Venezuela), Dilara in `fa-IR` Persian (Iran), Farid in `fa-IR` Persian (Iran), Blessica in `fil-PH` Filipino (Philippines), Angelo in `fil-PH` Filipino (Philippines), Sabela in `gl-ES` Galician (Spain), Roi in `gl-ES` Galician (Spain), Siti in `jv-ID` Javanese (Indonesia), Dimas in `jv-ID` Javanese (Indonesia), Sreymom in `km-KH` Khmer (Cambodia), Piseth in `km-KH` Khmer (Cambodia), Nilar in `my-MM` Burmese (Myanmar), Thiha in `my-MM` Burmese (Myanmar), Ubax in `so-SO` Somali (Somalia), Muuse in `so-SO` Somali (Somalia), Tuti in `su-ID` Sundanese (Indonesia), Jajang in `su-ID` Sundanese (Indonesia), Rehema in `sw-TZ` Swahili (Tanzania), Daudi in `sw-TZ` Swahili (Tanzania), Saranya in `ta-LK` Tamil (Sri Lanka), Kumar in `ta-LK` Tamil (Sri Lanka), Venba in `ta-SG` Tamil (Singapore), Anbu in `ta-SG` Tamil (Singapore), Gul in `ur-IN` Urdu (India), Salman in `ur-IN` Urdu (India), Madina in `uz-UZ` Uzbek (Uzbekistan), Sardor in `uz-UZ` Uzbek (Uzbekistan), Thando in `zu-ZA` Zulu (South Africa), Themba in `zu-ZA` Zulu (South Africa).
389
395
390
-
### 2021-September release
396
+
### September 2021 release
391
397
-**New chatbot voice in `en-US` English (US)**: Sara, represents a young female adult that talks more casually and fits best for the chatbot scenarios.
392
398
-**New styles added for `ja-JP` Japanese voice Nanami**: Three new styles are now available with Nanami: chat, customer service, and cheerful.
393
399
-**Overall pronunciation improvement**: Ardi in `id-ID`, Premwadee in `th-TH`, Christel in `da-DK`, HoaiMy and NamMinh in `vi-VN`.
394
400
-**Two new voices in `zh-CN` Chinese (Mandarin, China) in preview**: Xiaochen & Xiaoyan, optimized for spontaneous speech and customer service scenarios.
395
401
396
-
### 2021-July release
402
+
### July 2021 release
397
403
398
404
**Neural text-to-speech updates**
399
405
- Reduced pronunciation errors in Hebrew by 20%.
@@ -402,14 +408,14 @@ Adri in `af-ZA` Afrikaans (South Africa), Willem in `af-ZA` Afrikaans (South Afr
402
408
-**Custom Neural Voice**: Updated the training pipeline to UniTTSv3 with which the model quality is improved while training time is reduced by 50% for acoustic models.
403
409
-**Audio Content Creation**: Fixed the "Export" performance issue and the bug on custom neural voice selection.
404
410
405
-
### 2021-June release
411
+
### June 2021 release
406
412
407
413
#### Speech Studio updates
408
414
409
415
-**Custom Neural Voice**: Custom Neural Voice training extended to support South East Asia. New features released to support data uploading status checking.
410
416
-**Audio Content Creation**: Released a new feature to support custom lexicon. With this feature, users can easily create their lexicon files and define the customized pronunciation for their audio output.
411
417
412
-
### 2021-May release
418
+
### May 2021 release
413
419
414
420
**New languages and voices added for neural TTS**
415
421
@@ -419,19 +425,19 @@ Adri in `af-ZA` Afrikaans (South Africa), Willem in `af-ZA` Afrikaans (South Afr
419
425
420
426
-**Five `zh-CN` Chinese (Mandarin, Simplified) voices are generally available** - 5 Chinese (Mandarin, Simplified) voices are changed from preview to generally available. They are Yunxi, Xiaomo, Xiaoman, Xiaoxuan, Xiaorui. Now, these voices are available in all [regions](../../regions.md#speech-service). Yunxi is added with a new 'assistant' style, which is suitable for chat bot and voice agent. Xiaomo's voice styles are refined to be more natural and featured.
421
427
422
-
### 2021-April release
428
+
### April 2021 release
423
429
424
430
**Neural text-to-speech is available across 21 regions**
425
431
426
432
-**Twelve new regions added** - Neural text-to-speech is now available in these new 12 regions: `Japan East`, `Japan West`, `Korea Central`, `North Central US`, `North Europe`, `South Central US`, `Southeast Asia`, `UK South`, `west Central US`, `West Europe`, `West US`, `West US 2`. Check [here](../../regions.md#speech-service) for full list of 21 supported regions.
427
433
428
-
### 2021-March release
434
+
### March 2021 release
429
435
430
436
**New languages and voices added for neural TTS**
431
437
432
438
-**Six new languages introduced** - 12 new voices in 6 new locales are added into the neural TTS language list: Nia in `cy-GB` Welsh (United Kingdom), Aled in `cy-GB` Welsh (United Kingdom), Rosa in `en-PH` English (Philippines), James in `en-PH` English (Philippines), Charline in `fr-BE` French (Belgium), Gerard in `fr-BE` French (Belgium), Dena in `nl-BE` Dutch (Belgium), Arnaud in `nl-BE` Dutch (Belgium), Polina in `uk-UA` Ukrainian (Ukraine), Ostap in `uk-UA` Ukrainian (Ukraine), Uzma in `ur-PK` Urdu (Pakistan), Asad in `ur-PK` Urdu (Pakistan).
433
439
434
-
-**Five languages from preview to GA** - 10 voices in 5 locales introduced in 2020-November now are GA: Kert in `et-EE` Estonian (Estonia), Colm in `ga-IE` Irish (Ireland), Nils in `lv-LV` Latvian (Latvia), Leonas in `lt-LT` Lithuanian (Lithuania), Joseph in `mt-MT` Maltese (Malta).
440
+
-**Five languages from preview to GA** - 10 voices in 5 locales introduced in November now are GA: Kert in `et-EE` Estonian (Estonia), Colm in `ga-IE` Irish (Ireland), Nils in `lv-LV` Latvian (Latvia), Leonas in `lt-LT` Lithuanian (Lithuania), Joseph in `mt-MT` Maltese (Malta).
435
441
436
442
-**New male voice added for French (Canada)** - A new voice Antoine is available for `fr-CA` French (Canada).
437
443
@@ -447,15 +453,15 @@ Neural Text-to-Speech now includes the [viseme event](../../how-to-speech-synthe
447
453
448
454
The [bookmark element](../../speech-synthesis-markup-structure.md#bookmark-element) allows you to insert custom markers in SSML to get the offset of each marker in the audio stream. It can be used to reference a specific location in the text or tag sequence.
449
455
450
-
### 2021-February release
456
+
### February 2021 release
451
457
452
458
**Custom Neural Voice GA**
453
459
454
460
Custom Neural Voice is GA in February in 13 languages: Chinese (Mandarin, Simplified), English (Australia), English (India), English (United Kingdom), English (United States), French (Canada), French (France), German (Germany), Italian (Italy), Japanese (Japan), Korean (Korea), Portuguese (Brazil), Spanish (Mexico), and Spanish (Spain). Learn more about [what is Custom Neural Voice](../../custom-neural-voice.md) and [how to use it responsibly](../../concepts-guidelines-responsible-deployment-synthetic.md).
455
461
Custom Neural Voice feature requires registration and Microsoft may limit access based on Microsoft's eligibility criteria. Learn more about the [limited access](/legal/cognitive-services/speech-service/custom-neural-voice/limited-access-custom-neural-voice?context=/azure/cognitive-services/speech-service/context/context).
456
462
457
463
458
-
### 2020-December release
464
+
### December 2020 release
459
465
460
466
**New neural voices in GA and preview**
461
467
@@ -480,7 +486,7 @@ Visit the [Audio Content Creation tool](https://speech.microsoft.com/audioconten
480
486
- Updated all `zh-CN` multi-style neural voices to support `StyleDegree` control. Emotion intensity (soft or strong) is adjustable.
481
487
- Updated `zh-CN-YunyeNeural` to support multiple styles which can perform different emotions.
482
488
483
-
### 2020-November release
489
+
### November 2020 release
484
490
485
491
**New locales and voices in preview**
486
492
-**Five new voices and languages** are introduced to the Neural text-to-speech portfolio. They are: Grace in Maltese (Malta), Ona in Lithuanian (Lithuania), Anu in Estonian (Estonia), Orla in Irish (Ireland) and Everita in Latvian (Latvia).
@@ -498,7 +504,7 @@ Visit the [Audio Content Creation tool](https://speech.microsoft.com/audioconten
498
504
499
505
> Read more at [this tech blog](https://techcommunity.microsoft.com/t5/azure-ai/neural-text-to-speech-previews-five-new-languages-with/ba-p/1907604).
500
506
501
-
### 2020-October release
507
+
### October 2020 release
502
508
503
509
#### New features
504
510
- Jenny supports a new `newscast` style. See [how to use the speaking styles in SSML](../../speech-synthesis-markup-voice.md#speaking-styles-and-roles).
@@ -512,7 +518,7 @@ Visit the [Audio Content Creation tool](https://speech.microsoft.com/audioconten
-`zh-CN`: Improved Erhua pronunciation and light tone and refined space prosody, which greatly improves intelligibility.
514
520
515
-
### 2020-September release
521
+
### September 2020 release
516
522
517
523
#### New features
518
524
@@ -523,9 +529,9 @@ Visit the [Audio Content Creation tool](https://speech.microsoft.com/audioconten
523
529
524
530
***Containers: Neural text-to-speech Container released in public preview with 16 voices available in 14 languages.** Learn more on [how to deploy Speech Containers for Neural text-to-speech](../../speech-container-howto.md)
525
531
526
-
Read the [full announcement of the TTS updates for Ignite 2020](https://techcommunity.microsoft.com/t5/azure-ai/ignite-2020-neural-tts-updates-new-language-support-more-voices/ba-p/1698544)
532
+
Read the [full announcement of the TTS updates for Ignite 2020](https://techcommunity.microsoft.com/t5/azure-ai/ignite-neural-tts-updates-new-language-support-more-voices/ba-p/1698544)
0 commit comments