You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/captioning-concepts.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -225,7 +225,7 @@ Profanity filter is applied to the result `Text` and `MaskedNormalizedForm` prop
225
225
226
226
## Language identification
227
227
228
-
If the language in the audio could change, use continuous [language identification](language-identification.md). Language identification is used to identify languages spoken in audio when compared against a list of [supported languages](language-support.md#language-identification). You provide up to 10 candidate languages, at least one of which is expected be in the audio. The Speech service returns the most likely language in the audio.
228
+
If the language in the audio could change, use continuous [language identification](language-identification.md). Language identification is used to identify languages spoken in audio when compared against a list of [supported languages](language-support.md#speech-to-text-and-text-to-speech). You provide up to 10 candidate languages, at least one of which is expected be in the audio. The Speech service returns the most likely language in the audio.
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/custom-neural-voice.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -16,7 +16,7 @@ ms.author: eur
16
16
17
17
Custom Neural Voice is a text-to-speech feature that lets you create a one-of-a-kind, customized, synthetic voice for your applications. With Custom Neural Voice, you can build a highly natural-sounding voice by providing your audio samples as training data. If you're looking for ready-to-use options, check out our [text-to-speech](text-to-speech.md) service.
18
18
19
-
Based on the neural text-to-speech technology and the multilingual, multi-speaker, universal model, Custom Neural Voice lets you create synthetic voices that are rich in speaking styles, or adaptable cross languages. The realistic and natural sounding voice of Custom Neural Voice can represent brands, personify machines, and allow users to interact with applications conversationally. See the [supported languages](language-support.md#custom-neural-voice) for Custom Neural Voice.
19
+
Based on the neural text-to-speech technology and the multilingual, multi-speaker, universal model, Custom Neural Voice lets you create synthetic voices that are rich in speaking styles, or adaptable cross languages. The realistic and natural sounding voice of Custom Neural Voice can represent brands, personify machines, and allow users to interact with applications conversationally. See the [supported languages](language-support.md#speech-to-text-and-text-to-speech) for Custom Neural Voice.
20
20
21
21
> [!IMPORTANT]
22
22
> Custom Neural Voice access is limited based on eligibility and usage criteria. Request access on the [intake form](https://aka.ms/customneural).
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/how-to-custom-voice-create-voice.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -168,7 +168,7 @@ After you validate your data files, you can use them to build your Custom Neural
168
168
169
169
If you want to create a voice in the same language of your training data, select **Neural** method. For the **Neural** method, you can select different versions of the training recipe for your model. The versions vary according to the features supported and model training time. Normally new versions are enhanced ones with bugs fixed and new features supported. The latest version is selected by default.
170
170
171
-
You can also select **Neural - cross lingual** and **Target language** to create a secondary language for your voice model. Only one target language can be selected for a voice model. You don't need to prepare additional data in the target language for training, but your test script needs to be in the target language. For the languages supported by cross lingual feature, see [supported languages](language-support.md#custom-neural-voice).
171
+
You can also select **Neural - cross lingual** and **Target language** to create a secondary language for your voice model. Only one target language can be selected for a voice model. You don't need to prepare additional data in the target language for training, but your test script needs to be in the target language. For the languages supported by cross lingual feature, see [supported languages](language-support.md#speech-to-text-and-text-to-speech).
172
172
173
173
The same unit price applies to both **Neural** and **Neural - cross lingual**. Check [the pricing details](https://azure.microsoft.com/pricing/details/cognitive-services/speech-services/) for training.
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/how-to-custom-voice.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -14,7 +14,7 @@ ms.author: eur
14
14
15
15
# Create a Project
16
16
17
-
[Custom Neural Voice](https://aka.ms/customvoice) is a set of online tools that you use to create a recognizable, one-of-a-kind voice for your brand. All it takes to get started are a handful of audio files and the associated transcriptions. See if Custom Neural Voice supports your [language](language-support.md#custom-neural-voice) and [region](regions.md#speech-service).
17
+
[Custom Neural Voice](https://aka.ms/customvoice) is a set of online tools that you use to create a recognizable, one-of-a-kind voice for your brand. All it takes to get started are a handful of audio files and the associated transcriptions. See if Custom Neural Voice supports your [language](language-support.md#speech-to-text-and-text-to-speech) and [region](regions.md#speech-service).
18
18
19
19
> [!IMPORTANT]
20
20
> Custom Neural Voice Pro can be used to create higher-quality models that are indistinguishable from human recordings. For access you must commit to using it in alignment with our responsible AI principles. Learn more about our [policy on limited access](/legal/cognitive-services/speech-service/custom-neural-voice/limited-access-custom-neural-voice?context=%2fazure%2fcognitive-services%2fspeech-service%2fcontext%2fcontext) and [apply here](https://aka.ms/customneural).
@@ -50,7 +50,7 @@ To create a custom voice project:
50
50
51
51
## Cross lingual feature
52
52
53
-
With cross lingual feature (public preview), you can create a different language for your voice model. If the language of your training data is supported by cross lingual feature, you can create a voice that speaks a different language from your training data. For example, with the `zh-CN` training data, you can create a voice that speaks `en-US` or any of the languages supported by cross lingual feature. For details, see [supported languages](language-support.md#custom-neural-voice). You don't need to prepare additional data in the target language for training, but your test script needs to be in the target language.
53
+
With cross lingual feature (public preview), you can create a different language for your voice model. If the language of your training data is supported by cross lingual feature, you can create a voice that speaks a different language from your training data. For example, with the `zh-CN` training data, you can create a voice that speaks `en-US` or any of the languages supported by cross lingual feature. For details, see [supported languages](language-support.md#speech-to-text-and-text-to-speech). You don't need to prepare additional data in the target language for training, but your test script needs to be in the target language.
54
54
55
55
For how to create a different language from your training data, select the training method **Neural-cross lingual** during training. See [how to train your custom neural voice model](how-to-custom-voice-create-voice.md#train-your-custom-neural-voice-model).
> Viseme ID supports neural voices in [all viseme-supported locales](language-support.md#viseme). Scalable Vector Graphics (SVG) only supports neural voices in `en-US` locale, and blend shapes supports neural voices in `en-US` and `zh-CN` locales.
21
+
> Viseme ID supports neural voices in [all viseme-supported locales](language-support.md#speech-to-text-and-text-to-speech). Scalable Vector Graphics (SVG) only supports neural voices in `en-US` locale, and blend shapes supports neural voices in `en-US` and `zh-CN` locales.
22
22
23
23
A *viseme* is the visual description of a phoneme in spoken language. It defines the position of the face and mouth while a person is speaking. Each viseme depicts the key facial poses for a specific set of phonemes.
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/includes/cognitive-services-speech-service-endpoints-text-to-speech.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -47,7 +47,7 @@ Use this table to determine *availability of neural voices* by region or endpoin
47
47
| West US 3 |`https://westus3.tts.speech.microsoft.com/cognitiveservices/v1`|
48
48
49
49
> [!TIP]
50
-
> [Voices in preview](../language-support.md#prebuilt-neural-voices-in-preview) are available in only these three regions: East US, West Europe, and Southeast Asia.
50
+
> [Voices in preview](../language-support.md#speech-to-text-and-text-to-speech) are available in only these three regions: East US, West Europe, and Southeast Asia.
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/includes/release-notes/release-notes-tts.md
+8-9Lines changed: 8 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,7 +9,7 @@ ms.author: eur
9
9
10
10
#### Prebuilt Neural TTS Voice
11
11
12
-
* Added 5 new voices of `zh-CN` Chinese (Mandarin, Simplified) and 1 new voice of `en-US` English (United States) in Public Preview. See [full language and voice list](../../language-support.md#prebuilt-neural-voices).
12
+
* Added 5 new voices of `zh-CN` Chinese (Mandarin, Simplified) and 1 new voice of `en-US` English (United States) in Public Preview. See [full language and voice list](../../language-support.md#speech-to-text-and-text-to-speech).
| English (United Kingdom) |`en-GB`| Female |`en-GB-BellaNeural`| General |
71
71
| English (United Kingdom) |`en-GB`| Female |`en-GB-HollieNeural`| General |
72
72
| English (United Kingdom) |`en-GB`| Female |`en-GB-MaisieNeural`| General, child voice |
73
-
| English (United Kingdom) |`en-GB`| Female |`en-GB-MiaNeural` <sup>Retired on 30 October 2021, see below</sup> | General |
74
73
| English (United Kingdom) |`en-GB`| Female |`en-GB-OliviaNeural`| General |
75
74
| English (United Kingdom) |`en-GB`| Female |`en-GB-SoniaNeural`| General |
76
75
| English (United Kingdom) |`en-GB`| Male |`en-GB-AlfieNeural`| General |
@@ -167,11 +166,11 @@ ms.author: eur
167
166
168
167
#### Prebuilt Neural TTS Voice
169
168
170
-
* Released 5 new voices in public preview with multiple styles to enrich the variety in American English. See [full language and voice list](../../language-support.md#prebuilt-neural-voices).
169
+
* Released 5 new voices in public preview with multiple styles to enrich the variety in American English. See [full language and voice list](../../language-support.md#speech-to-text-and-text-to-speech).
171
170
* Support these new styles `Angry`, `Excited`, `Friendly`, `Hopeful`, `Sad`, `Shouting`, `Unfriendly`, `Terrified` and `Whispering` in public preview for `en-US-AriaNeural`.
172
171
* Support these new styles `Angry`, `Cheerful`, `Excited`, `Friendly`, `Hopeful`, `Sad`, `Shouting`, `Unfriendly`, `Terrified` and `Whispering` in public preview for `en-US-GuyNeural`, `en-US-JennyNeural`.
173
172
* Support these new styles `Excited`, `Friendly`, `Hopeful`, `Shouting`, `Unfriendly`, `Terrified` and `Whispering` in public preview for `en-US-SaraNeural`. See [voice styles and roles](../../language-support.md#voice-styles-and-roles).
174
-
* Released new voices `zh-CN-YunjianNeural`, `zh-CN-YunhaoNeural`, and `zh-CN-YunfengNeural` in public preview. See [full language and voice list](../../language-support.md#prebuilt-neural-voices).
173
+
* Released new voices `zh-CN-YunjianNeural`, `zh-CN-YunhaoNeural`, and `zh-CN-YunfengNeural` in public preview. See [full language and voice list](../../language-support.md#speech-to-text-and-text-to-speech).
175
174
* Support 2 new styles `sports-commentary`, `sports-commentary-excited` in public preview for `zh-CN-YunjianNeural`. See [voice styles and roles](../../language-support.md#voice-styles-and-roles).
176
175
* Support 1 new style `advertisement-upbeat` in public preview for `zh-CN-YunhaoNeural`. See [voice styles and roles](../../language-support.md#voice-styles-and-roles).
177
176
* The `cheerful` and `sad` styles for `fr-FR-DeniseNeural` are generally available in all regions.
@@ -184,7 +183,7 @@ ms.author: eur
184
183
* Enabled to test your voice model by adding your own test script. Learn more on [how to upload your test script](../../how-to-custom-voice-create-voice.md#test-your-voice-model).
185
184
* Enabled to update engine version for your voice model. Learn more on [how to update the model engine version](../../how-to-custom-voice-create-voice.md#update-engine-version-for-your-voice-model).
186
185
* Supported more training regions. See [region support](../../regions.md#speech-service).
187
-
* Supported 10 locales for Custom Neural Voice Lite (preview). See [language support](../../language-support.md#custom-neural-voice).
186
+
* Supported 10 locales for Custom Neural Voice Lite (preview). See [language support](../../language-support.md#speech-to-text-and-text-to-speech).
188
187
189
188
#### Audio Content Creation tool
190
189
@@ -214,7 +213,7 @@ ms.author: eur
214
213
#### Custom Neural Voice
215
214
216
215
* Released Custom Neural Voice Lite in public preview. Learn more about [what is Custom Neural Voice Lite](../../custom-neural-voice.md#custom-neural-voice-project-types).
217
-
* Extended language support to 49 locales. See [language support](../../language-support.md#custom-neural-voice).
216
+
* Extended language support to 49 locales. See [language support](../../language-support.md#speech-to-text-and-text-to-speech).
218
217
* Supported more regions/datacenters. See [region support](../../regions.md#speech-service).
219
218
220
219
#### Audio Content Creation tool
@@ -250,7 +249,7 @@ Added 10 new languages and variants for Neural text-to-speech:
| Sinhala (Sri Lanka) |`si-LK`| Male |`si-LK-SameeraNeural` <sup>New</sup> | General |
252
251
253
-
For the full list of available voices, see [Language support](../../language-support.md#prebuilt-neural-voices).
252
+
For the full list of available voices, see [Language support](../../language-support.md#speech-to-text-and-text-to-speech).
254
253
255
254
#### New voices in preview
256
255
@@ -295,7 +294,7 @@ Added new voices for en-GB, fr-FR and de-DE in preview:
295
294
| German (Germany) |`de-DE`| Male |`de-DE-KlausNeural` <sup>New</sup> | General |
296
295
| German (Germany) |`de-DE`| Male |`de-DE-RalfNeural` <sup>New</sup> | General |
297
296
298
-
For the full list of available voices, see [Language support](../../language-support.md#prebuilt-neural-voices-in-preview).
297
+
For the full list of available voices, see [Language support](../../language-support.md#speech-to-text-and-text-to-speech).
299
298
300
299
#### Pronunciation accuracy
301
300
@@ -447,7 +446,7 @@ Visit the [Audio Content Creation tool](https://speech.microsoft.com/audioconten
447
446
448
447
***Neural text-to-speech**
449
448
***Extended to support 18 new languages/locales.** They are Bulgarian, Czech, German (Austria), German (Switzerland), Greek, English (Ireland), French (Switzerland), Hebrew, Croatian, Hungarian, Indonesian, Malay, Romanian, Slovak, Slovenian, Tamil, Telugu and Vietnamese.
450
-
***Released 14 new voices to enrich the variety in the existing languages.** See [full language and voice list](../../language-support.md#prebuilt-neural-voices).
449
+
***Released 14 new voices to enrich the variety in the existing languages.** See [full language and voice list](../../language-support.md#speech-to-text-and-text-to-speech).
451
450
***New speaking styles for `en-US` and `zh-CN` voices.** Jenny, the new voice in English (US), supports chatbot, customer service, and assistant styles. 10 new speaking styles are available with our zh-CN voice, XiaoXiao. In addition, the XiaoXiao neural voice supports `StyleDegree` tuning. See [how to use the speaking styles in SSML](../../speech-synthesis-markup.md#adjust-speaking-styles).
452
451
453
452
***Containers: Neural text-to-speech Container released in public preview with 16 voices available in 14 languages.** Learn more on [how to deploy Speech Containers for Neural text-to-speech](../../speech-container-howto.md)
Language identification is used to identify languages spoken in audio when compared against a list of [supported languages](language-support.md#language-identification).
18
+
Language identification is used to identify languages spoken in audio when compared against a list of [supported languages](language-support.md#speech-to-text-and-text-to-speech).
0 commit comments