You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/includes/language-support/tts.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -156,7 +156,7 @@ ms.author: eur
156
156
157
157
<sup>3</sup> The neural voice is available in public preview. Voices and styles in public preview are only available in three service [regions](../../regions.md): East US, West Europe, and Southeast Asia.
158
158
159
-
<sup>4</sup> Visemes are supported for the locale of the neural voice. However, SVG is only supported for neural voices in the `en-US` locale, and blend shapes is only supported for neural voices in the `en-US` and `zh-CN` locales. For more information, see [Get facial position with viseme](../../how-to-speech-synthesis-viseme.md).
159
+
<sup>4</sup> Visemes are supported for the locale of the neural voice. However, SVG is only supported for neural voices in the `en-US` locale, and blend shapes is only supported for neural voices in the `en-US` and `zh-CN` locales. For more information, see [Get facial position with viseme](../../how-to-speech-synthesis-viseme.md) and [Viseme element](../../speech-synthesis-markup.md#viseme-element).
160
160
161
161
<sup>5</sup> Phonemes are supported for the locale of the neural voice. For more information, see [SSML phonetic alphabets](../../speech-ssml-phonetic-sets.md) and [Use phonemes to improve pronunciation](../../speech-synthesis-markup.md#use-phonemes-to-improve-pronunciation).
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/language-support.md
+13-13Lines changed: 13 additions & 13 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -25,37 +25,29 @@ Language support varies by Speech service functionality.
25
25
26
26
# [Speech-to-text](#tab/stt)
27
27
28
-
The following table summarizes locale and voice support for Speech-to-text. Additional remarks are included in the [Custom Speech](#custom-speech)sections below.
28
+
The following table summarizes the locales supported for Speech-to-text. Additional remarks are included in the [Custom Speech](#custom-speech)section below.
29
29
30
30
[!INCLUDE [Language support include](includes/language-support/stt.md)]
31
31
32
-
## Custom Speech
32
+
###Custom Speech
33
33
34
34
To improve accuracy, customization is available for some languages and base model versions by uploading audio + human-labeled transcripts, plain text, structured text, and pronunciation. By default, plain text customization is supported for all available base models. To learn more about customization, see [Custom Speech](./custom-speech-overview.md).
35
35
36
36
# [Text-to-speech](#tab/tts)
37
37
38
-
The following table summarizes locale and voice support for Text-to-speech. Please see the table footnotes for more details. Additional remarks are included in the [Custom Neural Voice](#custom-neural-voice), [Prebuilt neural voices](#prebuilt-neural-voices), and [Voice styles and roles](#voice-styles-and-roles) sections below.
38
+
The following table summarizes the locales and voices supported for Text-to-speech. Please see the table footnotes for more details. Additional remarks are included in the [Custom Neural Voice](#custom-neural-voice), [Prebuilt neural voices](#prebuilt-neural-voices), and [Voice styles and roles](#voice-styles-and-roles) sections below.
39
39
40
40
[!INCLUDE [Language support include](includes/language-support/tts.md)]
41
41
42
-
Each prebuilt neural voice supports a specific language and dialect, identified by locale. You can try the demo and hear the voices on [this website](https://azure.microsoft.com/services/cognitive-services/text-to-speech/#features).
43
-
44
42
You can also get a full list of languages and voices supported for each specific region or endpoint through the [voices list API](rest-text-to-speech.md#get-a-list-of-voices). To learn how you can configure and adjust neural voices, such as Speaking Styles, see [Speech Synthesis Markup Language](speech-synthesis-markup.md#adjust-speaking-styles).
45
43
46
44
> [!IMPORTANT]
47
45
> Pricing varies for Prebuilt Neural Voice (referred to as *Neural* on the pricing page) and Custom Neural Voice (referred to as *Custom Neural* on the pricing page). For more information, see the [Pricing](https://azure.microsoft.com/pricing/details/cognitive-services/speech-services/) page.
48
46
49
-
### Custom Neural Voice
50
-
51
-
Custom Neural Voice lets you create synthetic voices that are rich in speaking styles. You can create a unique brand voice in multiple languages and styles by using a small set of recording data. There are two Custom Neural Voice (CNV) project types: CNV Pro and CNV Lite (preview).
52
-
53
-
Select the right locale that matches your training data to train a custom neural voice model. For example, if the recording data is spoken in English with a British accent, select `en-GB`.
54
-
55
-
With the cross-lingual feature (preview), you can transfer your custom neural voice model to speak a second language. For example, with the `zh-CN` data, you can create a voice that speaks `en-AU` or any of the languages with Cross-lingual support.
56
-
57
47
### Prebuilt neural voices
58
48
49
+
Each prebuilt neural voice supports a specific language and dialect, identified by locale. You can try the demo and hear the voices on [this website](https://azure.microsoft.com/services/cognitive-services/text-to-speech/#features).
50
+
59
51
Prebuilt neural voices are created from samples that use a 24-khz sample rate. All voices can upsample or downsample to other sample rates when synthesizing.
60
52
61
53
Please note that the following neural voices are retired.
@@ -73,6 +65,14 @@ Use the following table to determine supported styles and roles for each neural
73
65
74
66
[!INCLUDE [Language support include](includes/language-support/voice-styles-and-roles.md)]
75
67
68
+
### Custom Neural Voice
69
+
70
+
Custom Neural Voice lets you create synthetic voices that are rich in speaking styles. You can create a unique brand voice in multiple languages and styles by using a small set of recording data. There are two Custom Neural Voice (CNV) project types: CNV Pro and CNV Lite (preview).
71
+
72
+
Select the right locale that matches your training data to train a custom neural voice model. For example, if the recording data is spoken in English with a British accent, select `en-GB`.
73
+
74
+
With the cross-lingual feature (preview), you can transfer your custom neural voice model to speak a second language. For example, with the `zh-CN` data, you can create a voice that speaks `en-AU` or any of the languages with Cross-lingual support.
0 commit comments