Skip to content

Commit 3847667

Browse files
committed
consolidate tables draft
1 parent 275fbd1 commit 3847667

11 files changed

+205
-1071
lines changed

articles/cognitive-services/Speech-Service/captioning-concepts.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -225,7 +225,7 @@ Profanity filter is applied to the result `Text` and `MaskedNormalizedForm` prop
225225

226226
## Language identification
227227

228-
If the language in the audio could change, use continuous [language identification](language-identification.md). Language identification is used to identify languages spoken in audio when compared against a list of [supported languages](language-support.md#language-identification). You provide up to 10 candidate languages, at least one of which is expected be in the audio. The Speech service returns the most likely language in the audio.
228+
If the language in the audio could change, use continuous [language identification](language-identification.md). Language identification is used to identify languages spoken in audio when compared against a list of [supported languages](language-support.md#speech-to-text-and-text-to-speech). You provide up to 10 candidate languages, at least one of which is expected be in the audio. The Speech service returns the most likely language in the audio.
229229

230230
## Customizations to improve accuracy
231231

articles/cognitive-services/Speech-Service/custom-neural-voice.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@ ms.author: eur
1616

1717
Custom Neural Voice is a text-to-speech feature that lets you create a one-of-a-kind, customized, synthetic voice for your applications. With Custom Neural Voice, you can build a highly natural-sounding voice by providing your audio samples as training data. If you're looking for ready-to-use options, check out our [text-to-speech](text-to-speech.md) service.
1818

19-
Based on the neural text-to-speech technology and the multilingual, multi-speaker, universal model, Custom Neural Voice lets you create synthetic voices that are rich in speaking styles, or adaptable cross languages. The realistic and natural sounding voice of Custom Neural Voice can represent brands, personify machines, and allow users to interact with applications conversationally. See the [supported languages](language-support.md#custom-neural-voice) for Custom Neural Voice.
19+
Based on the neural text-to-speech technology and the multilingual, multi-speaker, universal model, Custom Neural Voice lets you create synthetic voices that are rich in speaking styles, or adaptable cross languages. The realistic and natural sounding voice of Custom Neural Voice can represent brands, personify machines, and allow users to interact with applications conversationally. See the [supported languages](language-support.md#speech-to-text-and-text-to-speech) for Custom Neural Voice.
2020

2121
> [!IMPORTANT]
2222
> Custom Neural Voice access is limited based on eligibility and usage criteria. Request access on the [intake form](https://aka.ms/customneural).

articles/cognitive-services/Speech-Service/how-to-custom-voice-create-voice.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -168,7 +168,7 @@ After you validate your data files, you can use them to build your Custom Neural
168168

169169
If you want to create a voice in the same language of your training data, select **Neural** method. For the **Neural** method, you can select different versions of the training recipe for your model. The versions vary according to the features supported and model training time. Normally new versions are enhanced ones with bugs fixed and new features supported. The latest version is selected by default.
170170

171-
You can also select **Neural - cross lingual** and **Target language** to create a secondary language for your voice model. Only one target language can be selected for a voice model. You don't need to prepare additional data in the target language for training, but your test script needs to be in the target language. For the languages supported by cross lingual feature, see [supported languages](language-support.md#custom-neural-voice).
171+
You can also select **Neural - cross lingual** and **Target language** to create a secondary language for your voice model. Only one target language can be selected for a voice model. You don't need to prepare additional data in the target language for training, but your test script needs to be in the target language. For the languages supported by cross lingual feature, see [supported languages](language-support.md#speech-to-text-and-text-to-speech).
172172

173173
The same unit price applies to both **Neural** and **Neural - cross lingual**. Check [the pricing details](https://azure.microsoft.com/pricing/details/cognitive-services/speech-services/) for training.
174174

articles/cognitive-services/Speech-Service/how-to-custom-voice.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@ ms.author: eur
1414

1515
# Create a Project
1616

17-
[Custom Neural Voice](https://aka.ms/customvoice) is a set of online tools that you use to create a recognizable, one-of-a-kind voice for your brand. All it takes to get started are a handful of audio files and the associated transcriptions. See if Custom Neural Voice supports your [language](language-support.md#custom-neural-voice) and [region](regions.md#speech-service).
17+
[Custom Neural Voice](https://aka.ms/customvoice) is a set of online tools that you use to create a recognizable, one-of-a-kind voice for your brand. All it takes to get started are a handful of audio files and the associated transcriptions. See if Custom Neural Voice supports your [language](language-support.md#speech-to-text-and-text-to-speech) and [region](regions.md#speech-service).
1818

1919
> [!IMPORTANT]
2020
> Custom Neural Voice Pro can be used to create higher-quality models that are indistinguishable from human recordings. For access you must commit to using it in alignment with our responsible AI principles. Learn more about our [policy on limited access](/legal/cognitive-services/speech-service/custom-neural-voice/limited-access-custom-neural-voice?context=%2fazure%2fcognitive-services%2fspeech-service%2fcontext%2fcontext) and [apply here](https://aka.ms/customneural).
@@ -50,7 +50,7 @@ To create a custom voice project:
5050

5151
## Cross lingual feature
5252

53-
With cross lingual feature (public preview), you can create a different language for your voice model. If the language of your training data is supported by cross lingual feature, you can create a voice that speaks a different language from your training data. For example, with the `zh-CN` training data, you can create a voice that speaks `en-US` or any of the languages supported by cross lingual feature. For details, see [supported languages](language-support.md#custom-neural-voice). You don't need to prepare additional data in the target language for training, but your test script needs to be in the target language.
53+
With cross lingual feature (public preview), you can create a different language for your voice model. If the language of your training data is supported by cross lingual feature, you can create a voice that speaks a different language from your training data. For example, with the `zh-CN` training data, you can create a voice that speaks `en-US` or any of the languages supported by cross lingual feature. For details, see [supported languages](language-support.md#speech-to-text-and-text-to-speech). You don't need to prepare additional data in the target language for training, but your test script needs to be in the target language.
5454

5555
For how to create a different language from your training data, select the training method **Neural-cross lingual** during training. See [how to train your custom neural voice model](how-to-custom-voice-create-voice.md#train-your-custom-neural-voice-model).
5656

articles/cognitive-services/Speech-Service/how-to-speech-synthesis-viseme.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@ zone_pivot_groups: programming-languages-speech-services-nomore-variant
1818
# Get facial position with viseme
1919

2020
> [!NOTE]
21-
> Viseme ID supports neural voices in [all viseme-supported locales](language-support.md#viseme). Scalable Vector Graphics (SVG) only supports neural voices in `en-US` locale, and blend shapes supports neural voices in `en-US` and `zh-CN` locales.
21+
> Viseme ID supports neural voices in [all viseme-supported locales](language-support.md#speech-to-text-and-text-to-speech). Scalable Vector Graphics (SVG) only supports neural voices in `en-US` locale, and blend shapes supports neural voices in `en-US` and `zh-CN` locales.
2222
2323
A *viseme* is the visual description of a phoneme in spoken language. It defines the position of the face and mouth while a person is speaking. Each viseme depicts the key facial poses for a specific set of phonemes.
2424

articles/cognitive-services/Speech-Service/includes/cognitive-services-speech-service-endpoints-text-to-speech.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -47,7 +47,7 @@ Use this table to determine *availability of neural voices* by region or endpoin
4747
| West US 3 | `https://westus3.tts.speech.microsoft.com/cognitiveservices/v1` |
4848

4949
> [!TIP]
50-
> [Voices in preview](../language-support.md#prebuilt-neural-voices-in-preview) are available in only these three regions: East US, West Europe, and Southeast Asia.
50+
> [Voices in preview](../language-support.md#speech-to-text-and-text-to-speech) are available in only these three regions: East US, West Europe, and Southeast Asia.
5151
5252
### Custom neural voices
5353

articles/cognitive-services/Speech-Service/includes/release-notes/release-notes-tts.md

Lines changed: 8 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ ms.author: eur
99

1010
#### Prebuilt Neural TTS Voice
1111

12-
* Added 5 new voices of `zh-CN` Chinese (Mandarin, Simplified) and 1 new voice of `en-US` English (United States) in Public Preview. See [full language and voice list](../../language-support.md#prebuilt-neural-voices).
12+
* Added 5 new voices of `zh-CN` Chinese (Mandarin, Simplified) and 1 new voice of `en-US` English (United States) in Public Preview. See [full language and voice list](../../language-support.md#speech-to-text-and-text-to-speech).
1313

1414
| Language | Locale | Gender | Voice name| Style support|
1515
|---|---|---|---|---|
@@ -70,7 +70,6 @@ ms.author: eur
7070
| English (United Kingdom) | `en-GB` | Female | `en-GB-BellaNeural` | General |
7171
| English (United Kingdom) | `en-GB` | Female | `en-GB-HollieNeural` | General |
7272
| English (United Kingdom) | `en-GB` | Female | `en-GB-MaisieNeural` | General, child voice |
73-
| English (United Kingdom) | `en-GB` | Female | `en-GB-MiaNeural` <sup>Retired on 30 October 2021, see below</sup> | General |
7473
| English (United Kingdom) | `en-GB` | Female | `en-GB-OliviaNeural` | General |
7574
| English (United Kingdom) | `en-GB` | Female | `en-GB-SoniaNeural` | General |
7675
| English (United Kingdom) | `en-GB` | Male | `en-GB-AlfieNeural` | General |
@@ -167,11 +166,11 @@ ms.author: eur
167166

168167
#### Prebuilt Neural TTS Voice
169168

170-
* Released 5 new voices in public preview with multiple styles to enrich the variety in American English. See [full language and voice list](../../language-support.md#prebuilt-neural-voices).
169+
* Released 5 new voices in public preview with multiple styles to enrich the variety in American English. See [full language and voice list](../../language-support.md#speech-to-text-and-text-to-speech).
171170
* Support these new styles `Angry`, `Excited`, `Friendly`, `Hopeful`, `Sad`, `Shouting`, `Unfriendly`, `Terrified` and `Whispering` in public preview for `en-US-AriaNeural`.
172171
* Support these new styles `Angry`, `Cheerful`, `Excited`, `Friendly`, `Hopeful`, `Sad`, `Shouting`, `Unfriendly`, `Terrified` and `Whispering` in public preview for `en-US-GuyNeural`, `en-US-JennyNeural`.
173172
* Support these new styles `Excited`, `Friendly`, `Hopeful`, `Shouting`, `Unfriendly`, `Terrified` and `Whispering` in public preview for `en-US-SaraNeural`. See [voice styles and roles](../../language-support.md#voice-styles-and-roles).
174-
* Released new voices `zh-CN-YunjianNeural`, `zh-CN-YunhaoNeural`, and `zh-CN-YunfengNeural` in public preview. See [full language and voice list](../../language-support.md#prebuilt-neural-voices).
173+
* Released new voices `zh-CN-YunjianNeural`, `zh-CN-YunhaoNeural`, and `zh-CN-YunfengNeural` in public preview. See [full language and voice list](../../language-support.md#speech-to-text-and-text-to-speech).
175174
* Support 2 new styles `sports-commentary`, `sports-commentary-excited` in public preview for `zh-CN-YunjianNeural`. See [voice styles and roles](../../language-support.md#voice-styles-and-roles).
176175
* Support 1 new style `advertisement-upbeat` in public preview for `zh-CN-YunhaoNeural`. See [voice styles and roles](../../language-support.md#voice-styles-and-roles).
177176
* The `cheerful` and `sad` styles for `fr-FR-DeniseNeural` are generally available in all regions.
@@ -184,7 +183,7 @@ ms.author: eur
184183
* Enabled to test your voice model by adding your own test script. Learn more on [how to upload your test script](../../how-to-custom-voice-create-voice.md#test-your-voice-model).
185184
* Enabled to update engine version for your voice model. Learn more on [how to update the model engine version](../../how-to-custom-voice-create-voice.md#update-engine-version-for-your-voice-model).
186185
* Supported more training regions. See [region support](../../regions.md#speech-service).
187-
* Supported 10 locales for Custom Neural Voice Lite (preview). See [language support](../../language-support.md#custom-neural-voice).
186+
* Supported 10 locales for Custom Neural Voice Lite (preview). See [language support](../../language-support.md#speech-to-text-and-text-to-speech).
188187

189188
#### Audio Content Creation tool
190189

@@ -214,7 +213,7 @@ ms.author: eur
214213
#### Custom Neural Voice
215214

216215
* Released Custom Neural Voice Lite in public preview. Learn more about [what is Custom Neural Voice Lite](../../custom-neural-voice.md#custom-neural-voice-project-types).
217-
* Extended language support to 49 locales. See [language support](../../language-support.md#custom-neural-voice).
216+
* Extended language support to 49 locales. See [language support](../../language-support.md#speech-to-text-and-text-to-speech).
218217
* Supported more regions/datacenters. See [region support](../../regions.md#speech-service).
219218

220219
#### Audio Content Creation tool
@@ -250,7 +249,7 @@ Added 10 new languages and variants for Neural text-to-speech:
250249
| Sinhala (Sri Lanka) | `si-LK` | Female | `si-LK-ThiliniNeural` <sup>New</sup> | General |
251250
| Sinhala (Sri Lanka) | `si-LK` | Male | `si-LK-SameeraNeural` <sup>New</sup> | General |
252251

253-
For the full list of available voices, see [Language support](../../language-support.md#prebuilt-neural-voices).
252+
For the full list of available voices, see [Language support](../../language-support.md#speech-to-text-and-text-to-speech).
254253

255254
#### New voices in preview
256255

@@ -295,7 +294,7 @@ Added new voices for en-GB, fr-FR and de-DE in preview:
295294
| German (Germany) | `de-DE` | Male | `de-DE-KlausNeural` <sup>New</sup> | General |
296295
| German (Germany) | `de-DE` | Male | `de-DE-RalfNeural` <sup>New</sup> | General |
297296

298-
For the full list of available voices, see [Language support](../../language-support.md#prebuilt-neural-voices-in-preview).
297+
For the full list of available voices, see [Language support](../../language-support.md#speech-to-text-and-text-to-speech).
299298

300299
#### Pronunciation accuracy
301300

@@ -447,7 +446,7 @@ Visit the [Audio Content Creation tool](https://speech.microsoft.com/audioconten
447446

448447
* **Neural text-to-speech**
449448
* **Extended to support 18 new languages/locales.** They are Bulgarian, Czech, German (Austria), German (Switzerland), Greek, English (Ireland), French (Switzerland), Hebrew, Croatian, Hungarian, Indonesian, Malay, Romanian, Slovak, Slovenian, Tamil, Telugu and Vietnamese.
450-
* **Released 14 new voices to enrich the variety in the existing languages.** See [full language and voice list](../../language-support.md#prebuilt-neural-voices).
449+
* **Released 14 new voices to enrich the variety in the existing languages.** See [full language and voice list](../../language-support.md#speech-to-text-and-text-to-speech).
451450
* **New speaking styles for `en-US` and `zh-CN` voices.** Jenny, the new voice in English (US), supports chatbot, customer service, and assistant styles. 10 new speaking styles are available with our zh-CN voice, XiaoXiao. In addition, the XiaoXiao neural voice supports `StyleDegree` tuning. See [how to use the speaking styles in SSML](../../speech-synthesis-markup.md#adjust-speaking-styles).
452451

453452
* **Containers: Neural text-to-speech Container released in public preview with 16 voices available in 14 languages.** Learn more on [how to deploy Speech Containers for Neural text-to-speech](../../speech-container-howto.md)

articles/cognitive-services/Speech-Service/language-identification.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,7 @@ zone_pivot_groups: programming-languages-speech-services-nomore-variant
1515

1616
# Language identification (preview)
1717

18-
Language identification is used to identify languages spoken in audio when compared against a list of [supported languages](language-support.md#language-identification).
18+
Language identification is used to identify languages spoken in audio when compared against a list of [supported languages](language-support.md#speech-to-text-and-text-to-speech).
1919

2020
Language identification (LID) use cases include:
2121

@@ -92,7 +92,7 @@ SPXAutoDetectSourceLanguageConfiguration* autoDetectSourceLanguageConfig = \
9292
9393
::: zone-end
9494
95-
For more information, see [supported languages](language-support.md#language-identification).
95+
For more information, see [supported languages](language-support.md#speech-to-text-and-text-to-speech).
9696
9797
### At-start and Continuous language identification
9898

0 commit comments

Comments
 (0)