Skip to content

Commit 4c61ab9

Browse files
committed
resolve links and acrolinx issues
1 parent 3b3f8bf commit 4c61ab9

File tree

1 file changed

+21
-17
lines changed

1 file changed

+21
-17
lines changed

articles/cognitive-services/Speech-Service/gaming-concepts.md

Lines changed: 21 additions & 17 deletions
Original file line numberDiff line numberDiff line change
@@ -8,56 +8,60 @@ manager: nitinme
88
ms.service: cognitive-services
99
ms.subservice: speech-service
1010
ms.topic: conceptual
11-
ms.date: 01/11/2023
11+
ms.date: 01/20/2023
1212
ms.author: eur
1313
---
1414

1515
# Game development with Azure Cognitive Service for Speech
1616

17-
Azure Cognitive Services for Speech can be used to improve a variety of gaming scenarios, both in- and out-of-game.
17+
Azure Cognitive Services for Speech can be used to improve various gaming scenarios, both in- and out-of-game.
1818

1919
Here are a few Speech features to consider for flexible and interactive game experiences:
2020

2121
- Bring everyone into the conversation by synthesizing audio from text. Or by displaying text from audio.
22-
- Make the game more accessible for players who are unable to read text in a particular language, including young players who haven't learnt to read and write. Players can listen to storylines and instructions in their preferred language.
22+
- Make the game more accessible for players who are unable to read text in a particular language, including young players who haven't learned to read and write. Players can listen to storylines and instructions in their preferred language.
2323
- Create game avatars and non-playable characters (NPC) that can initiate or participate in a conversation in-game.
2424
- Prebuilt neural voice can provide highly natural out-of-box voices with leading voice variety in terms of a large portfolio of languages and voices.
25-
- Custom neural voice for creating a voice that stays on-brand with consistent quality and speaking style. You can add emotions, accents, nuances, laughter, and other paralinguistic sounds and expressions.
25+
- Custom neural voice for creating a voice that stays on-brand with consistent quality and speaking style. You can add emotions, accents, nuances, laughter, and other para linguistic sounds and expressions.
2626
- Use game dialogue prototyping to shorten the amount of time and money spent in product to get the game to market sooner. You can rapidly swap lines of dialog and listen to variations in real-time to iterate the game content.
2727

28-
You can use the [Speech SDK](speech-sdk.md) or [Speech CLI](spx-overview.md) for speech-to-text, text-to-speech, language identification, and speech translation. You can also use the [Batch transcription API](batch-transcription.md) to transcribe pre-recorded speech to text. To synthesize a large volume of text input (long and short) to speech, use the [Batch synthesis API](batch-synthesis.md).
28+
You can use the [Speech SDK](speech-sdk.md) or [Speech CLI](spx-overview.md) for real-time low latency speech-to-text, text-to-speech, language identification, and speech translation. You can also use the [Batch transcription API](batch-transcription.md) to transcribe pre-recorded speech to text. To synthesize a large volume of text input (long and short) to speech, use the [Batch synthesis API](batch-synthesis.md).
29+
30+
For information about locale and regional availability, see [Language and voice support](language-support.md) and [Region support](regions.md).
2931

3032
## Text-to-speech
3133

32-
Help bring everyone into the conversation by converting text messages to audio using Text-to-Speech for scenarios, such as game dialogue prototyping, greater accessibility, or non-playable character (NPC) voices. Text-to-Speech includes prebuilt neural voice and custom neural voice features. Prebuilt neural voice can provide highly natural out-of-box voices with leading voice variety in terms of a large portfolio of languages and voices. Custom neural voice is an easy-to-use self-service for creating a highly natural custom voice. For details on the two features and voice samples, see Text-to-Speech overview.
34+
Help bring everyone into the conversation by converting text messages to audio using [Text-to-Speech](text-to-speech.md) for scenarios, such as game dialogue prototyping, greater accessibility, or non-playable character (NPC) voices. Text-to-Speech includes [prebuilt neural voice](language-support.md?tabs=tts#prebuilt-neural-voices) and [custom neural voice](language-support.md?tabs=tts#custom-neural-voices) features. Prebuilt neural voice can provide highly natural out-of-box voices with leading voice variety in terms of a large portfolio of languages and voices. Custom neural voice is an easy-to-use self-service for creating a highly natural custom voice.
3335

3436
When enabling this functionality in your game, keep in mind the following benefits:
3537

36-
- Voices and languages supported - A large portfolio of languages and variants are supported. For a complete list of voices, see the prebuilt neural voice and custom neural voice. You can also specify multiple languages for Text-to-Speech output. For details, see how to adjust speaking languages. For custom neural voice, you can select to create different languages from single language training data.
37-
- Emotional styles supported - Emotional tones, such as cheerful, angry, sad, excited, hopeful, friendly, unfriendly, terrified, shouting, and whispering. You can adjust the speaking style, style degree, and role at the sentence level.
38-
- Low latency real-time speech synthesis supported - Use the Speech SDK to convert text to speech by using prebuilt neural voices or custom neural voices.
39-
- Visemes supported - You can use visemes during real-time synthesizing to control the movement of 2D and 3D avatar models, so that the mouth movements are perfectly matched to synthetic speech.
40-
Fine-tuning Text-to-Speech output with Speech Synthesis Markup Language (SSML) - With SSML, you can customize Text-to-Speech outputs, with richer voice tuning supports. For more details, see how to improve synthesis with SSML.
41-
- Audio outputs - There is a list of supported audio formats. Each incorporates a bitrate and encoding type. Each prebuilt neural voice model is available at 24kHz and high-fidelity 48kHz. If you select 48kHz output format, the high-fidelity voice model with 48kHz will be invoked accordingly. The sample rates other than 24kHz and 48kHz can be obtained through upsampling or downsampling when synthesizing, for example, 44.1kHz is downsampled from 48kHz. For more tech details on 48kHz high-quality voices, see this introduction blog.
42-
Regions supported - For information about regional availability, see the regions topic.
38+
- Voices and languages supported - A large portfolio of [locales and voices](language-support.md?tabs=tts#supported-languages) are supported. You can also [specify multiple languages](speech-synthesis-markup-voice.md#adjust-speaking-languages) for Text-to-Speech output. For [custom neural voice](custom-neural-voice.md), you can [choose to create](how-to-custom-voice-create-voice.md?tabs=neural#choose-a-training-method) different languages from single language training data.
39+
- Emotional styles supported - [Emotional tones](language-support.md?tabs=tts#voice-styles-and-roles), such as cheerful, angry, sad, excited, hopeful, friendly, unfriendly, terrified, shouting, and whispering. You can [adjust the speaking style](speech-synthesis-markup-voice.md#speaking-styles-and-roles), style degree, and role at the sentence level.
40+
- Visemes supported - You can use visemes during real-time synthesizing to control the movement of 2D and 3D avatar models, so that the mouth movements are perfectly matched to synthetic speech. For more information, see [Get facial position with viseme](how-to-speech-synthesis-viseme.md).
41+
- Fine-tuning Text-to-Speech output with Speech Synthesis Markup Language (SSML) - With SSML, you can customize Text-to-Speech outputs, with richer voice tuning supports. For more information, see [Speech Synthesis Markup Language (SSML) overview](speech-synthesis-markup.md).
42+
- Audio outputs - Each prebuilt neural voice model is available at 24 kHz and high-fidelity 48 kHz. If you select 48-kHz output format, the high-fidelity voice model with 48 kHz will be invoked accordingly. The sample rates other than 24 kHz and 48 kHz can be obtained through upsampling or downsampling when synthesizing. For example, 44.1 kHz is downsampled from 48 kHz. Each audio format incorporates a bitrate and encoding type. For more information, see the [supported audio formats](rest-text-to-speech.md?tabs=streaming#audio-outputs). For more information on 48-kHz high-quality voices, see [this introduction blog](https://techcommunity.microsoft.com/t5/ai-cognitive-services-blog/azure-neural-tts-voices-upgraded-to-48khz-with-hifinet2-vocoder/ba-p/3665252).
43+
44+
For an example, see the [Text-to-speech quickstart](get-started-text-to-speech.md).
4345

4446
## Speech-to-text
4547

46-
You can use speech-to-text to display text from the spoken audio in your game.
48+
You can use [speech-to-text](speech-to-text.md) to display text from the spoken audio in your game. For an example, see the [Speech-to-text quickstart](get-started-speech-to-text.md).
4749

4850
## Language identification
4951

50-
Language identification can detect the language of the chat string submitted by the player.
52+
With [language identification](language-identification.md), you can detect the language of the chat string submitted by the player.
5153

5254
## Speech translation
5355

54-
It's not unusual that players in the same game session natively speak different languages and may appreciate receiving both the original message and its translation.
56+
It's not unusual that players in the same game session natively speak different languages and may appreciate receiving both the original message and its translation. You can use [speech translation](speech-translation.md) to translate text between languages so players across the world can communicate with each other in their native language.
5557

58+
For an example, see the [Speech translation quickstart](get-started-speech-translation.md).
5659

60+
> [!NOTE]
61+
> Besides the Speech service, you can also use the [Translator service](/azure/cognitive-services/translator/translator-overview). To execute text translation between supported source and target languages in real time see [Text translation](/azure/cognitive-services/translator/text-translation-overview).
5762
5863
## Next steps
5964

6065
* [Text-to-speech quickstart](get-started-text-to-speech.md)
61-
* [Get facial position with viseme](how-to-speech-synthesis-viseme.md)
6266
* [Speech-to-text quickstart](get-started-speech-to-text.md)
6367
* [Speech translation quickstart](get-started-speech-translation.md)

0 commit comments

Comments
 (0)