Skip to content

Commit 3ad2557

Browse files
committed
additional scenarios
1 parent 139eab5 commit 3ad2557

File tree

3 files changed

+38
-10
lines changed

3 files changed

+38
-10
lines changed

articles/cognitive-services/Speech-Service/includes/quickstarts/speech-to-text-basics/intro.md

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,9 @@ ms.date: 02/12/2022
66
ms.author: eur
77
---
88

9-
In this quickstart, you run an application to recognize and transcribe human speech (often called speech-to-text).
9+
In this quickstart, you create and run an application to recognize and transcribe speech to text in real-time.
1010

1111
> [!TIP]
12-
> You can try speech-to-text in [Speech Studio](https://aka.ms/speechstudio/speechtotexttool) without signing up or writing any code.
12+
> You can try real-time speech-to-text in [Speech Studio](https://aka.ms/speechstudio/speechtotexttool) without signing up or writing any code.
13+
14+
To instead transcribe audio files asynchronously, see [Batch transcription](batch-transcription.md). If you're not sure which speech-to-text solution is right for you, see the [speech to text overview](speech-to-text.md) documentation.

articles/cognitive-services/Speech-Service/overview.md

Lines changed: 19 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -41,15 +41,31 @@ Speech feature summaries are provided below with links for more information.
4141

4242
### Speech-to-text
4343

44-
Use [speech-to-text](speech-to-text.md) to transcribe audio into text, either in real time or asynchronously.
44+
Use [speech-to-text](speech-to-text.md) to transcribe audio into text, either in [real time](#real-time-speech-to-text) or asynchronously with [batch transcription](#batch-transcription).
4545

4646
> [!TIP]
47-
> You can try speech-to-text in [Speech Studio](https://aka.ms/speechstudio/speechtotexttool) without signing up or writing any code.
47+
> You can try real-time speech-to-text in [Speech Studio](https://aka.ms/speechstudio/speechtotexttool) without signing up or writing any code.
4848
49-
Convert audio to text from a range of sources, including microphones, audio files, and blob storage. Use speaker diarisation to determine who said what and when. Get readable transcripts with automatic formatting and punctuation.
49+
Convert audio to text from a range of sources, including microphones, audio files, and blob storage. Use speaker diarization to determine who said what and when. Get readable transcripts with automatic formatting and punctuation.
5050

5151
The base model may not be sufficient if the audio contains ambient noise or includes a lot of industry and domain-specific jargon. In these cases, you can create and train [custom speech models](custom-speech-overview.md) with acoustic, language, and pronunciation data. Custom speech models are private and can offer a competitive advantage.
5252

53+
### Real-time speech-to-text
54+
55+
With [real-time speech-to-text](get-started-speech-to-text.md), the audio is transcribed as speech is recognized from a microphone or file. Use real-time speech-to-text for applications that need to transcribe audio in real-time such as:
56+
- Transcriptions, captions, or subtitles for live meetings
57+
- Contact center agent assist
58+
- Dictation
59+
- Voice agents
60+
- Pronunciation assessment
61+
62+
### Batch transcription
63+
64+
[Batch transcription](batch-transcription) is used to transcribe a large amount of audio in storage. You can point to audio files with a shared access signature (SAS) URI and asynchronously receive transcription results. Use batch transcription for applications that need to transcribe audio in bulk such as:
65+
- Transcriptions, captions, or subtitles for pre-recorded audio
66+
- Contact center post-call analytics
67+
- Diarization
68+
5369
### Text-to-speech
5470

5571
With [text to speech](text-to-speech.md), you can convert input text into humanlike synthesized speech. Use neural voices, which are humanlike voices powered by deep neural networks. Use the [Speech Synthesis Markup Language (SSML)](speech-synthesis-markup.md) to fine-tune the pitch, pronunciation, speaking rate, volume, and more.

articles/cognitive-services/Speech-Service/speech-to-text.md

Lines changed: 15 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -21,17 +21,27 @@ In this overview, you learn about the benefits and capabilities of the speech-to
2121
> [!NOTE]
2222
> To compare pricing of [real-time](#real-time-speech-to-text) to [batch transcription](#batch-transcription), see [Speech service pricing](https://azure.microsoft.com/pricing/details/cognitive-services/speech-services/).
2323
24-
Speech-to-text is available via the [Speech SDK](speech-sdk.md), the [REST API](rest-speech-to-text.md), and the [Speech CLI](spx-overview.md). For a full list of available speech-to-text languages, see [Language and voice support](language-support.md?tabs=stt).
24+
For a full list of available speech-to-text languages, see [Language and voice support](language-support.md?tabs=stt).
2525

2626
## Real-time speech-to-text
2727

28-
With real-time speech-to-text, the audio is transcribed as speech is recognized from a microphone or file. Use real-time speech-to-text for:
29-
- Applications that need to transcribe audio in real-time, such as a call center or a video conference.
30-
- Keyword spotting to detect a specific word or phrase. For example, you can detect the phrase "hey copilot" in the audio and then take action. For more information, see [keyword recognition](keyword-recognition-overview.md).
28+
With real-time speech-to-text, the audio is transcribed as speech is recognized from a microphone or file. Use real-time speech-to-text for applications that need to transcribe audio in real-time such as:
29+
- Transcriptions, captions, or subtitles for live meetings
30+
- Contact center agent assist
31+
- Dictation
32+
- Voice agents
33+
- Pronunciation assessment
34+
35+
Real-time speech to text is available via the [Speech SDK](speech-sdk.md) and the [Speech CLI](spx-overview.md).
3136

3237
## Batch transcription
3338

34-
Batch transcription is used to transcribe a large amount of audio in storage. You can point to audio files with a shared access signature (SAS) URI and asynchronously receive transcription results. Batch transcription is available via:
39+
Batch transcription is used to transcribe a large amount of audio in storage. You can point to audio files with a shared access signature (SAS) URI and asynchronously receive transcription results. Use batch transcription for applications that need to transcribe audio in bulk such as:
40+
- Transcriptions, captions, or subtitles for pre-recorded audio
41+
- Contact center post-call analytics
42+
- Diarization
43+
44+
Batch transcription is available via:
3545
- [Speech-to-text REST API](rest-speech-to-text.md): To get started, see [How to use batch transcription](batch-transcription.md) and [Batch transcription samples (REST)](https://github.com/Azure-Samples/cognitive-services-speech-sdk/tree/master/samples/batch).
3646
- The [Speech CLI](spx-overview.md) supports both real-time and batch transcription. For Speech CLI help with batch transcriptions, run the following command:
3747
```azurecli-interactive

0 commit comments

Comments
 (0)