You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/includes/quickstarts/speech-to-text-basics/intro.md
+4-2Lines changed: 4 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,7 +6,9 @@ ms.date: 02/12/2022
6
6
ms.author: eur
7
7
---
8
8
9
-
In this quickstart, you run an application to recognize and transcribe human speech (often called speech-to-text).
9
+
In this quickstart, you create and run an application to recognize and transcribe speech to text in real-time.
10
10
11
11
> [!TIP]
12
-
> You can try speech-to-text in [Speech Studio](https://aka.ms/speechstudio/speechtotexttool) without signing up or writing any code.
12
+
> You can try real-time speech-to-text in [Speech Studio](https://aka.ms/speechstudio/speechtotexttool) without signing up or writing any code.
13
+
14
+
To instead transcribe audio files asynchronously, see [Batch transcription](batch-transcription.md). If you're not sure which speech-to-text solution is right for you, see the [speech to text overview](speech-to-text.md) documentation.
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/overview.md
+19-3Lines changed: 19 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -41,15 +41,31 @@ Speech feature summaries are provided below with links for more information.
41
41
42
42
### Speech-to-text
43
43
44
-
Use [speech-to-text](speech-to-text.md) to transcribe audio into text, either in real time or asynchronously.
44
+
Use [speech-to-text](speech-to-text.md) to transcribe audio into text, either in [real time](#real-time-speech-to-text) or asynchronously with [batch transcription](#batch-transcription).
45
45
46
46
> [!TIP]
47
-
> You can try speech-to-text in [Speech Studio](https://aka.ms/speechstudio/speechtotexttool) without signing up or writing any code.
47
+
> You can try real-time speech-to-text in [Speech Studio](https://aka.ms/speechstudio/speechtotexttool) without signing up or writing any code.
48
48
49
-
Convert audio to text from a range of sources, including microphones, audio files, and blob storage. Use speaker diarisation to determine who said what and when. Get readable transcripts with automatic formatting and punctuation.
49
+
Convert audio to text from a range of sources, including microphones, audio files, and blob storage. Use speaker diarization to determine who said what and when. Get readable transcripts with automatic formatting and punctuation.
50
50
51
51
The base model may not be sufficient if the audio contains ambient noise or includes a lot of industry and domain-specific jargon. In these cases, you can create and train [custom speech models](custom-speech-overview.md) with acoustic, language, and pronunciation data. Custom speech models are private and can offer a competitive advantage.
52
52
53
+
### Real-time speech-to-text
54
+
55
+
With [real-time speech-to-text](get-started-speech-to-text.md), the audio is transcribed as speech is recognized from a microphone or file. Use real-time speech-to-text for applications that need to transcribe audio in real-time such as:
56
+
- Transcriptions, captions, or subtitles for live meetings
57
+
- Contact center agent assist
58
+
- Dictation
59
+
- Voice agents
60
+
- Pronunciation assessment
61
+
62
+
### Batch transcription
63
+
64
+
[Batch transcription](batch-transcription) is used to transcribe a large amount of audio in storage. You can point to audio files with a shared access signature (SAS) URI and asynchronously receive transcription results. Use batch transcription for applications that need to transcribe audio in bulk such as:
65
+
- Transcriptions, captions, or subtitles for pre-recorded audio
66
+
- Contact center post-call analytics
67
+
- Diarization
68
+
53
69
### Text-to-speech
54
70
55
71
With [text to speech](text-to-speech.md), you can convert input text into humanlike synthesized speech. Use neural voices, which are humanlike voices powered by deep neural networks. Use the [Speech Synthesis Markup Language (SSML)](speech-synthesis-markup.md) to fine-tune the pitch, pronunciation, speaking rate, volume, and more.
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/speech-to-text.md
+15-5Lines changed: 15 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -21,17 +21,27 @@ In this overview, you learn about the benefits and capabilities of the speech-to
21
21
> [!NOTE]
22
22
> To compare pricing of [real-time](#real-time-speech-to-text) to [batch transcription](#batch-transcription), see [Speech service pricing](https://azure.microsoft.com/pricing/details/cognitive-services/speech-services/).
23
23
24
-
Speech-to-text is available via the [Speech SDK](speech-sdk.md), the [REST API](rest-speech-to-text.md), and the [Speech CLI](spx-overview.md). For a full list of available speech-to-text languages, see [Language and voice support](language-support.md?tabs=stt).
24
+
For a full list of available speech-to-text languages, see [Language and voice support](language-support.md?tabs=stt).
25
25
26
26
## Real-time speech-to-text
27
27
28
-
With real-time speech-to-text, the audio is transcribed as speech is recognized from a microphone or file. Use real-time speech-to-text for:
29
-
- Applications that need to transcribe audio in real-time, such as a call center or a video conference.
30
-
- Keyword spotting to detect a specific word or phrase. For example, you can detect the phrase "hey copilot" in the audio and then take action. For more information, see [keyword recognition](keyword-recognition-overview.md).
28
+
With real-time speech-to-text, the audio is transcribed as speech is recognized from a microphone or file. Use real-time speech-to-text for applications that need to transcribe audio in real-time such as:
29
+
- Transcriptions, captions, or subtitles for live meetings
30
+
- Contact center agent assist
31
+
- Dictation
32
+
- Voice agents
33
+
- Pronunciation assessment
34
+
35
+
Real-time speech to text is available via the [Speech SDK](speech-sdk.md) and the [Speech CLI](spx-overview.md).
31
36
32
37
## Batch transcription
33
38
34
-
Batch transcription is used to transcribe a large amount of audio in storage. You can point to audio files with a shared access signature (SAS) URI and asynchronously receive transcription results. Batch transcription is available via:
39
+
Batch transcription is used to transcribe a large amount of audio in storage. You can point to audio files with a shared access signature (SAS) URI and asynchronously receive transcription results. Use batch transcription for applications that need to transcribe audio in bulk such as:
40
+
- Transcriptions, captions, or subtitles for pre-recorded audio
41
+
- Contact center post-call analytics
42
+
- Diarization
43
+
44
+
Batch transcription is available via:
35
45
-[Speech-to-text REST API](rest-speech-to-text.md): To get started, see [How to use batch transcription](batch-transcription.md) and [Batch transcription samples (REST)](https://github.com/Azure-Samples/cognitive-services-speech-sdk/tree/master/samples/batch).
36
46
- The [Speech CLI](spx-overview.md) supports both real-time and batch transcription. For Speech CLI help with batch transcriptions, run the following command:
0 commit comments