You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The Speech service is the unification of speech-to-text, text-to-speech, and speech-translation into a single Azure subscription. It's easy to speech enable your applications, tools, and devices with the [Speech SDK](speech-sdk-reference.md), [Speech Devices SDK](https://aka.ms/sdsdk-quickstart), or [REST APIs](rest-apis.md).
20
18
21
19
> [!IMPORTANT]
@@ -25,8 +23,8 @@ These features make up the Speech service. Use the links in this table to learn
25
23
26
24
| Service | Feature | Description | SDK | REST |
27
25
|---------|---------|-------------|-----|------|
28
-
|[Speech-to-Text](speech-to-text.md)| Speech-to-text | Speech-to-text transcribes audio streams to text in real time that your applications, tools, or devices can consume or display. Use speech-to-text with [Language Understanding (LUIS)](https://docs.microsoft.com/azure/cognitive-services/luis/) to derive user intents from transcribed speech and act on voice commands. |[Yes](https://docs.microsoft.com/azure/cognitive-services/speech-service/speech-sdk-reference)|[Yes](https://docs.microsoft.com/azure/cognitive-services/speech-service/rest-apis)|
29
-
||[Batch Transcription](batch-transcription.md)| Batch transcription enables asynchronous speech-to-text transcription of large volumes of data. This is a REST-based service, which uses same endpoint as customization and model management. | No |[Yes](https://westus.cris.ai/swagger/ui/index)|
26
+
|[Realtime Speech-to-Text](speech-to-text.md)| Speech-to-text | Speech-to-text transcribes or translates audio streams or local files to text in realtime that your applications, tools, or devices can consume or display. Use speech-to-text with [Language Understanding (LUIS)](https://docs.microsoft.com/azure/cognitive-services/luis/) to derive user intents from transcribed speech and act on voice commands. |[Yes](https://docs.microsoft.com/azure/cognitive-services/speech-service/speech-sdk-reference)|[Yes](https://docs.microsoft.com/azure/cognitive-services/speech-service/rest-apis)|
27
+
||[Batch Speech-to-Text](batch-transcription.md)| Batch Speech-to-text enables asynchronous speech-to-text transcription of large volumes of speech audio data in files where the files are stored in Azure Blob Storage. In addition to the converting speech audio to text, Batch Speech-to-text also allows for diarization and sentiment-analysis. This is a REST-based service. | No |[Yes](https://westus.cris.ai/swagger/ui/index)|
30
28
||[Multi-device Conversation](multi-device-conversation.md)| Connect multiple devices or clients in a conversation to send speech- or text-based messages, with easy support for transcription and translation| Yes | No |
31
29
||[Conversation Transcription](conversation-transcription-service.md)| Enables real-time speech recognition, speaker identification, and diarization. It's perfect for transcribing in-person meetings with the ability to distinguish speakers. | Yes | No |
32
30
||[Create Custom Speech Models](#customize-your-speech-experience)| If you are using speech-to-text for recognition and transcription in a unique environment, you can create and train custom acoustic, language, and pronunciation models to address ambient noise or industry-specific vocabulary. | No |[Yes](https://westus.cris.ai/swagger/ui/index)|
@@ -35,6 +33,9 @@ These features make up the Speech service. Use the links in this table to learn
35
33
|[Speech Translation](speech-translation.md)| Speech translation | Speech translation enables real-time, multi-language translation of speech to your applications, tools, and devices. Use this service for speech-to-speech and speech-to-text translation. |[Yes](https://docs.microsoft.com/azure/cognitive-services/speech-service/speech-sdk-reference)| No |
36
34
|[Voice assistants](voice-assistants.md)| Voice assistants | Voice assistants using the Speech service empower developers to create natural, human-like conversational interfaces for their applications and experiences. The voice assistant service provides fast, reliable interaction between a device and an assistant implementation that uses the Bot Framework's Direct Line Speech channel or the integrated Custom Commands (Preview) service for task completion. |[Yes](voice-assistants.md)| No |
We offer quickstarts in most popular programming languages, each designed to have you running code in less than 10 minutes. This table contains the most popular quickstarts for each feature. Use the left-hand navigation to explore additional languages and platforms.
0 commit comments