articles/cognitive-services/Speech-Service/overview.md
Lines changed: 25 additions & 22 deletions
@@ -7,24 +7,26 @@ author: v-jerkin
7
7
8
8
ms.service: cognitive-services
9
9
ms.component: speech-service
10
-
ms.topic: article
10
+
ms.topic: overview
11
11
ms.date: 05/07/2018
12
12
ms.author: v-jerkin
13
13
---
14
14
# What is the Speech service?
15
15
16
-
The Speech service provides a powerful collection of related speech features in the Microsoft Azure cloud. These features were previously available via the [Bing Speech API](https://docs.microsoft.com/azure/cognitive-services/speech/home), [Translator Speech](https://docs.microsoft.com/azure/cognitive-services/translator-speech/), [Custom Speech](https://docs.microsoft.com/azure/cognitive-services/custom-speech-service/cognitive-services-custom-speech-home), and [Custom Voice](http://customvoice.ai/) services. Now, one subscription gets you access to all of these Azure speech features.
17
-
18
-
To simplify the development of speech-enabled applications, Microsoft created a unified [Speech SDK](speech-sdk.md) for use with the new Speech service. The SDK provides consistent native Speech to Text and Speech Translation APIs for C#, C++, and Java. If you're developing with one of these languages, the Speech SDK makes development easier by handling the network details for you.
19
-
20
-
Microsoft also offers a [Speech Devices SDK](speech-devices-sdk.md), an integrated hardware and software platform for developers of speech-enabled devices. Our hardware partner provides reference designs and development units, while we provide a device-optimized SDK for the best possible results.
16
+
The Speech service unites the Azure speech features previously available via the [Bing Speech API](https://docs.microsoft.com/azure/cognitive-services/speech/home), [Translator Speech](https://docs.microsoft.com/azure/cognitive-services/translator-speech/), [Custom Speech](https://docs.microsoft.com/azure/cognitive-services/custom-speech-service/cognitive-services-custom-speech-home), and [Custom Voice](http://customvoice.ai/) services. Now, one subscription provides access to all of these capabilities.
21
17
22
18
Like the other Azure speech services, the Speech service is powered by the proven speech technologies used in products like Cortana and Microsoft Office. You can count on the quality of the results and the reliability of the Azure cloud.
23
19
24
20
> [!NOTE]
25
21
> The Speech service is currently in public preview. Return here regularly for documentation updates, new code samples, and more.
26
22
27
-
## Speech service functions
23
+
## Speech SDK and Speech Devices SDK
24
+
25
+
To simplify the development of speech-enabled applications, Microsoft provides the [Speech SDK](speech-sdk.md) for use with the new Speech service. The Speech SDK provides consistent native Speech to Text and Speech Translation APIs for C#, C++, and Java. If you're developing with one of these languages, the Speech SDK makes development easier by handling the network details for you.
26
+
27
+
Microsoft also offers a [Speech Devices SDK](speech-devices-sdk.md), an integrated hardware and software platform for developers of speech-enabled devices. Our hardware partner provides reference designs and development units, while we provide a device-optimized SDK that takes full advantage of the hardware's capabilities.
28
+
29
+
## Primary Speech service functions
28
30
29
31
The primary functions of the Speech service are Speech to Text (also called speech recognition or transcription), Text to Speech (speech synthesis), and Speech Translation.
30
32
@@ -37,7 +39,19 @@ The primary functions of the Speech service are Speech to Text (also called spee
37
39
\**Intent recognition requires a LUIS subscription.*
38
40
39
41
40
-
## Customizing Speech functions
42
+
## Using the Speech service
43
+
44
+
If you're using a supported programming language, the [Speech SDK](speech-sdk.md) makes development easier. The Speech service also has a [REST API](rest-apis.md) that works with any programming language, but it does not offer all the functions offered by the SDK.
|[Speech SDK](speech-sdk.md)|Yes|No|Yes|Native APIs for C#, C++, and Java to simplify development.|
49
+
|[REST](rest-apis.md)|Yes|Yes|No|A simple HTTP-based API that makes it easy to add speech to your applications.|
50
+
51
+
The Speech service also provides WebSockets protocols for streaming Speech to Text and Speech Translation. The Speech SDKs use these protocols. We encourage you to use the Speech SDK rather than trying to implement your own WebSockets communication with the Speech service. However, if you already have code that uses Bing Speech or Translator Speech via WebSockets, it is straightforward to update it to use the Speech service. The WebSockets protocols are compatible; only the endpoints are different.
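As a rough illustration of the migration described above, the sketch below rewrites a legacy WebSockets URL to a new endpoint while leaving the query string untouched. The endpoint URLs, the `ENDPOINT_MAP` table, and the `migrate_endpoint` helper are all hypothetical placeholders, not real service addresses or SDK functions; substitute the endpoints listed in the service documentation.

```python
from urllib.parse import urlsplit, urlunsplit

# Hypothetical endpoints for illustration only. The real Bing Speech,
# Translator Speech, and Speech service endpoints are listed in the
# service documentation and are NOT the values shown here.
ENDPOINT_MAP = {
    "wss://legacy-speech.example.invalid/recognize":
        "wss://speech-service.example.invalid/v1/recognize",
}


def migrate_endpoint(url, endpoint_map=ENDPOINT_MAP):
    """Return url with its legacy endpoint swapped for the new one.

    Only the scheme, host, and path are replaced; the query string and
    fragment are preserved. URLs with no mapping are returned unchanged.
    """
    parts = urlsplit(url)
    # Rebuild the endpoint without query or fragment to use as the lookup key.
    base = urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))
    new_base = endpoint_map.get(base)
    if new_base is None:
        return url
    new_parts = urlsplit(new_base)
    return urlunsplit(
        (new_parts.scheme, new_parts.netloc, new_parts.path,
         parts.query, parts.fragment)
    )
```

Keeping the mapping in one table means existing WebSockets client code can stay as it is, with only the connection URL passing through the helper.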
52
+
53
+
54
+
## Customizing speech features
41
55
42
56
The Speech service lets you use your own data to train the models underlying its Speech to Text and Text to Speech features.
43
57
@@ -51,28 +65,17 @@ The Speech service lets you use your own data to train the models underlying the
51
65
Once created, your custom models can be used anywhere you'd use the standard models in your app's Speech to Text or Text to Speech functionality.
52
66
53
67
54
-
## Using the Speech service in your applications
55
-
56
-
There are two ways for applications to use the Speech service. If you're using a supported programming language, the [Speech SDK](speech-sdk.md) makes development easier. The [REST API](rest-apis.md) works with any programming language, but does not offer all the functions offered by the SDK.
|[Speech SDK](speech-sdk.md)|Yes|No|Yes|Native APIs for C#, C++, and Java to simplify development.|
61
-
|[REST](rest-apis.md)|Yes|Yes|No|A simple HTTP-based API that makes it easy to add speech to your applications.|
62
-
63
-
The Speech service provides WebSockets protocols for streaming Speech to Text and Speech Translation. The Speech SDKs use these protocols. We encourage you to use the Speech SDK rather than trying to implement your own WebSockets communication with the Speech service. However, if you already have code that uses Bing Speech or Translator Speech via WebSockets, it is straightforward to update it to use the Speech service. The WebSockets protocols are compatible; only the endpoints are different.
64
-
65
68
66
69
## Speech scenarios
67
70
68
-
A few example use cases for the Speech service are discussed briefly below.
71
+
Use cases for the Speech service include:
69
72
70
73
> [!div class="checklist"]
71
74
> * Create voice-triggered apps
72
75
> * Transcribe call center recordings
73
76
> * Implement voice bots
74
77
75
-
### Voice-triggered apps
78
+
### Voice user interface
76
79
77
80
Voice input is a great way to make your app flexible, hands-free, and quick to use. In a voice-enabled app, users can just ask for the information they want rather than needing to navigate to it.
78
81
@@ -84,7 +87,7 @@ If your app will be used in a noisy environment, such as a factory, you can crea
84
87
85
88
Getting started is as easy as downloading the [Speech SDK](speech-sdk.md) and following a relevant [Quickstart](quickstart-csharp-dotnet-windows.md) article.
86
89
87
-
### Transcribe call center recordings
90
+
### Call center transcription
88
91
89
92
Often, call center recordings are only consulted if an issue arises with a call. With the Speech service, it's easy to transcribe every recording to text. Once the recordings are transcribed, you can index the text for [full-text search](https://docs.microsoft.com/azure/search/search-what-is-azure-search) or apply [Text Analytics](https://docs.microsoft.com/azure/cognitive-services/Text-Analytics/) to detect sentiment, language, and key phrases.
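To make the indexing step concrete, here is a minimal in-memory sketch of full-text search over transcripts. It is a stand-in for a real search service, not the Azure Search API; the `build_index` and `search` helpers and the sample call IDs are hypothetical, and the transcripts are assumed to have already been produced by Speech to Text.

```python
import re
from collections import defaultdict
from typing import Dict, Set

# Simple tokenizer: lowercase words, digits, and apostrophes.
_TOKEN = re.compile(r"[a-z0-9']+")


def build_index(transcripts: Dict[str, str]) -> Dict[str, Set[str]]:
    """Map each word to the set of call IDs whose transcript contains it."""
    index: Dict[str, Set[str]] = defaultdict(set)
    for call_id, text in transcripts.items():
        for word in _TOKEN.findall(text.lower()):
            index[word].add(call_id)
    return index


def search(index: Dict[str, Set[str]], query: str) -> Set[str]:
    """Return the call IDs whose transcripts contain every word in query."""
    words = _TOKEN.findall(query.lower())
    if not words:
        return set()
    # Start from a copy so the index itself is never mutated.
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


# Hypothetical transcripts keyed by call ID.
transcripts = {
    "call-1": "I want a refund for my order",
    "call-2": "My order arrived late",
}
index = build_index(transcripts)
```

With this index, `search(index, "refund order")` matches only calls that mention both words. A production system would hand the transcripts to a dedicated search service instead of keeping them in memory.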