Skip to content

Commit a988042

Browse files
author
v-jerkin
committed
further tweaks
1 parent f7e0820 commit a988042

File tree

1 file changed

+25
-22
lines changed

1 file changed

+25
-22
lines changed

articles/cognitive-services/Speech-Service/overview.md

Lines changed: 25 additions & 22 deletions
Original file line numberDiff line numberDiff line change
@@ -7,24 +7,26 @@ author: v-jerkin
77

88
ms.service: cognitive-services
99
ms.component: speech-service
10-
ms.topic: article
10+
ms.topic: overview
1111
ms.date: 05/07/2018
1212
ms.author: v-jerkin
1313
---
1414
# What is the Speech service?
1515

16-
The Speech service provides a powerful collection of related speech features in the Microsoft Azure cloud. These features were previously available via the [Bing Speech API](https://docs.microsoft.com/azure/cognitive-services/speech/home), [Translator Speech](https://docs.microsoft.com/azure/cognitive-services/translator-speech/), [Custom Speech](https://docs.microsoft.com/azure/cognitive-services/custom-speech-service/cognitive-services-custom-speech-home), and [Custom Voice](http://customvoice.ai/) services. Now, one subscription gets you access to all of these Azure speech features.
17-
18-
To simplify the development of speech-enabled applications, Microsoft created a unified [Speech SDK](speech-sdk.md) for use with the new Speech service. The SDK provides consistent native Speech to Text and Speech Translation APIs for C#, C++, and Java. If you're developing with one of these languages, the Speech SDK makes development easier by handling the network details for you.
19-
20-
Microsoft also offers a [Speech Devices SDK](speech-devices-sdk.md), an integrated hardware and software platform for developers of speech-enabled devices. Our hardware partner provides reference designs and development units, while we provide a device-optimized SDK for the best possible results.
16+
The Speech service unites the Azure speech features previously available via the [Bing Speech API](https://docs.microsoft.com/azure/cognitive-services/speech/home), [Translator Speech](https://docs.microsoft.com/azure/cognitive-services/translator-speech/), [Custom Speech](https://docs.microsoft.com/azure/cognitive-services/custom-speech-service/cognitive-services-custom-speech-home), and [Custom Voice](http://customvoice.ai/) services. Now, one subscription provides access to all of these capabilities.
2117

2218
Like the other Azure speech services, the Speech service is powered by the proven speech technologies used in products like Cortana and Microsoft Office. You can count on the quality of the results and the reliability of the Azure cloud.
2319

2420
> [!NOTE]
2521
> The Speech service is currently in public preview. Return here regularly for documentation updates, new code samples, and more.
2622
27-
## Speech service functions
23+
## Speech SDK and Speech Devices SDK
24+
25+
To simplify the development of speech-enabled applications, Microsoft provides the [Speech SDK](speech-sdk.md) for use with the new Speech service. The Speech SDK provides consistent native Speech to Text and Speech Translation APIs for C#, C++, and Java. If you're developing with one of these languages, the Speech SDK makes development easier by handling the network details for you.
26+
27+
Microsoft also offers a [Speech Devices SDK](speech-devices-sdk.md), an integrated hardware and software platform for developers of speech-enabled devices. Our hardware partner provides reference designs and development units, while we provide a device-optimized SDK that takes full advantage of the hardware's capabilities.
28+
29+
## Primary Speech service functions
2830

2931
The primary functions of the Speech service are Speech to Text (also called speech recognition or transcription), Text to Speech (speech synthesis), and Speech Translation.
3032

@@ -37,7 +39,19 @@ The primary functions of the Speech service are Speech to Text (also called spee
3739
\* *Intent recognition requires a LUIS subscription.*
3840

3941

40-
## Customizing Speech functions
42+
## Using the Speech service
43+
44+
If you're using a supported programming language, the [Speech SDK](speech-sdk.md) makes development easier. The Speech service also has a [REST API](rest-apis.md) works with any programming language, but does not offer all the functions offered by the SDK.
45+
46+
|<br>Method|Speech<br>to Text|Text to<br>Speech|Speech<br>Translation|<br>Description|
47+
|-|-|-|-|-|
48+
|[Speech SDK](speech-sdk.md)|Yes|No|Yes|Native APIs for C#, C++, and Java to simplify development.|
49+
|[REST](rest-apis.md)|Yes|Yes|No|A simple HTTP-based API that makes it easy to add speech to your applications.|
50+
51+
The Speech service also provides WebSockets protocols for streaming Speech to Text and Speech Translation. The Speech SDKs use these protocols. We encourage you to use the Speech SDK rather than trying to implement your own WebSockets communication with the Speech service. However, if you already have code that uses Bing Speech or Translator Speech via WebSockets, it is straightforward to update it to use the Speech service. The WebSockets protocols are compatible; only the endpoints are different.
52+
53+
54+
## Customizing speech features
4155

4256
The Speech service lets you use your own data to train the models underlying the Speech service's Speech to Text and Text to Speech features.
4357

@@ -51,28 +65,17 @@ The Speech service lets you use your own data to train the models underlying the
5165
Once created, your custom models can be used anywhere you'd use the standard models in your app's Speech to Text or Text to Speech functionality.
5266

5367

54-
## Using the Speech service in your applications
55-
56-
There are two ways for applications to use the Speech service. If you're using a supported programming language, the [Speech SDK](speech-sdk.md) makes development easier. The [REST API](rest-apis.md) works with any programming language, but does not offer all the functions offered by the SDK.
57-
58-
|<br>Method|Speech<br>to Text|Text to<br>Speech|Speech<br>Translation|<br>Description|
59-
|-|-|-|-|-|
60-
|[Speech SDK](speech-sdk.md)|Yes|No|Yes|Native APIs for C#, C++, and Java to simplify development.|
61-
|[REST](rest-apis.md)|Yes|Yes|No|A simple HTTP-based API that makes it easy to add speech to your applications.|
62-
63-
The Speech service provides WebSockets protocols for streaming Speech to Text and Speech Translation. The Speech SDKs use these protocols. We encourage you to use the Speech SDK rather than trying to implement your own WebSockets communication with the Speech service. However, if you already have code that uses Bing Speech or Translator Speech via WebSockets, it is straightforward to update it to use the Speech service. The WebSockets protocols are compatible; only the endpoints are different.
64-
6568

6669
## Speech scenarios
6770

68-
A few example use cases for the Speech service are discussed briefly below.
71+
Use cases for the Speech service include:
6972

7073
> [!div class="checklist"]
7174
> * Create voice-triggered apps
7275
> * Transcribe call center recordings
7376
> * Implement voice bots
7477
75-
### Voice-triggered apps
78+
### Voice user interface
7679

7780
Voice input is a great way to make your app flexible, hands-free, and quick to use. In a voice-enabled app, users can just ask for the information they want rather than needing to navigate to it.
7881

@@ -84,7 +87,7 @@ If your app will be used in a noisy environment, such as a factory, you can crea
8487

8588
Getting started is as easy as downloading the [Speech SDK](speech-sdk.md) and following a relevant [Quickstart](quickstart-csharp-dotnet-windows.md) article.
8689

87-
### Transcribe call center recordings
90+
### Call center transcription
8891

8992
Often, call center recordings are only consulted if an issue arises with a call. With the Speech service, it's easy to transcribe every recording to text. Once they're text, you can easily index them for [full-text search](https://docs.microsoft.com/azure/search/search-what-is-azure-search) or apply [Text Analytics](https://docs.microsoft.com/azure/cognitive-services/Text-Analytics/) to detect sentiment, language, and key phrases.
9093

0 commit comments

Comments
 (0)