Skip to content

Commit a24e58b

Browse files
authored
Merge pull request #107884 from IEvangelist/chrisUpdates
Updates based on collaboration with Chris
2 parents a382244 + aed4032 commit a24e58b

File tree

8 files changed

+51
-42
lines changed

8 files changed

+51
-42
lines changed

articles/cognitive-services/Speech-Service/batch-transcription.md

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ manager: nitinme
88
ms.service: cognitive-services
99
ms.subservice: speech-service
1010
ms.topic: conceptual
11-
ms.date: 12/17/2019
11+
ms.date: 03/16/2020
1212
ms.author: panosper
1313
---
1414

@@ -291,6 +291,9 @@ The transcription service can handle large number of submitted transcriptions. Y
291291

292292
Complete samples are available in the [GitHub sample repository](https://aka.ms/csspeech/samples) inside the `samples/batch` subdirectory.
293293

294+
> [!NOTE]
295+
> Batch transcription functionality is exposed via the REST API described above. Thus Batch transcription can be used from nearly any programming language or environment that supports REST. The examples below and samples in GitHub are merely representative and **do not** connote limits on where the API can be used.
296+
294297
You have to customize the sample code with your subscription information, the service region, the SAS URI pointing to the audio file to transcribe, and model IDs in case you want to use a custom acoustic or language model.
295298

296299
[!code-csharp[Configuration variables for batch transcription](~/samples-cognitive-services-speech-sdk/samples/batch/csharp/program.cs#batchdefinition)]

articles/cognitive-services/Speech-Service/conversation-transcription.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,18 +1,18 @@
11
---
22
title: Conversation Transcription (Preview) - Speech service
33
titleSuffix: Azure Cognitive Services
4-
description: Conversation Transcription is a speech-to-text solution that combines speech recognition, speaker identification, and sentence attribution to each speaker (also known as diarization) to provide real-time and/or asynchronous transcription of any conversation.
4+
description: Conversation Transcription is a solution for meetings, that combines recognition, speaker ID, and diarization to provide transcription of any conversation.
55
services: cognitive-services
66
author: IEvangelist
77
manager: nitinme
88
ms.service: cognitive-services
99
ms.subservice: speech-service
1010
ms.topic: conceptual
11-
ms.date: 03/11/2020
11+
ms.date: 03/16/2020
1212
ms.author: dapine
1313
---
1414

15-
# What is Conversation Transcription (Preview)?
15+
# What is Conversation Transcription in meetings (Preview)?
1616

1717
Conversation Transcription is a [speech-to-text](speech-to-text.md) solution that combines speech recognition, speaker identification, and sentence attribution to each speaker (also known as _diarization_) to provide real-time and/or asynchronous transcription of any conversation. Conversation Transcription distinguishes speakers in a conversation to determine who said what and when, and makes it easy for developers to add speech-to-text to their applications that perform multi-speaker diarization.
1818

articles/cognitive-services/Speech-Service/includes/audio-input-format-chart.md

Lines changed: 1 addition & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -6,16 +6,10 @@ author: IEvangelist
66
manager: nitinme
77
ms.service: cognitive-services
88
ms.topic: include
9-
ms.date: 03/10/2020
9+
ms.date: 03/16/2020
1010
ms.author: dapine
1111
---
1212

1313
## Supported audio input format
1414

15-
The Speech SDK uses the following format for audio input.
16-
17-
| Format | Codec | Bitrate | Sample Rate | Channels |
18-
|--------|-------|---------|-----------------|----------|
19-
| WAV | PCM | 16-bit | 8 kHz or 16 kHz | 1 (mono) |
20-
2115
[!INCLUDE [supported-audio-formats](supported-audio-formats.md)]

articles/cognitive-services/Speech-Service/includes/supported-audio-formats.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2,11 +2,11 @@
22
author: IEvangelist
33
ms.service: cognitive-services
44
ms.topic: include
5-
ms.date: 03/10/2020
5+
ms.date: 03/16/2020
66
ms.author: dapine
77
---
88

9-
The default audio streaming format is WAV (16 KHz, 16-bit, and mono PCM). Outside of WAV / PCM, the following codec compressed input formats are also supported:
9+
The default audio streaming format is WAV (16 KHz or 8Khz, 16-bit, and mono PCM). Outside of WAV / PCM, the compressed input formats listed below are also supported. [Additional configuration](../how-to-use-codec-compressed-audio-input-streams.md) is needed to enable the formats listed below.
1010

1111
- MP3
1212
- OPUS/OGG
Lines changed: 15 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
### YamlMime:Landing
22

33
title: Conversation scenarios documentation
4-
summary: Explore the various conversation scenarios, Conversation Transcription or multi-device conversation.
4+
summary: Explore the various conversation scenarios. Learn to use Conversation Transcription or Multi-device Conversation in meeting scenarios, and transcription in Call Center scenarios.
55
metadata:
66
title: Conversation scenarios documentation - Tutorials, API Reference - Azure Cognitive Services | Microsoft Docs
77
titleSuffix: Azure Cognitive Services
@@ -10,45 +10,45 @@ metadata:
1010
manager: nitinme
1111
ms.service: speech-service
1212
ms.topic: landing-page
13-
ms.date: 03/10/2020
13+
ms.date: 03/16/2020
1414
ms.author: dapine
1515

1616
landingContent:
17-
- title: About Conversation Transcription
17+
- title: About Conversation Transcription (meeting scenario)
1818
linkLists:
1919
- linkListType: overview
2020
links:
21-
- text: What is Conversation Transcription?
21+
- text: What is Conversation Transcription in meetings?
2222
url: conversation-transcription.md
2323
- linkListType: how-to-guide
2424
links:
25-
- text: Handle call center scenarios
26-
url: call-center-transcription.md
2725
- text: Real-time Conversation Transcription
2826
url: how-to-use-conversation-transcription.md
2927
- text: Asynchronous Conversation Transcription
3028
url: how-to-async-conversation-transcription.md
31-
- linkListType: reference
32-
links:
33-
- text: Conversation Transcription language support
34-
url: conversation-transcription.md#language-support
35-
- title: About multi-device conversation
29+
- title: About Multi-device Conversation (meeting scenario)
3630
linkLists:
3731
- linkListType: overview
3832
links:
39-
- text: What is multi-device conversation?
33+
- text: What is Multi-device Conversation?
4034
url: multi-device-conversation.md
4135
- linkListType: quickstart
4236
links:
4337
- text: Start and join a conversation
4438
url: quickstarts/multi-device-conversation.md
45-
- linkListType: reference
39+
- title: Call Center Transcription (custom/agent scenario)
40+
linkLists:
41+
- linkListType: overview
4642
links:
47-
- text: Multi-device conversation language support
48-
url: multi-device-conversation.md#language-support
43+
- text: Handle call center scenarios
44+
url: call-center-transcription.md
4945
- title: Reference
5046
linkLists:
5147
- linkListType: reference
5248
links:
49+
- text: Conversation Transcription language support
50+
url: conversation-transcription.md#language-support
51+
- text: Multi-device conversation language support
52+
url: multi-device-conversation.md#language-support
5353
- text: Speech transcription pricing
5454
url: https://azure.microsoft.com/pricing/details/cognitive-services/speech-services/

articles/cognitive-services/Speech-Service/index.yml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -93,10 +93,10 @@ conceptualContent:
9393
- title: Conversation scenarios
9494
links:
9595
- itemType: overview
96-
text: About Conversation Transcription
96+
text: About Conversation Transcription (meetings)
9797
url: conversation-transcription.md
9898
- itemType: overview
99-
text: About multi-device conversations
99+
text: About multi-device conversations (meetings)
100100
url: multi-device-conversation.md
101101
- itemType: how-to-guide
102102
text: Call center transcription

articles/cognitive-services/Speech-Service/rest-speech-to-text.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -3,18 +3,18 @@ title: Speech-to-text API reference (REST) - Speech service
33
titleSuffix: Azure Cognitive Services
44
description: Learn how to use the speech-to-text REST API. In this article, you'll learn about authorization options, query options, how to structure a request and receive a response.
55
services: cognitive-services
6-
author: erhopf
6+
author: IEvangelist
77
manager: nitinme
88
ms.service: cognitive-services
99
ms.subservice: speech-service
1010
ms.topic: conceptual
11-
ms.date: 03/03/2020
12-
ms.author: erhopf
11+
ms.date: 03/16/2020
12+
ms.author: dapine
1313
---
1414

1515
# Speech-to-text REST API
1616

17-
As an alternative to the [Speech SDK](speech-sdk.md), the Speech service allows you to convert speech-to-text using a REST API. Each accessible endpoint is associated with a region. Your application requires a subscription key for the endpoint you plan to use.
17+
As an alternative to the [Speech SDK](speech-sdk.md), the Speech service allows you to convert speech-to-text using a REST API. Each accessible endpoint is associated with a region. Your application requires a subscription key for the endpoint you plan to use. The REST API is very limited, and it should only be used in cases were the [Speech SDK](speech-sdk.md) cannot.
1818

1919
Before using the speech-to-text REST API, understand:
2020

articles/cognitive-services/Speech-Service/toc.yml

Lines changed: 20 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -97,10 +97,12 @@
9797
href: rest-speech-to-text.md
9898
- name: Batch transcription REST API
9999
href: batch-transcription.md
100-
- name: Language support
101-
href: 'language-support.md#speech-to-text'
102100
- name: Resources
103101
items:
102+
- name: Language support
103+
href: language-support.md#speech-to-text
104+
- name: Region support
105+
href: regions.md#speech-to-text-text-to-speech-and-translation
104106
- name: Speech-to-text FAQ
105107
displayName: 'FAQ,STT FAQ,SR FAQ,frequently asked questions'
106108
href: faq-stt.md
@@ -193,14 +195,16 @@
193195
items:
194196
- name: Text-to-speech REST API
195197
href: rest-text-to-speech.md
198+
- name: Resources
199+
items:
196200
- name: Standard voice support
197-
href: 'language-support.md#standard-voices'
201+
href: language-support.md#standard-voices
198202
- name: Neural voice support
199-
href: 'language-support.md#neural-voices'
203+
href: language-support.md#neural-voices
204+
- name: Region support
205+
href: regions.md#speech-to-text-text-to-speech-and-translation
200206
- name: SSML phonetic sets
201207
href: speech-ssml-phonetic-sets.md
202-
- name: Resources
203-
items:
204208
- name: Text-to-speech FAQ
205209
displayName: 'FAQ,TTS FAQ,SS FAQ,frequently asked questions'
206210
href: faq-text-to-speech.md
@@ -249,10 +253,12 @@
249253
- name: Release notes
250254
displayName: 'release notes,updates,what''s new,what is new,changes'
251255
href: releasenotes.md
252-
- name: Language support
253-
href: 'language-support.md#speech-translation'
254256
- name: Resources
255257
items:
258+
- name: Language support
259+
href: language-support.md#speech-translation
260+
- name: Region support
261+
href: regions.md#speech-to-text-text-to-speech-and-translation
256262
- name: Speech translation FAQ
257263
href: 'https://www.microsoft.com/translator/faq.aspx'
258264
- name: Intent recognition
@@ -297,6 +303,12 @@
297303
items:
298304
- name: LUIS developer resources
299305
href: https://docs.microsoft.com/azure/cognitive-services/luis/developer-reference-resource
306+
- name: Resources
307+
items:
308+
- name: Language support
309+
href: ../luis/luis-language-support.md
310+
- name: Region support
311+
href: regions.md#intent-recognition
300312
- name: Conversation scenarios
301313
items:
302314
- name: Conversation scenarios documentation

0 commit comments

Comments
 (0)