articles/cognitive-services/Speech-Service/batch-transcription.md
15 additions & 15 deletions
@@ -1,5 +1,5 @@
 ---
-title: How to use batch transcription - Speech service
+title: What is batch transcription - Speech service
 titleSuffix: Azure Cognitive Services
 description: Batch transcription is ideal if you want to transcribe a large quantity of audio in storage, such as Azure Blobs. By using the dedicated REST API, you can point to audio files with a shared access signature (SAS) URI and asynchronously receive transcriptions.
 services: cognitive-services
@@ -8,11 +8,11 @@ manager: nitinme
 ms.service: cognitive-services
 ms.subservice: speech-service
 ms.topic: conceptual
-ms.date: 03/16/2020
+ms.date: 03/17/2020
 ms.author: panosper
 ---

-# How to use batch transcription
+# What is batch transcription?

 Batch transcription is ideal for transcribing a large amount of audio in storage. By using the dedicated REST API, you can point to audio files with a shared access signature (SAS) URI and asynchronously receive transcription results.

@@ -48,11 +48,11 @@ If you plan to customize acoustic or language models, follow the steps in [Custo

 The Batch Transcription API supports the following formats:

-| Format | Codec | Bitrate | Sample Rate |
-|--------|-------|---------|-------------|
-| WAV | PCM | 16-bit | 8 kHz or 16 kHz, mono or stereo |
-| MP3 | PCM | 16-bit | 8 kHz or 16 kHz, mono or stereo |
-| OGG | OPUS | 16-bit | 8 kHz or 16 kHz, mono or stereo |
+| WAV | PCM | 16-bit| 8 kHz or 16 kHz, mono or stereo |
+| MP3 | PCM | 16-bit| 8 kHz or 16 kHz, mono or stereo |
+| OGG | OPUS | 16-bit| 8 kHz or 16 kHz, mono or stereo |

 For stereo audio streams, the left and right channels are split during the transcription. For each channel, a JSON result file is being created. The timestamps generated per utterance enable the developer to create an ordered final transcript.

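The stereo note in the hunk above says that per-utterance timestamps let a developer build one ordered transcript from the two per-channel result files. Below is a minimal Python sketch of that merge step, assuming each channel's utterances have already been parsed into simple records. The `Utterance` class, its field names, the `merge_channels` helper, and the sample values are all hypothetical illustrations and are not taken from the service's result schema.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Utterance:
    """Hypothetical record for one recognized utterance (not the service schema)."""
    channel: int  # 0 = left channel, 1 = right channel
    offset: int   # utterance start time; the unit is an assumption
    text: str


def merge_channels(*channels: Iterable[Utterance]) -> List[Utterance]:
    """Combine per-channel utterances into one list ordered by start time."""
    merged = [utterance for channel in channels for utterance in channel]
    # Sort by start time; ties are broken by channel number for a stable order.
    return sorted(merged, key=lambda u: (u.offset, u.channel))


# Made-up sample data standing in for two parsed per-channel result files.
left = [
    Utterance(0, 0, "Hello, thanks for calling."),
    Utterance(0, 52_000_000, "Sure, one moment."),
]
right = [
    Utterance(1, 30_000_000, "Hi, I need help with my order."),
]

transcript = merge_channels(left, right)
for utterance in transcript:
    print(f"[channel {utterance.channel}] {utterance.text}")
```

Sorting on a `(offset, channel)` key keeps the ordering deterministic when two speakers start at the same instant. A real implementation would first map the fields of each JSON result file onto records like these.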
@@ -142,7 +142,7 @@ For mono input audio, one transcription result file is being created. For stereo

 ```json
 {
-"AudioFileResults":[
+"AudioFileResults":[
 {
 "AudioFileName": "Channel.0.wav | Channel.1.wav"'maximum of 2 channels supported'
 "AudioFileUrl": null'always null'
@@ -204,12 +204,12 @@ For mono input audio, one transcription result file is being created. For stereo

 The result contains these forms:

-|Form|Content|
-|-|-|
-|`Lexical`|The actual words recognized.
-|`ITN`|Inverse-text-normalized form of the recognized text. Abbreviations ("doctor smith" to "dr smith"), phone numbers, and other transformations are applied.
-|`MaskedITN`|The ITN form with profanity masking applied.
-|`Display`|The display form of the recognized text. This includes added punctuation and capitalization.
+|`ITN`|Inverse-text-normalized form of the recognized text. Abbreviations ("doctor smith" to "dr smith"), phone numbers, and other transformations are applied.|
+|`MaskedITN`|The ITN form with profanity masking applied.|
+|`Display`|The display form of the recognized text. This includes added punctuation and capitalization.|
articles/cognitive-services/Speech-Service/index-speech-to-text.yml
7 additions & 3 deletions
@@ -1,7 +1,7 @@
 ### YamlMime:Landing

 title: Speech-to-text documentation
-summary: Speech-to-text from the Speech service, also known as speech recognition, enables real-time transcription of audio streams into text.
+summary: Speech-to-text from the Speech service, also known as speech recognition, enables real-time and batch transcription of audio streams into text.
 metadata:
 title: Speech-to-text documentation - Tutorials, API Reference - Azure Cognitive Services | Microsoft Docs