Merge pull request #107970 from IEvangelist/tocAgain

itechedit · web-flow · commit 01a2c7669dcc · 2020-03-17T09:57:49.000-07:00
Updates from working with Chris
diff --git a/articles/cognitive-services/Speech-Service/batch-transcription.md b/articles/cognitive-services/Speech-Service/batch-transcription.md
@@ -1,5 +1,5 @@
 ---
-title: How to use batch transcription - Speech service
+title: What is batch transcription - Speech service
 titleSuffix: Azure Cognitive Services
 description: Batch transcription is ideal if you want to transcribe a large quantity of audio in storage, such as Azure Blobs. By using the dedicated REST API, you can point to audio files with a shared access signature (SAS) URI and asynchronously receive transcriptions.
 services: cognitive-services
@@ -8,11 +8,11 @@ manager: nitinme
 ms.service: cognitive-services
 ms.subservice: speech-service
 ms.topic: conceptual
-ms.date: 03/16/2020
+ms.date: 03/17/2020
 ms.author: panosper
 ---
 
-# How to use batch transcription
+# What is batch transcription?
 
 Batch transcription is ideal for transcribing a large amount of audio in storage. By using the dedicated REST API, you can point to audio files with a shared access signature (SAS) URI and asynchronously receive transcription results.
 
@@ -48,11 +48,11 @@ If you plan to customize acoustic or language models, follow the steps in [Custo
 
 The Batch Transcription API supports the following formats:
 
-| Format | Codec | Bitrate | Sample Rate |
-|--------|-------|---------|-------------|
-| WAV | PCM | 16-bit | 8 kHz or 16 kHz, mono or stereo |
-| MP3 | PCM | 16-bit | 8 kHz or 16 kHz, mono or stereo |
-| OGG | OPUS | 16-bit | 8 kHz or 16 kHz, mono or stereo |
+| Format | Codec | Bitrate | Sample Rate                     |
+|--------|-------|---------|---------------------------------|
+| WAV    | PCM   | 16-bit  | 8 kHz or 16 kHz, mono or stereo |
+| MP3    | PCM   | 16-bit  | 8 kHz or 16 kHz, mono or stereo |
+| OGG    | OPUS  | 16-bit  | 8 kHz or 16 kHz, mono or stereo |
 
 For stereo audio streams, the left and right channels are split during the transcription. For each channel, a JSON result file is being created. The timestamps generated per utterance enable the developer to create an ordered final transcript.
 
@@ -142,7 +142,7 @@ For mono input audio, one transcription result file is being created. For stereo
 
 ```json
 {
-  "AudioFileResults":[ 
+  "AudioFileResults":[
     {
       "AudioFileName": "Channel.0.wav | Channel.1.wav"      'maximum of 2 channels supported'
       "AudioFileUrl": null                                  'always null'
@@ -204,12 +204,12 @@ For mono input audio, one transcription result file is being created. For stereo
 
 The result contains these forms:
 
-|Form|Content|
-|-|-|
-|`Lexical`|The actual words recognized.
-|`ITN`|Inverse-text-normalized form of the recognized text. Abbreviations ("doctor smith" to "dr smith"), phone numbers, and other transformations are applied.
-|`MaskedITN`|The ITN form with profanity masking applied.
-|`Display`|The display form of the recognized text. This includes added punctuation and capitalization.
+| Form        | Content                                                                                                                                                  |
+|-------------|----------------------------------------------------------------------------------------------------------------------------------------------------------|
+| `Lexical`   | The actual words recognized.                                                                                                                             |
+| `ITN`       | Inverse-text-normalized form of the recognized text. Abbreviations ("doctor smith" to "dr smith"), phone numbers, and other transformations are applied. |
+| `MaskedITN` | The ITN form with profanity masking applied.                                                                                                             |
+| `Display`   | The display form of the recognized text. This includes added punctuation and capitalization.                                                             |
 
 ## Speaker separation (Diarization)
 
diff --git a/articles/cognitive-services/Speech-Service/index-speech-to-text.yml b/articles/cognitive-services/Speech-Service/index-speech-to-text.yml
@@ -1,7 +1,7 @@
 ### YamlMime:Landing
 
 title: Speech-to-text documentation
-summary: Speech-to-text from the Speech service, also known as speech recognition, enables real-time transcription of audio streams into text.
+summary: Speech-to-text from the Speech service, also known as speech recognition, enables real-time and batch transcription of audio streams into text.
 metadata:
   title: Speech-to-text documentation - Tutorials, API Reference - Azure Cognitive Services | Microsoft Docs
   titleSuffix: Azure Cognitive Services
@@ -10,16 +10,20 @@ metadata:
   manager: nitinme
   ms.service: speech-service
   ms.topic: landing-page
-  ms.date: 03/10/2020
+  ms.date: 03/17/2020
   ms.author: dapine
 
 landingContent:
 - title: About speech-to-text
   linkLists:
     - linkListType: overview
       links:
-      - text: What is speech-to-text?
+      - text: What is real-time speech-to-text?
         url: speech-to-text.md
+      - text: What is batch speech-to-text?
+        url: batch-transcription.md
+      - text: Speech recognition basics?
+        url: speech-to-text-basics.md
     - linkListType: quickstart
       links:
         - text: Recognize speech with microphone input
diff --git a/articles/cognitive-services/Speech-Service/toc.yml b/articles/cognitive-services/Speech-Service/toc.yml
@@ -18,8 +18,10 @@
       href: index-speech-to-text.yml
     - name: Overview
       items:
-        - name: What is speech-to-text?
+        - name: What is real-time speech-to-text?
           href: speech-to-text.md
+        - name: What is batch speech-to-text?
+          href: batch-transcription.md
         - name: Speech recognition basics
           href: speech-to-text-basics.md
     - name: Quickstart
@@ -40,6 +42,7 @@
           href: how-to-custom-speech.md
         - name: Use compressed audio input formats
           href: how-to-use-codec-compressed-audio-input-streams.md
+          displayName: codec,codecs,compression,compressed,mp3,flac,mulaw,alaw,mp4,mp4a,wav,opus,ogg,pcm,silk
         - name: Improve accuracy with Phrase Lists
           href: how-to-phrase-lists.md
         - name: Improve accuracy with tenant models
@@ -95,8 +98,6 @@
           items:
             - name: Speech-to-text REST API
               href: rest-speech-to-text.md
-            - name: Batch transcription REST API
-              href: batch-transcription.md
     - name: Resources
       items:
         - name: Language support