MicrosoftDocs
diff --git a/‎.openpublishing.redirection.json
Lines changed: 10 additions & 0 deletions b/‎.openpublishing.redirection.json
Lines changed: 10 additions & 0 deletions
diff --git a/‎articles/cognitive-services/Speech-Service/how-to-custom-speech-test-data.md
Lines changed: 25 additions & 21 deletions b/‎articles/cognitive-services/Speech-Service/how-to-custom-speech-test-data.md
Lines changed: 25 additions & 21 deletions
diff --git a/‎articles/cognitive-services/Speech-Service/how-to-use-codec-compressed-audio-input-streams-android.md
Lines changed: 0 additions & 159 deletions b/‎articles/cognitive-services/Speech-Service/how-to-use-codec-compressed-audio-input-streams-android.md
Lines changed: 0 additions & 159 deletions
diff --git a/‎articles/cognitive-services/Speech-Service/how-to-use-codec-compressed-audio-input-streams-ios.md
Lines changed: 0 additions & 66 deletions b/‎articles/cognitive-services/Speech-Service/how-to-use-codec-compressed-audio-input-streams-ios.md
Lines changed: 0 additions & 66 deletions
@@ -49205,6 +49205,16 @@
       "source_path": "articles/cognitive-services/Speech-Service/sapi-phoneset-usage.md",
       "redirect_url": "/azure/cognitive-services/speech-service/speech-ssml-phonetic-sets",
       "redirect_document_id": false
+    },
+    {
+      "source_path": "articles/cognitive-services/Speech-Service/how-to-use-codec-compressed-audio-input-streams-android.md",
+      "redirect_url": "/azure/cognitive-services/Speech-Service/how-to-use-codec-compressed-audio-input-streams?pivots=programming-language-java",
+      "redirect_document_id": false
+    },
+    {
+      "source_path": "articles/cognitive-services/Speech-Service/how-to-use-codec-compressed-audio-input-streams-ios.md",
+      "redirect_url": "/azure/cognitive-services/Speech-Service/how-to-use-codec-compressed-audio-input-streams?pivots=programming-language-objectivec",
+      "redirect_document_id": false
     }
   ]
 }
@@ -3,13 +3,13 @@ title: "Prepare test data for Custom Speech - Speech service"
 titleSuffix: Azure Cognitive Services
 description: "When testing the accuracy of Microsoft speech recognition or training your custom models, you'll need audio and text data. On this page, we cover the types of data, how to use, and manage them."
 services: cognitive-services
-author: erhopf
+author: IEvangelist
 manager: nitinme
 ms.service: cognitive-services
 ms.subservice: speech-service
 ms.topic: conceptual
-ms.date: 12/17/2019
-ms.author: erhopf
+ms.date: 03/09/2020
+ms.author: dapine
 ---
 
 # Prepare data for Custom Speech
@@ -50,15 +50,17 @@ Audio data is optimal for testing the accuracy of Microsoft's baseline speech-to
 
 Use this table to ensure that your audio files are formatted correctly for use with Custom Speech:
 
-| Property | Value |
-|----------|-------|
-| File format | RIFF (WAV) |
-| Sample rate | 8,000 Hz or 16,000 Hz |
-| Channels | 1 (mono) |
-| Maximum length per audio | 2 hours |
-| Sample format | PCM, 16-bit |
-| Archive format | .zip |
-| Maximum archive size | 2 GB |
+| Property                 | Value                 |
+|--------------------------|-----------------------|
+| File format              | RIFF (WAV)            |
+| Sample rate              | 8,000 Hz or 16,000 Hz |
+| Channels                 | 1 (mono)              |
+| Maximum length per audio | 2 hours               |
+| Sample format            | PCM, 16-bit           |
+| Archive format           | .zip                  |
+| Maximum archive size     | 2 GB                  |
+
+[!INCLUDE [supported-audio-formats](includes/supported-audio-formats.md)]
 
 > [!TIP]
 > When uploading training and testing data, the .zip file size cannot exceed 2 GB. If you require more data for training, divide it into several .zip files and upload them separately. Later, you can choose to train from *multiple* datasets. However, you can only test from a *single* dataset.
@@ -74,18 +76,20 @@ Use <a href="http://sox.sourceforge.net" target="_blank" rel="noopener">SoX <spa
 
 To measure the accuracy of Microsoft's speech-to-text accuracy when processing your audio files, you must provide human-labeled transcriptions (word-by-word) for comparison. While human-labeled transcription is often time consuming, it's necessary to evaluate accuracy and to train the model for your use cases. Keep in mind, the improvements in recognition will only be as good as the data provided. For that reason, it's important that only high-quality transcripts are uploaded.
 
-| Property | Value |
-|----------|-------|
-| File format | RIFF (WAV) |
-| Sample rate | 8,000 Hz or 16,000 Hz |
-| Channels | 1 (mono) |
+| Property                 | Value                               |
+|--------------------------|-------------------------------------|
+| File format              | RIFF (WAV)                          |
+| Sample rate              | 8,000 Hz or 16,000 Hz               |
+| Channels                 | 1 (mono)                            |
 | Maximum length per audio | 2 hours (testing) / 60 s (training) |
-| Sample format | PCM, 16-bit |
-| Archive format | .zip |
-| Maximum zip size | 2 GB |
+| Sample format            | PCM, 16-bit                         |
+| Archive format           | .zip                                |
+| Maximum zip size         | 2 GB                                |
+
+[!INCLUDE [supported-audio-formats](includes/supported-audio-formats.md)]
 
 > [!NOTE]
-> When uploading training and testing data, the .zip file size cannot exceed 2 GB. Uou can only test from a *single* dataset, be sure to keep it within the appropriate file size.
+> When uploading training and testing data, the .zip file size cannot exceed 2 GB. You can only test from a *single* dataset, be sure to keep it within the appropriate file size. Additionally, each training file cannot exceed 60 seconds otherwise it will error out.
 
 To address issues like word deletion or substitution, a significant amount of data is required to improve recognition. Generally, it's recommended to provide word-by-word transcriptions for roughly 10 to 1,000 hours of audio. The transcriptions for all WAV files should be contained in a single plain-text file. Each line of the transcription file should contain the name of one of the audio files, followed by the corresponding transcription. The file name and transcription should be separated by a tab (\t).
Original file line number	Diff line number	Diff line change
`@@ -49205,6 +49205,16 @@`
`49205`	`49205`	`"source_path": "articles/cognitive-services/Speech-Service/sapi-phoneset-usage.md",`
`49206`	`49206`	`"redirect_url": "/azure/cognitive-services/speech-service/speech-ssml-phonetic-sets",`
`49207`	`49207`	`"redirect_document_id": false`
	`49208`	`+ },`
	`49209`	`+ {`
	`49210`	`+ "source_path": "articles/cognitive-services/Speech-Service/how-to-use-codec-compressed-audio-input-streams-android.md",`
	`49211`	`+ "redirect_url": "/azure/cognitive-services/Speech-Service/how-to-use-codec-compressed-audio-input-streams?pivots=programming-language-java",`
	`49212`	`+ "redirect_document_id": false`
	`49213`	`+ },`
	`49214`	`+ {`
	`49215`	`+ "source_path": "articles/cognitive-services/Speech-Service/how-to-use-codec-compressed-audio-input-streams-ios.md",`
	`49216`	`+ "redirect_url": "/azure/cognitive-services/Speech-Service/how-to-use-codec-compressed-audio-input-streams?pivots=programming-language-objectivec",`
	`49217`	`+ "redirect_document_id": false`
`49208`	`49218`	`}`
`49209`	`49219`	`]`
`49210`	`49220`	`}`