MicrosoftDocs
diff --git a/‎articles/cognitive-services/Speech-Service/call-center-transcription.md
Lines changed: 54 additions & 47 deletions b/‎articles/cognitive-services/Speech-Service/call-center-transcription.md
Lines changed: 54 additions & 47 deletions
diff --git a/‎articles/cognitive-services/Speech-Service/conversation-transcription.md
Lines changed: 1 addition & 1 deletion b/‎articles/cognitive-services/Speech-Service/conversation-transcription.md
Lines changed: 1 addition & 1 deletion
diff --git a/‎articles/cognitive-services/Speech-Service/how-to-custom-speech-human-labeled-transcriptions.md
Lines changed: 76 additions & 75 deletions b/‎articles/cognitive-services/Speech-Service/how-to-custom-speech-human-labeled-transcriptions.md
Lines changed: 76 additions & 75 deletions
diff --git a/‎articles/cognitive-services/Speech-Service/how-to-custom-speech-inspect-data.md
Lines changed: 8 additions & 8 deletions b/‎articles/cognitive-services/Speech-Service/how-to-custom-speech-inspect-data.md
Lines changed: 8 additions & 8 deletions
@@ -1,7 +1,7 @@
 ---
 title: What is Conversation Transcription (Preview) - Speech Service
 titleSuffix: Azure Cognitive Services
-description: Conversation Transcription is a speech-to-text solution that combines speech recognition, speaker identification, and sentence attribution to each speaker (also known as diarization) to provide real-time and/or asynchronous transcription of any conversation. Conversation Transcription makes conversations inclusive for everyone, such as participants who are deaf and hard of hearing.
+description: Conversation Transcription is a speech-to-text solution that combines speech recognition, speaker identification, and sentence attribution to each speaker (also known as diarization) to provide real-time and/or asynchronous transcription of any conversation.
 services: cognitive-services
 author: markamos
 manager: nitinme
 
@@ -1,10 +1,11 @@
 ---
-title: "Human-labeled transcriptions guidelines - Speech Service"
+title: Human-labeled transcriptions guidelines - Speech Service
 titleSuffix: Azure Cognitive Services
-description: "If you're looking to improve recognition accuracy, especially issues that are caused when words are deleted or incorrectly substituted, you'll want to use human-labeled transcriptions along with your audio data. What are human-labeled transcriptions? That's easy, they're word-by-word, verbatim transcriptions of an audio file."
+description: To improve speech recognition accuracy, such as when words are deleted or incorrectly substituted, you can use human-labeled transcriptions along with your audio data. Human-labeled transcriptions are word-by-word, verbatim transcriptions of an audio file.
 services: cognitive-services
 author: erhopf
 manager: nitinme
+
 ms.service: cognitive-services
 ms.subservice: speech-service
 ms.topic: conceptual
@@ -25,7 +26,7 @@ Human-labeled transcriptions for English audio must be provided as plain text, o
 Here are a few examples:
 
 | Characters to avoid | Substitution | Notes |
-|---------------------|--------------|-------|
+| ------------------- | ------------ | ----- |
 | “Hello world” | "Hello world" | The opening and closing quotations marks have been substituted with appropriate ASCII characters. |
 | John’s day | John's day | The apostrophe has been substituted with the appropriate ASCII character. |
 | it was good—no, it was great! | it was good--no, it was great! | The em dash was substituted with two hyphens. |
@@ -34,88 +35,88 @@ Here are a few examples:
 
 Text normalization is the transformation of words into a consistent format used when training a model. Some normalization rules are applied to text automatically, however, we recommend using these guidelines as you prepare your human-labeled transcription data:
 
-* Write out abbreviations in words.
-* Write out non-standard numeric strings in words (such as accounting terms).
-* Non-alphabetic characters or mixed alphanumeric characters should be transcribed as pronounced.
-* Abbreviations that are pronounced as words shouldn't be edited (such as "radar", "laser", "RAM", or "NATO").
-* Write out abbreviations that are pronounced as separate letters with each letter separated by a space.
+- Write out abbreviations in words.
+- Write out non-standard numeric strings in words (such as accounting terms).
+- Non-alphabetic characters or mixed alphanumeric characters should be transcribed as pronounced.
+- Abbreviations that are pronounced as words shouldn't be edited (such as "radar", "laser", "RAM", or "NATO").
+- Write out abbreviations that are pronounced as separate letters with each letter separated by a space.
 
 Here are a few examples of normalization that you should perform on the transcription:
 
-| Original text | Text after normalization |
-|---------------|--------------------------|
-| Dr. Bruce Banner | Doctor Bruce Banner |
-| James Bond, 007 | James Bond, double oh seven |
-| Ke$ha | Kesha |
-| How long is the 2x4 | How long is the two by four |
+| Original text               | Text after normalization              |
+| --------------------------- | ------------------------------------- |
+| Dr. Bruce Banner            | Doctor Bruce Banner                   |
+| James Bond, 007             | James Bond, double oh seven           |
+| Ke$ha                       | Kesha                                 |
+| How long is the 2x4         | How long is the two by four           |
 | The meeting goes from 1-3pm | The meeting goes from one to three pm |
-| My blood type is O+ | My blood type is O positive |
-| Water is H20 | Water is H 2 O |
-| Play OU812 by Van Halen | Play O U 8 1 2 by Van Halen |
-| UTF-8 with BOM | U T F 8 with BOM |
+| My blood type is O+         | My blood type is O positive           |
+| Water is H20                | Water is H 2 O                        |
+| Play OU812 by Van Halen     | Play O U 8 1 2 by Van Halen           |
+| UTF-8 with BOM              | U T F 8 with BOM                      |
 
 The following normalization rules are automatically applied to transcriptions:
 
-* Use lowercase letters.
-* Remove all punctuation except apostrophes within words.
-* Expand numbers into words/spoken form, such as dollar amounts.
+- Use lowercase letters.
+- Remove all punctuation except apostrophes within words.
+- Expand numbers into words/spoken form, such as dollar amounts.
 
 Here are a few examples of normalization automatically performed on the transcription:
 
-| Original text | Text after normalization |
-|---------------|--------------------------|
-| "Holy cow!" said Batman. | holy cow said batman |
+| Original text                          | Text after normalization          |
+| -------------------------------------- | --------------------------------- |
+| "Holy cow!" said Batman.               | holy cow said batman              |
 | "What?" said Batman's sidekick, Robin. | what said batman's sidekick robin |
-| Go get -em! | go get em |
-| I'm double-jointed | I'm double jointed |
-| 104 Elm Street | one oh four Elm street |
-| Tune to 102.7 | tune to one oh two point seven |
-| Pi is about 3.14 | pi is about three point one four |
-It costs $3.14| it costs three fourteen |
+| Go get -em!                            | go get em                         |
+| I'm double-jointed                     | I'm double jointed                |
+| 104 Elm Street                         | one oh four Elm street            |
+| Tune to 102.7                          | tune to one oh two point seven    |
+| Pi is about 3.14                       | pi is about three point one four  |
+| It costs \$3.14                        | it costs three fourteen           |
 
 ## Mandarin Chinese (zh-CN)
 
 Human-labeled transcriptions for Mandarin Chinese audio must be UTF-8 encoded with a byte-order marker. Avoid the use of half-width punctuation characters. These characters can be included inadvertently when you prepare the data in a word-processing program or scrape data from web pages. If these characters are present, make sure to update them with the appropriate full-width substitution.
 
 Here are a few examples:
 
-| Characters to avoid | Substitution | Notes |
-|---------------------|--------------|-------|
+| Characters to avoid | Substitution   | Notes |
+| ------------------- | -------------- | ----- |
 | "你好" | "你好" | The opening and closing quotations marks have been substituted with appropriate characters. |
-| 需要什么帮助? | 需要什么帮助？ | The question mark has been substituted with appropriate character. |
+| 需要什么帮助? | 需要什么帮助？| The question mark has been substituted with appropriate character. |
 
 ### Text normalization for Mandarin Chinese
 
 Text normalization is the transformation of words into a consistent format used when training a model. Some normalization rules are applied to text automatically, however, we recommend using these guidelines as you prepare your human-labeled transcription data:
 
-* Write out abbreviations in words.
-* Write out numeric strings in spoken form.
+- Write out abbreviations in words.
+- Write out numeric strings in spoken form.
 
 Here are a few examples of normalization that you should perform on the transcription:
 
 | Original text | Text after normalization |
-|---------------|--------------------------|
-| 我今年21 | 我今年二十一 |
-| 3号楼504 | 三号 楼 五 零 四 |
+| ------------- | ------------------------ |
+| 我今年 21 | 我今年二十一 |
+| 3 号楼 504 | 三号 楼 五 零 四 |
 
 The following normalization rules are automatically applied to transcriptions:
 
-* Remove all punctuation
-* Expand numbers to spoken form
-* Convert full-width letters to half-width letters
-* Using uppercase letters for all English words
+- Remove all punctuation
+- Expand numbers to spoken form
+- Convert full-width letters to half-width letters
+- Using uppercase letters for all English words
 
 Here are a few examples of normalization automatically performed on the transcription:
 
 | Original text | Text after normalization |
-|---------------|--------------------------|
+| ------------- | ------------------------ |
 | 3.1415 | 三 点 一 四 一 五 |
-| ￥3.5 | 三 元 五 角 |
-| w f y z |W F Y Z |
-| 1992年8月8日 | 一 九 九 二 年 八 月 八 日 |
+| ￥ 3.5 | 三 元 五 角 |
+| w f y z | W F Y Z |
+| 1992 年 8 月 8 日 | 一 九 九 二 年 八 月 八 日 |
 | 你吃饭了吗? | 你 吃饭 了 吗 |
-| 下午5:00的航班 | 下午 五点 的 航班 |
-| 我今年21岁 | 我 今年 二十 一 岁 |
+| 下午 5:00 的航班 | 下午 五点 的 航班 |
+| 我今年 21 岁 | 我 今年 二十 一 岁 |
 
 ## German (de-DE) and other languages
 
@@ -125,42 +126,42 @@ Human-labeled transcriptions for German audio (and other non-English or Mandarin
 
 Text normalization is the transformation of words into a consistent format used when training a model. Some normalization rules are applied to text automatically, however, we recommend using these guidelines as you prepare your human-labeled transcription data:
 
-*	Write decimal points as "," and not ".".
-*	Write time separators as ":" and not "." (for example: 12:00 Uhr).
-*	Abbreviations such as "ca." aren't replaced. We recommend that you use the full spoken form.
-*	The four main mathematical operators (+, -, \*, and /) are removed. We recommend replacing them with the written form: "plus," "minus," "mal," and "geteilt."
-*	Comparison operators are removed (=, <, and >). We recommend replacing them with "gleich," "kleiner als," and "grösser als."
-*	Write fractions, such as 3/4, in written form (for example: "drei viertel" instead of 3/4).
-*	Replace the "€" symbol with its written form "Euro."
+- Write decimal points as "," and not ".".
+- Write time separators as ":" and not "." (for example: 12:00 Uhr).
+- Abbreviations such as "ca." aren't replaced. We recommend that you use the full spoken form.
+- The four main mathematical operators (+, -, \*, and /) are removed. We recommend replacing them with the written form: "plus," "minus," "mal," and "geteilt."
+- Comparison operators are removed (=, <, and >). We recommend replacing them with "gleich," "kleiner als," and "grösser als."
+- Write fractions, such as 3/4, in written form (for example: "drei viertel" instead of 3/4).
+- Replace the "€" symbol with its written form "Euro."
 
 Here are a few examples of normalization that you should perform on the transcription:
 
-| Original text | Text after user normalization | Text after system normalization |
-|---------------|-------------------------------|---------------------------------|
-| Es ist 12.23 Uhr | Es ist 12:23 Uhr | es ist zwölf uhr drei und zwanzig uhr |
-| {12.45} | {12,45} | zwölf komma vier fünf |
-| 2 + 3 - 4 | 2 plus 3 minus 4 | zwei plus drei minus vier |
+| Original text    | Text after user normalization | Text after system normalization       |
+| ---------------- | ----------------------------- | ------------------------------------- |
+| Es ist 12.23 Uhr | Es ist 12:23 Uhr              | es ist zwölf uhr drei und zwanzig uhr |
+| {12.45}          | {12,45}                       | zwölf komma vier fünf                 |
+| 2 + 3 - 4        | 2 plus 3 minus 4              | zwei plus drei minus vier             |
 
 The following normalization rules are automatically applied to transcriptions:
 
-* Use lowercase letters for all text.
-* Remove all punctuation, including various types of quotation marks ("test", 'test', "test„, and «test» are OK).
-* Discard rows with any special characters from this set: ¢ ¤ ¥ ¦ § © ª ¬ ® ° ± ² µ × ÿ Ø¬¬.
-* Expand numbers to spoken form, including dollar or Euro amounts.
-* Accept umlauts only for a, o, and u. Others will be replaced by "th" or be discarded.
+- Use lowercase letters for all text.
+- Remove all punctuation, including various types of quotation marks ("test", 'test', "test„, and «test» are OK).
+- Discard rows with any special characters from this set: ¢ ¤ ¥ ¦ § © ª ¬ ® ° ± ² µ × ÿ Ø¬¬.
+- Expand numbers to spoken form, including dollar or Euro amounts.
+- Accept umlauts only for a, o, and u. Others will be replaced by "th" or be discarded.
 
 Here are a few examples of normalization automatically performed on the transcription:
 
-| Original text | Text after normalization |
-|---------------|--------------------------|
-| Frankfurter Ring | frankfurter ring |
-| ¡Eine Frage! | eine frage |
-| wir, haben | wir haben |
+| Original text    | Text after normalization |
+| ---------------- | ------------------------ |
+| Frankfurter Ring | frankfurter ring         |
+| ¡Eine Frage!     | eine frage               |
+| wir, haben       | wir haben                |
 
 ## Next Steps
 
-* [Prepare and test your data](how-to-custom-speech-test-data.md)
-* [Inspect your data](how-to-custom-speech-inspect-data.md)
-* [Evaluate your data](how-to-custom-speech-evaluate-data.md)
-* [Train your model](how-to-custom-speech-train-model.md)
-* [Deploy your model](how-to-custom-speech-deploy-model.md)
+- [Prepare and test your data](how-to-custom-speech-test-data.md)
+- [Inspect your data](how-to-custom-speech-inspect-data.md)
+- [Evaluate your data](how-to-custom-speech-evaluate-data.md)
+- [Train your model](how-to-custom-speech-train-model.md)
+- [Deploy your model](how-to-custom-speech-deploy-model.md)
@@ -1,7 +1,7 @@
 ---
-title: "Inspect data quality for Custom Speech - Speech Service"
+title: Inspect data quality for Custom Speech - Speech Service
 titleSuffix: Azure Cognitive Services
-description: "Custom Speech provides tools that allow you to visually inspect the recognition quality of a model by comparing audio data with the corresponding recognition result. From the Custom Speech portal, you can play back uploaded audio and determine if the provided recognition result is correct.  This tool allows you to quickly inspect quality of our baseline speech-to-text model or a trained custom model without having to transcribe any audio data."
+description: Custom Speech provides tools that allow you to visually inspect the recognition quality of a model by comparing audio data with the corresponding recognition result. You can play back uploaded audio and determine if the provided recognition result is correct.
 services: cognitive-services
 author: erhopf
 manager: nitinme
@@ -38,18 +38,18 @@ After a test has been successfully created, you can compare the models side by s
 
 ## Side-by-side model comparisons
 
-When the test status is *Succeeded*, click in the test item name to see details of the test. This detail page lists all the utterances in your dataset, indicating the recognition results of the two models alongside the transcription from the submitted dataset.
+When the test status is _Succeeded_, click in the test item name to see details of the test. This detail page lists all the utterances in your dataset, indicating the recognition results of the two models alongside the transcription from the submitted dataset.
 
 To help inspect the side-by-side comparison, you can toggle various error types including insertion, deletion, and substitution. By listening to the audio and comparing recognition results in each column (showing human-labeled transcription and the results of two speech-to-text models), you can decide which model meets your needs and where improvements are needed.
 
-Inspecting quality testing is useful to validate if the quality of a speech recognition endpoint is enough for an application.  For an objective measure of accuracy, requiring transcribed audio, follow the instructions found in [Evaluate Accuracy](how-to-custom-speech-evaluate-data.md).
+Inspecting quality testing is useful to validate if the quality of a speech recognition endpoint is enough for an application. For an objective measure of accuracy, requiring transcribed audio, follow the instructions found in [Evaluate Accuracy](how-to-custom-speech-evaluate-data.md).
 
 ## Next steps
 
-* [Evaluate your data](how-to-custom-speech-evaluate-data.md)
-* [Train your model](how-to-custom-speech-train-model.md)
-* [Deploy your model](how-to-custom-speech-deploy-model.md)
+- [Evaluate your data](how-to-custom-speech-evaluate-data.md)
+- [Train your model](how-to-custom-speech-train-model.md)
+- [Deploy your model](how-to-custom-speech-deploy-model.md)
 
 ## Additional resources
 
-* [Prepare test data for Custom Speech](how-to-custom-speech-test-data.md)
+- [Prepare test data for Custom Speech](how-to-custom-speech-test-data.md)