Merge pull request #406 from eric-urban/eur/speech-refresh-3

Stacyrch140 · web-flow · commit 6df053949dc4 · 2024-09-19T23:26:34.000-04:00
refresh speech docs
diff --git a/articles/ai-services/speech-service/improve-accuracy-phrase-list.md b/articles/ai-services/speech-service/improve-accuracy-phrase-list.md
@@ -1,21 +1,21 @@
 ---
 title: Improve recognition accuracy with phrase list
 description: Phrase lists can be used to customize speech recognition results based on context. 
-author: ut-karsh
-ms.author: umaheshwari
+author: eric-urban
+ms.author: eur
+ms.reviewer: umaheshwari
 ms.service: azure-ai-speech
 ms.custom: devx-track-extended-java, devx-track-js, devx-track-python
 ms.topic: how-to
-ms.date: 1/21/2024
+ms.date: 9/20/2024
 zone_pivot_groups: programming-languages-set-two-with-js-spx
+#Customer intent: As a developer using speech to text, I want to learn how to improve recognition accuracy with phrase list.
 ---
 
 # Improve recognition accuracy with phrase list
 
 A phrase list is a list of words or phrases provided ahead of time to help improve their recognition. Adding a phrase to a phrase list increases its importance, thus making it more likely to be recognized.
 
-For supported phrase list locales, see [Language and voice support for the Speech service](language-support.md?tabs=phraselist).
-
 Examples of phrases include:
 * Names
 * Geographical locations
@@ -26,6 +26,8 @@ Phrase lists are simple and lightweight:
 - **Just-in-time**: A phrase list is provided just before starting the speech recognition, eliminating the need to train a custom model. 
 - **Lightweight**: You don't need a large data set. Provide a word or phrase to boost its recognition.
 
+For supported phrase list locales, see [Language and voice support for the Speech service](language-support.md?tabs=phraselist).
+
 You can use phrase lists with the [Speech Studio](speech-studio-overview.md), [Speech SDK](quickstarts/setup-platform.md), or [Speech Command Line Interface (CLI)](spx-overview.md). The [Batch transcription API](batch-transcription.md) doesn't support phrase lists.
 
 You can use phrase lists with both standard and [custom speech](custom-speech-overview.md). There are some situations where training a custom model that includes phrases is likely the best option to improve accuracy. For example, in the following cases you would use custom speech: 
diff --git a/articles/ai-services/speech-service/index-speech-to-text.yml b/articles/ai-services/speech-service/index-speech-to-text.yml
@@ -9,7 +9,7 @@ metadata:
   manager: nitinme
   ms.service: azure-ai-speech
   ms.topic: landing-page
-  ms.date: 8/20/2024
+  ms.date: 9/20/2024
   ms.author: eur
 
 landingContent:
@@ -29,26 +29,28 @@ landingContent:
       links:
         - text: Get started with speech to text
           url: get-started-speech-to-text.md
+        - text: Try real-time diarization
+          url: get-started-stt-diarization.md
 - title: Develop with speech to text
   linkLists:
     - linkListType: how-to-guide
       links:
-        - text: Choose speech recognition mode
-          url: ./get-started-speech-to-text.md
-        - text: Improve accuracy with custom speech
-          url: ./custom-speech-overview.md
+        - text: Use the fast transcription API
+          url: fast-transcription-create.md
+        - text: Create a custom speech project
+          url: ./how-to-custom-speech-create-project.md
+        - text: Train a model for custom speech
+          url: how-to-custom-speech-train-model.md
         - text: Use compressed audio input formats
           url: how-to-use-codec-compressed-audio-input-streams.md
-        - text: Migrate from v3.0 to v3.1
-          url: migrate-v3-0-to-v3-1.md
     - linkListType: concept
       links:
-        - text: Training and testing datasets
-          url: how-to-custom-speech-test-and-train.md
-        - text: Train a model for custom speech
-          url: how-to-custom-speech-train-model.md
-        - text: Create human-labeled transcriptions
-          url: how-to-custom-speech-human-labeled-transcriptions.md
+        - text: Whisper model from OpenAI
+          url: whisper-overview.md
+        - text: Improve accuracy with custom speech
+          url: ./custom-speech-overview.md
+        - text: Display text formatting
+          url: display-text-format.md
 - title: Reference
   linkLists:
     - linkListType: reference
diff --git a/articles/ai-services/speech-service/index-text-to-speech.yml b/articles/ai-services/speech-service/index-text-to-speech.yml
@@ -9,7 +9,7 @@ metadata:
   manager: nitinme
   ms.service: azure-ai-speech
   ms.topic: landing-page
-  ms.date: 8/20/2024
+  ms.date: 9/20/2024
   ms.author: eur
 
 landingContent:
@@ -29,18 +29,20 @@ landingContent:
   linkLists:
     - linkListType: how-to-guide
       links:
-        - text: Improve synthesis with SSML
-          url: speech-synthesis-markup.md
         - text: Batch synthesis for long-form text
           url: batch-synthesis.md
-    - linkListType: concept
-      links:
-        - text: What is custom neural voice?
-          url: custom-neural-voice.md
         - text: Get started with custom voice
           url: professional-voice-create-project.md
         - text: Create and use custom voice models
           url: professional-voice-train-voice.md
+        - text: Create audio content in Speech Studio
+          url: how-to-audio-content-creation.md
+    - linkListType: concept
+      links:
+        - text: What is custom neural voice?
+          url: custom-neural-voice.md
+        - text: Improve synthesis with SSML
+          url: speech-synthesis-markup.md
 - title: Reference
   linkLists:
     - linkListType: reference
diff --git a/articles/ai-services/speech-service/ingestion-client.md b/articles/ai-services/speech-service/ingestion-client.md
@@ -6,8 +6,9 @@ author: eric-urban
 manager: nitinme
 ms.service: azure-ai-speech
 ms.topic: conceptual
-ms.date: 1/21/2024
+ms.date: 9/20/2024
 ms.author: eur
+#Customer intent: As a developer, I want to learn about the Ingestion Client tool that helps me quickly deploy a call center transcription solution to Azure with a no-code approach.
 ---
 
 # Ingestion Client with Azure AI services
diff --git a/articles/ai-services/speech-service/intent-recognition.md b/articles/ai-services/speech-service/intent-recognition.md
@@ -7,8 +7,8 @@ ms.author: eur
 manager: nitinme
 ms.service: azure-ai-speech
 ms.topic: overview
-ms.date: 1/21/2024
-keywords: intent recognition
+ms.date: 9/20/2024
+#Customer intent: As a developer, I want to learn about intent recognition and how to use it with the Speech service.
 ---
 
 # What is intent recognition?
diff --git a/articles/ai-services/speech-service/keyword-recognition-guidelines.md b/articles/ai-services/speech-service/keyword-recognition-guidelines.md
@@ -2,12 +2,14 @@
 title: Keyword recognition recommendations and guidelines - Speech service
 titleSuffix: Azure AI services
 description: An overview of recommendations and guidelines when using keyword recognition.
-author: hasyashah
+author: eric-urban
+ms.author: eur
 manager: nitinme
 ms.service: azure-ai-speech
 ms.topic: conceptual
-ms.date: 1/21/2024
-ms.author: hasshah
+ms.date: 9/20/2024
+ms.reviewer: hasshah
+#Customer intent: As a developer, I want to learn about recommendations and guidelines for keyword recognition with the Speech service.
 ---
 
 # Recommendations and guidelines for keyword recognition
diff --git a/articles/ai-services/speech-service/keyword-recognition-overview.md b/articles/ai-services/speech-service/keyword-recognition-overview.md
@@ -2,12 +2,14 @@
 title: Keyword recognition overview - Speech service
 titleSuffix: Azure AI services
 description: An overview of the features, capabilities, and restrictions for keyword recognition by using the Speech SDK.
-author: hasyashah
+author: eric-urban
+ms.author: eur
 manager: nitinme
 ms.service: azure-ai-speech
 ms.topic: overview
-ms.date: 1/21/2024
-ms.author: hasshah
+ms.date: 9/20/2024
+ms.reviewer: hasshah
+#Customer intent: As a developer, I want to learn about keyword recognition and how to use it with the Speech service.
 ---
 
 # What is keyword recognition?
@@ -26,7 +28,7 @@ The current system is designed with multiple stages that span the edge and cloud
 
 Accuracy of keyword recognition is measured via the following metrics:
 
-* **Correct accept rate**: Measures the system's ability to recognize the keyword when it's spoken by a user. The correct accept rate is also known as the true positive rate.
+* **Correct accept rate**: Measures the system's ability to recognize the keyword spoken by a user. The correct accept rate is also known as the true positive rate.
 * **False accept rate**: Measures the system's ability to filter out audio that isn't the keyword spoken by a user. The false accept rate is also known as the false positive rate.
 
 The goal is to maximize the correct accept rate while minimizing the false accept rate. The current system is designed to detect a keyword or phrase preceded by a short amount of silence. Detecting a keyword in the middle of a sentence or utterance isn't supported.
diff --git a/articles/ai-services/speech-service/language-identification.md b/articles/ai-services/speech-service/language-identification.md
@@ -7,7 +7,7 @@ manager: nitinme
 ms.service: azure-ai-speech
 ms.custom: devx-track-extended-java, devx-track-js, devx-track-python
 ms.topic: how-to
-ms.date: 02/08/2024
+ms.date: 9/20/2024
 ms.author: eur
 zone_pivot_groups: programming-languages-speech-services-nomore-variant
 #customer intent: As an application developer, I want to use language recognition or translations in order to make my apps work seamlessly for more customers.
@@ -33,9 +33,6 @@ Whether you use language identification with [speech to text](#use-speech-to-tex
 
 Then you make a [recognize once or continuous recognition](#recognize-once-or-continuous) request to the Speech service.
 
-> [!IMPORTANT]
-> Language Identification APIs are simplified with the Speech SDK version 1.25 and later. The `SpeechServiceConnection_SingleLanguageIdPriority` and `SpeechServiceConnection_ContinuousLanguageIdPriority` properties have been removed. A single property `SpeechServiceConnection_LanguageIdMode` replaces them. You no longer need to prioritize between low latency and high accuracy. For continuous speech recognition or translation, you only need to select whether to run at-start or continuous Language Identification.
-
 This article provides code snippets to describe the concepts. Links to complete samples for each use case are provided.
 
 ### Candidate languages
diff --git a/articles/ai-services/speech-service/language-learning-overview.md b/articles/ai-services/speech-service/language-learning-overview.md
@@ -1,13 +1,14 @@
 ---
 title: Language learning with Azure AI Speech
 titleSuffix: Azure AI services
-description: Azure AI services for Speech can be used to learn languages.
+description: Learn about how Azure AI Speech can be used to learn languages.
 author: eric-urban
 manager: nitinme
 ms.service: azure-ai-speech
 ms.topic: conceptual
-ms.date: 9/11/2024
+ms.date: 9/20/2024
 ms.author: eur
+#Customer intent: As a language learner, I want to learn how to use Azure AI Speech to improve my language skills.
 ---
 
 # Language learning with Azure AI Speech
@@ -25,7 +26,7 @@ The Pronunciation Assessment feature offers several benefits for educators, serv
 
 ## Speech to text 
 
-Azure [Speech to text](speech-to-text.md) supports real-time language identification for multilingual language learning scenarios, help human-human interaction with better understanding and readable context.
+[Speech to text](speech-to-text.md) supports real-time language identification for multilingual language learning scenarios, help human-human interaction with better understanding and readable context.
 
 ##  Text to speech
 
@@ -36,6 +37,6 @@ Azure [Speech to text](speech-to-text.md) supports real-time language identifica
 ## Next steps
 
 * [How to use pronunciation assessment](how-to-pronunciation-assessment.md)
-* [What is Speech to text](speech-to-text.md)
-* [What is Text to speech](text-to-speech.md)
+* [What is speech to text](speech-to-text.md)
+* [What is text to speech](text-to-speech.md)
 * [What is custom neural voice](custom-neural-voice.md)
diff --git a/articles/ai-services/speech-service/language-learning-with-pronunciation-assessment.md b/articles/ai-services/speech-service/language-learning-with-pronunciation-assessment.md
@@ -5,8 +5,9 @@ author: eric-urban
 manager: nitinme
 ms.service: azure-ai-speech
 ms.topic: how-to
-ms.date: 8/1/2024
+ms.date: 9/20/2024
 ms.author: eur
+#Customer intent: As a language learner, I want to learn how to use Azure AI Speech to improve my language skills.
 ---
 
 # Interactive language learning with pronunciation assessment
diff --git a/articles/ai-services/speech-service/language-support.md b/articles/ai-services/speech-service/language-support.md
@@ -6,9 +6,10 @@ author: eric-urban
 manager: nitinme
 ms.service: azure-ai-speech
 ms.topic: conceptual
-ms.date: 8/20/2024
+ms.date: 9/20/2024
 ms.author: eur
 ms.custom: references_regions, build-2024
+#Customer intent: As a developer, I want to learn about the languages supported by the Speech service.
 ---
 
 # Language and voice support for the Speech service
@@ -26,15 +27,15 @@ You can also get a list of locales and voices supported for each specific region
 Language support varies by Speech service functionality. 
 
 > [!NOTE]
-> See [Speech Containers](speech-container-overview.md#available-speech-containers) and [Embedded Speech](embedded-speech.md#models-and-voices) separately for their supported languages.
+> See [speech containers](speech-container-overview.md#available-speech-containers) and [embedded speech](embedded-speech.md#models-and-voices) documentation for their supported languages.
 
 **Choose a Speech feature**
 
 # [Speech to text](#tab/stt)
 
-The table in this section summarizes the locales supported for Speech to text. See the table footnotes for more details. 
+The table in this section summarizes the locales supported for speech to text. For details, see the table footnotes.
 
-More remarks for Speech to text locales are included in the [custom speech](#custom-speech) section of this article. 
+More remarks for speech to text locales are included in the [custom speech](#custom-speech) section of this article. 
 
 > [!TIP]
 > Try out the [real-time speech to text tool](https://speech.microsoft.com/portal/speechtotexttool) without having to use any code.
@@ -43,7 +44,7 @@ More remarks for Speech to text locales are included in the [custom speech](#cus
 
 ### Custom speech
 
-To improve Speech to text recognition accuracy, customization is available for some languages and base models. Depending on the locale, you can upload audio + human-labeled transcripts, plain text, structured text, and pronunciation data. By default, plain text customization is supported for all available base models. To learn more about customization, see [custom speech](./custom-speech-overview.md).
+To improve speech to text recognition accuracy, customization is available for some languages and base models. Depending on the locale, you can upload audio + human-labeled transcripts, plain text, structured text, and pronunciation data. By default, plain text customization is supported for all available base models. To learn more about customization, see [custom speech](./custom-speech-overview.md).
 
 These are the locales that support the [display text format feature](./how-to-custom-speech-display-text-format.md): da-DK, de-DE, en-AU, en-CA, en-GB, en-HK, en-IE, en-IN, en-NG, en-NZ, en-PH, en-SG, en-US, es-ES, es-MX, fi-FI, fr-CA, fr-FR, hi-IN, it-IT, ja-JP, ko-KR, nb-NO, nl-NL, pl-PL, pt-BR, pt-PT, sv-SE, tr-TR, zh-CN, zh-HK.
 
@@ -53,7 +54,7 @@ The supported locales for the [fast transcription API](fast-transcription-create
 
 # [Text to speech](#tab/tts)
 
-The table in this section summarizes the locales and voices supported for Text to speech. See the table footnotes for more details.
+The table in this section summarizes the locales and voices supported for text to speech. For details, see the table footnotes.
 
 More remarks for text to speech locales are included in the [voice styles and roles](#voice-styles-and-roles), [prebuilt neural voices](#prebuilt-neural-voices), [Custom neural voice](#custom-neural-voice), and [personal voice](#personal-voice) sections in this article. 
 
@@ -98,9 +99,9 @@ Each prebuilt neural voice model is available at 24kHz and high-fidelity 48kHz.
 
 Note that the following neural voices are retired.
 
-- The English (United Kingdom) voice `en-GB-MiaNeural` is retired on October 30, 2021. All service requests to `en-GB-MiaNeural` will be redirected to `en-GB-SoniaNeural` automatically as of October 30, 2021. If you're using container Neural TTS, [download](speech-container-ntts.md#get-the-container-image-with-docker-pull) and deploy the latest version. All requests with previous versions won't succeed starting from October 30, 2021.
+- The English (United Kingdom) voice `en-GB-MiaNeural` is retired on October 30, 2021. All service requests to `en-GB-MiaNeural` will be redirected to `en-GB-SoniaNeural` automatically as of October 30, 2021. If you're using containers for text to speech, [download](speech-container-ntts.md#get-the-container-image-with-docker-pull) and deploy the latest version. All requests with previous versions don't succeed starting from October 30, 2021.
 - The `en-US-JessaNeural` voice is retired and replaced by `en-US-AriaNeural`. If you were using "Jessa" before, convert  to "Aria."
-- The Chinese (Mandarin, Simplified) voice `zh-CN-XiaoxuanNeural` is retired on February 29, 2024. All service requests to `zh-CN-XiaoxuanNeural` will be redirected to `zh-CN-XiaoyiNeural` automatically as of February 29, 2024. If you're using container Neural TTS, [download](speech-container-ntts.md#get-the-container-image-with-docker-pull) and deploy the latest version. All requests with previous versions won't succeed starting from February 29, 2024.
+- The Chinese (Mandarin, Simplified) voice `zh-CN-XiaoxuanNeural` is retired on February 29, 2024. All service requests to `zh-CN-XiaoxuanNeural` will be redirected to `zh-CN-XiaoyiNeural` automatically as of February 29, 2024. If you're using containers for text to speech, [download](speech-container-ntts.md#get-the-container-image-with-docker-pull) and deploy the latest version. All requests with previous versions won't succeed starting from February 29, 2024.
 
 ### Custom neural voice
 
@@ -122,7 +123,7 @@ With the cross-lingual feature, you can transfer your custom neural voice model
 
 # [Pronunciation assessment](#tab/pronunciation-assessment)
 
-The table in this section summarizes the 33 locales supported for pronunciation assessment, and each language is available on all [Speech to text regions](regions.md#speech-service). Latest update extends support from English to 32 more languages and quality enhancements to existing features, including accuracy, fluency and miscue assessment. You should specify the language that you're learning or practicing improving pronunciation. The default language is set as `en-US`. If you know your target learning language, [set the locale](how-to-pronunciation-assessment.md#get-pronunciation-assessment-results) accordingly. For example, if you're learning British English, you should specify the language as `en-GB`. If you're teaching a broader language, such as Spanish, and are uncertain about which locale to select, you can run various accent models (`es-ES`, `es-MX`) to determine the one that achieves the highest score to suit your specific scenario. If you're interested in languages not listed in the following table, fill out this [intake form](https://aka.ms/speechpa/intake) for further assistance.
+The table in this section summarizes the 33 locales supported for pronunciation assessment, and each language is available on all [speech to text regions](regions.md#speech-service). Latest update extends support from English to 32 more languages and quality enhancements to existing features, including accuracy, fluency and miscue assessment. You should specify the language that you're learning or practicing improving pronunciation. The default language is set as `en-US`. If you know your target learning language, [set the locale](how-to-pronunciation-assessment.md#get-pronunciation-assessment-results) accordingly. For example, if you're learning British English, you should specify the language as `en-GB`. If you're teaching a broader language, such as Spanish, and are uncertain about which locale to select, you can run various accent models (`es-ES`, `es-MX`) to determine the one that achieves the highest score to suit your specific scenario. If you're interested in languages not listed in the following table, fill out this [intake form](https://aka.ms/speechpa/intake) for further assistance.
 
 [!INCLUDE [Language support include](includes/language-support/pronunciation-assessment.md)]
 
diff --git a/articles/ai-services/speech-service/logging-audio-transcription.md b/articles/ai-services/speech-service/logging-audio-transcription.md