Skip to content

Commit 6df0539

Browse files
authored
Merge pull request #406 from eric-urban/eur/speech-refresh-3
refresh speech docs
2 parents 0616c72 + 6ce1d34 commit 6df0539

12 files changed

+88
-75
lines changed

articles/ai-services/speech-service/improve-accuracy-phrase-list.md

Lines changed: 7 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,21 +1,21 @@
11
---
22
title: Improve recognition accuracy with phrase list
33
description: Phrase lists can be used to customize speech recognition results based on context.
4-
author: ut-karsh
5-
ms.author: umaheshwari
4+
author: eric-urban
5+
ms.author: eur
6+
ms.reviewer: umaheshwari
67
ms.service: azure-ai-speech
78
ms.custom: devx-track-extended-java, devx-track-js, devx-track-python
89
ms.topic: how-to
9-
ms.date: 1/21/2024
10+
ms.date: 9/20/2024
1011
zone_pivot_groups: programming-languages-set-two-with-js-spx
12+
#Customer intent: As a developer using speech to text, I want to learn how to improve recognition accuracy with phrase list.
1113
---
1214

1315
# Improve recognition accuracy with phrase list
1416

1517
A phrase list is a list of words or phrases provided ahead of time to help improve their recognition. Adding a phrase to a phrase list increases its importance, thus making it more likely to be recognized.
1618

17-
For supported phrase list locales, see [Language and voice support for the Speech service](language-support.md?tabs=phraselist).
18-
1919
Examples of phrases include:
2020
* Names
2121
* Geographical locations
@@ -26,6 +26,8 @@ Phrase lists are simple and lightweight:
2626
- **Just-in-time**: A phrase list is provided just before starting the speech recognition, eliminating the need to train a custom model.
2727
- **Lightweight**: You don't need a large data set. Provide a word or phrase to boost its recognition.
2828

29+
For supported phrase list locales, see [Language and voice support for the Speech service](language-support.md?tabs=phraselist).
30+
2931
You can use phrase lists with the [Speech Studio](speech-studio-overview.md), [Speech SDK](quickstarts/setup-platform.md), or [Speech Command Line Interface (CLI)](spx-overview.md). The [Batch transcription API](batch-transcription.md) doesn't support phrase lists.
3032

3133
You can use phrase lists with both standard and [custom speech](custom-speech-overview.md). There are some situations where training a custom model that includes phrases is likely the best option to improve accuracy. For example, in the following cases you would use custom speech:

articles/ai-services/speech-service/index-speech-to-text.yml

Lines changed: 15 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ metadata:
99
manager: nitinme
1010
ms.service: azure-ai-speech
1111
ms.topic: landing-page
12-
ms.date: 8/20/2024
12+
ms.date: 9/20/2024
1313
ms.author: eur
1414

1515
landingContent:
@@ -29,26 +29,28 @@ landingContent:
2929
links:
3030
- text: Get started with speech to text
3131
url: get-started-speech-to-text.md
32+
- text: Try real-time diarization
33+
url: get-started-stt-diarization.md
3234
- title: Develop with speech to text
3335
linkLists:
3436
- linkListType: how-to-guide
3537
links:
36-
- text: Choose speech recognition mode
37-
url: ./get-started-speech-to-text.md
38-
- text: Improve accuracy with custom speech
39-
url: ./custom-speech-overview.md
38+
- text: Use the fast transcription API
39+
url: fast-transcription-create.md
40+
- text: Create a custom speech project
41+
url: ./how-to-custom-speech-create-project.md
42+
- text: Train a model for custom speech
43+
url: how-to-custom-speech-train-model.md
4044
- text: Use compressed audio input formats
4145
url: how-to-use-codec-compressed-audio-input-streams.md
42-
- text: Migrate from v3.0 to v3.1
43-
url: migrate-v3-0-to-v3-1.md
4446
- linkListType: concept
4547
links:
46-
- text: Training and testing datasets
47-
url: how-to-custom-speech-test-and-train.md
48-
- text: Train a model for custom speech
49-
url: how-to-custom-speech-train-model.md
50-
- text: Create human-labeled transcriptions
51-
url: how-to-custom-speech-human-labeled-transcriptions.md
48+
- text: Whisper model from OpenAI
49+
url: whisper-overview.md
50+
- text: Improve accuracy with custom speech
51+
url: ./custom-speech-overview.md
52+
- text: Display text formatting
53+
url: display-text-format.md
5254
- title: Reference
5355
linkLists:
5456
- linkListType: reference

articles/ai-services/speech-service/index-text-to-speech.yml

Lines changed: 9 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ metadata:
99
manager: nitinme
1010
ms.service: azure-ai-speech
1111
ms.topic: landing-page
12-
ms.date: 8/20/2024
12+
ms.date: 9/20/2024
1313
ms.author: eur
1414

1515
landingContent:
@@ -29,18 +29,20 @@ landingContent:
2929
linkLists:
3030
- linkListType: how-to-guide
3131
links:
32-
- text: Improve synthesis with SSML
33-
url: speech-synthesis-markup.md
3432
- text: Batch synthesis for long-form text
3533
url: batch-synthesis.md
36-
- linkListType: concept
37-
links:
38-
- text: What is custom neural voice?
39-
url: custom-neural-voice.md
4034
- text: Get started with custom voice
4135
url: professional-voice-create-project.md
4236
- text: Create and use custom voice models
4337
url: professional-voice-train-voice.md
38+
- text: Create audio content in Speech Studio
39+
url: how-to-audio-content-creation.md
40+
- linkListType: concept
41+
links:
42+
- text: What is custom neural voice?
43+
url: custom-neural-voice.md
44+
- text: Improve synthesis with SSML
45+
url: speech-synthesis-markup.md
4446
- title: Reference
4547
linkLists:
4648
- linkListType: reference

articles/ai-services/speech-service/ingestion-client.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,8 +6,9 @@ author: eric-urban
66
manager: nitinme
77
ms.service: azure-ai-speech
88
ms.topic: conceptual
9-
ms.date: 1/21/2024
9+
ms.date: 9/20/2024
1010
ms.author: eur
11+
#Customer intent: As a developer, I want to learn about the Ingestion Client tool that helps me quickly deploy a call center transcription solution to Azure with a no-code approach.
1112
---
1213

1314
# Ingestion Client with Azure AI services

articles/ai-services/speech-service/intent-recognition.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -7,8 +7,8 @@ ms.author: eur
77
manager: nitinme
88
ms.service: azure-ai-speech
99
ms.topic: overview
10-
ms.date: 1/21/2024
11-
keywords: intent recognition
10+
ms.date: 9/20/2024
11+
#Customer intent: As a developer, I want to learn about intent recognition and how to use it with the Speech service.
1212
---
1313

1414
# What is intent recognition?

articles/ai-services/speech-service/keyword-recognition-guidelines.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -2,12 +2,14 @@
22
title: Keyword recognition recommendations and guidelines - Speech service
33
titleSuffix: Azure AI services
44
description: An overview of recommendations and guidelines when using keyword recognition.
5-
author: hasyashah
5+
author: eric-urban
6+
ms.author: eur
67
manager: nitinme
78
ms.service: azure-ai-speech
89
ms.topic: conceptual
9-
ms.date: 1/21/2024
10-
ms.author: hasshah
10+
ms.date: 9/20/2024
11+
ms.reviewer: hasshah
12+
#Customer intent: As a developer, I want to learn about recommendations and guidelines for keyword recognition with the Speech service.
1113
---
1214

1315
# Recommendations and guidelines for keyword recognition

articles/ai-services/speech-service/keyword-recognition-overview.md

Lines changed: 6 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -2,12 +2,14 @@
22
title: Keyword recognition overview - Speech service
33
titleSuffix: Azure AI services
44
description: An overview of the features, capabilities, and restrictions for keyword recognition by using the Speech SDK.
5-
author: hasyashah
5+
author: eric-urban
6+
ms.author: eur
67
manager: nitinme
78
ms.service: azure-ai-speech
89
ms.topic: overview
9-
ms.date: 1/21/2024
10-
ms.author: hasshah
10+
ms.date: 9/20/2024
11+
ms.reviewer: hasshah
12+
#Customer intent: As a developer, I want to learn about keyword recognition and how to use it with the Speech service.
1113
---
1214

1315
# What is keyword recognition?
@@ -26,7 +28,7 @@ The current system is designed with multiple stages that span the edge and cloud
2628

2729
Accuracy of keyword recognition is measured via the following metrics:
2830

29-
* **Correct accept rate**: Measures the system's ability to recognize the keyword when it's spoken by a user. The correct accept rate is also known as the true positive rate.
31+
* **Correct accept rate**: Measures the system's ability to recognize the keyword spoken by a user. The correct accept rate is also known as the true positive rate.
3032
* **False accept rate**: Measures the system's ability to filter out audio that isn't the keyword spoken by a user. The false accept rate is also known as the false positive rate.
3133

3234
The goal is to maximize the correct accept rate while minimizing the false accept rate. The current system is designed to detect a keyword or phrase preceded by a short amount of silence. Detecting a keyword in the middle of a sentence or utterance isn't supported.

articles/ai-services/speech-service/language-identification.md

Lines changed: 1 addition & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ manager: nitinme
77
ms.service: azure-ai-speech
88
ms.custom: devx-track-extended-java, devx-track-js, devx-track-python
99
ms.topic: how-to
10-
ms.date: 02/08/2024
10+
ms.date: 9/20/2024
1111
ms.author: eur
1212
zone_pivot_groups: programming-languages-speech-services-nomore-variant
1313
#customer intent: As an application developer, I want to use language recognition or translations in order to make my apps work seamlessly for more customers.
@@ -33,9 +33,6 @@ Whether you use language identification with [speech to text](#use-speech-to-tex
3333

3434
Then you make a [recognize once or continuous recognition](#recognize-once-or-continuous) request to the Speech service.
3535

36-
> [!IMPORTANT]
37-
> Language Identification APIs are simplified with the Speech SDK version 1.25 and later. The `SpeechServiceConnection_SingleLanguageIdPriority` and `SpeechServiceConnection_ContinuousLanguageIdPriority` properties have been removed. A single property `SpeechServiceConnection_LanguageIdMode` replaces them. You no longer need to prioritize between low latency and high accuracy. For continuous speech recognition or translation, you only need to select whether to run at-start or continuous Language Identification.
38-
3936
This article provides code snippets to describe the concepts. Links to complete samples for each use case are provided.
4037

4138
### Candidate languages

articles/ai-services/speech-service/language-learning-overview.md

Lines changed: 6 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,13 +1,14 @@
11
---
22
title: Language learning with Azure AI Speech
33
titleSuffix: Azure AI services
4-
description: Azure AI services for Speech can be used to learn languages.
4+
description: Learn about how Azure AI Speech can be used to learn languages.
55
author: eric-urban
66
manager: nitinme
77
ms.service: azure-ai-speech
88
ms.topic: conceptual
9-
ms.date: 9/11/2024
9+
ms.date: 9/20/2024
1010
ms.author: eur
11+
#Customer intent: As a language learner, I want to learn how to use Azure AI Speech to improve my language skills.
1112
---
1213

1314
# Language learning with Azure AI Speech
@@ -25,7 +26,7 @@ The Pronunciation Assessment feature offers several benefits for educators, serv
2526

2627
## Speech to text
2728

28-
Azure [Speech to text](speech-to-text.md) supports real-time language identification for multilingual language learning scenarios, help human-human interaction with better understanding and readable context.
29+
[Speech to text](speech-to-text.md) supports real-time language identification for multilingual language learning scenarios, help human-human interaction with better understanding and readable context.
2930

3031
## Text to speech
3132

@@ -36,6 +37,6 @@ Azure [Speech to text](speech-to-text.md) supports real-time language identifica
3637
## Next steps
3738

3839
* [How to use pronunciation assessment](how-to-pronunciation-assessment.md)
39-
* [What is Speech to text](speech-to-text.md)
40-
* [What is Text to speech](text-to-speech.md)
40+
* [What is speech to text](speech-to-text.md)
41+
* [What is text to speech](text-to-speech.md)
4142
* [What is custom neural voice](custom-neural-voice.md)

articles/ai-services/speech-service/language-learning-with-pronunciation-assessment.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,8 +5,9 @@ author: eric-urban
55
manager: nitinme
66
ms.service: azure-ai-speech
77
ms.topic: how-to
8-
ms.date: 8/1/2024
8+
ms.date: 9/20/2024
99
ms.author: eur
10+
#Customer intent: As a language learner, I want to learn how to use Azure AI Speech to improve my language skills.
1011
---
1112

1213
# Interactive language learning with pronunciation assessment

0 commit comments

Comments
 (0)