Skip to content

Commit 352afa1

Browse files
Merge pull request #3369 from yaden-ms/yaden/update_PA_doc
pronunciation assessment update
2 parents 06facf0 + d89512f commit 352afa1

File tree

1 file changed

+4
-18
lines changed

1 file changed

+4
-18
lines changed

articles/ai-services/speech-service/how-to-pronunciation-assessment.md

Lines changed: 4 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -203,7 +203,7 @@ let recognizer = try! SPXSpeechRecognizer(speechConfiguration: speechConfig, lan
203203
> [!TIP]
204204
> If you aren't sure which locale to set for a language that has multiple locales, try each locale separately. For instance, for Spanish, try `es-ES` and `es-MX`. Determine which locale scores higher for your scenario.
205205
206-
You must create a `PronunciationAssessmentConfig` object. You can set `EnableProsodyAssessment` and `EnableContentAssessmentWithTopic` to enable prosody and content assessment. For more information, see [configuration methods](#configuration-methods).
206+
You must create a `PronunciationAssessmentConfig` object. You can set `EnableProsodyAssessment` to enable prosody assessment. For more information, see [configuration methods](#configuration-methods).
207207

208208
::: zone pivot="programming-language-csharp"
209209

@@ -214,7 +214,6 @@ var pronunciationAssessmentConfig = new PronunciationAssessmentConfig(
214214
granularity: Granularity.Phoneme,
215215
enableMiscue: false);
216216
pronunciationAssessmentConfig.EnableProsodyAssessment();
217-
pronunciationAssessmentConfig.EnableContentAssessmentWithTopic("greeting");
218217
```
219218

220219
::: zone-end
@@ -224,7 +223,6 @@ pronunciationAssessmentConfig.EnableContentAssessmentWithTopic("greeting");
224223
```cpp
225224
auto pronunciationConfig = PronunciationAssessmentConfig::Create("", PronunciationAssessmentGradingSystem::HundredMark, PronunciationAssessmentGranularity::Phoneme, false);
226225
pronunciationConfig->EnableProsodyAssessment();
227-
pronunciationConfig->EnableContentAssessmentWithTopic("greeting");
228226
```
229227

230228
::: zone-end
@@ -235,7 +233,6 @@ pronunciationConfig->EnableContentAssessmentWithTopic("greeting");
235233
PronunciationAssessmentConfig pronunciationConfig = new PronunciationAssessmentConfig("",
236234
PronunciationAssessmentGradingSystem.HundredMark, PronunciationAssessmentGranularity.Phoneme, false);
237235
pronunciationConfig.enableProsodyAssessment();
238-
pronunciationConfig.enableContentAssessmentWithTopic("greeting");
239236
```
240237

241238
::: zone-end
@@ -263,7 +260,6 @@ var pronunciationAssessmentConfig = new sdk.PronunciationAssessmentConfig(
263260
granularity: sdk.PronunciationAssessmentGranularity.Phoneme,
264261
enableMiscue: false);
265262
pronunciationAssessmentConfig.enableProsodyAssessment();
266-
pronunciationAssessmentConfig.enableContentAssessmentWithTopic("greeting");
267263
```
268264

269265
::: zone-end
@@ -274,7 +270,6 @@ pronunciationAssessmentConfig.enableContentAssessmentWithTopic("greeting");
274270
SPXPronunciationAssessmentConfiguration *pronunicationConfig =
275271
[[SPXPronunciationAssessmentConfiguration alloc] init:@"" gradingSystem:SPXPronunciationAssessmentGradingSystem_HundredMark granularity:SPXPronunciationAssessmentGranularity_Phoneme enableMiscue:false];
276272
[pronunicationConfig enableProsodyAssessment];
277-
[pronunicationConfig enableContentAssessmentWithTopic:@"greeting"];
278273
```
279274
280275
::: zone-end
@@ -287,7 +282,6 @@ let pronAssessmentConfig = try! SPXPronunciationAssessmentConfiguration("",
287282
granularity: .phoneme,
288283
enableMiscue: false)
289284
pronAssessmentConfig.enableProsodyAssessment()
290-
pronAssessmentConfig.enableContentAssessment(withTopic: "greeting")
291285
```
292286

293287
::: zone-end
@@ -311,16 +305,13 @@ This table lists some of the key configuration parameters for pronunciation asse
311305
This table lists some of the optional methods you can set for the `PronunciationAssessmentConfig` object.
312306

313307
> [!NOTE]
314-
> Content and prosody assessments are only available in the [en-US](./language-support.md?tabs=pronunciation-assessment) locale.
308+
> Prosody assessment is only available in the [en-US](./language-support.md?tabs=pronunciation-assessment) locale.
315309
>
316-
> To explore the content and prosody assessments, upgrade to the SDK version 1.35.0 or later.
317-
>
318-
> There is no length limit for the topic parameter.
310+
> To explore the prosody assessment, upgrade to the SDK version 1.35.0 or later.
319311
320312
| Method | Description |
321313
|-----------|-------------|
322314
| `EnableProsodyAssessment` | Enables prosody assessment for your pronunciation evaluation. This feature assesses aspects like stress, intonation, speaking speed, and rhythm. This feature provides insights into the naturalness and expressiveness of your speech.<br/><br/>Enabling prosody assessment is optional. If this method is called, the `ProsodyScore` result value is returned. |
323-
| `EnableContentAssessmentWithTopic` | Enables content assessment. A content assessment is part of the [unscripted assessment](#unscripted-assessment-results) for the speaking language learning scenario. By providing a description, you can enhance the assessment's understanding of the specific topic being spoken about. For example, in C# call `pronunciationAssessmentConfig.EnableContentAssessmentWithTopic("greeting");`. You can replace 'greeting' with your desired text to describe a topic. The description has no length limit and currently only supports the `en-US` locale. |
324315

325316
## Get pronunciation assessment results
326317

@@ -528,19 +519,14 @@ This table lists some of the key pronunciation assessment results for the script
528519

529520
This table lists some of the key pronunciation assessment results for the unscripted assessment, or speaking scenario.
530521

531-
`VocabularyScore`, `GrammarScore`, and `TopicScore` parameters roll up to the combined content assessment.
532-
533522
> [!NOTE]
534-
> Content and prosody assessments are only available in the [en-US](./language-support.md?tabs=pronunciation-assessment) locale.
523+
> Prosody assessment is only available in the [en-US](./language-support.md?tabs=pronunciation-assessment) locale.
535524
536525
| Response parameter | Description | Granularity |
537526
|:-------------------|:------------|:------------|
538527
| `AccuracyScore` | Pronunciation accuracy of the speech. Accuracy indicates how closely the phonemes match a native speaker's pronunciation. Syllable, word, and full text accuracy scores are aggregated from phoneme-level accuracy score, and refined with assessment objectives. | Phoneme level,<br>Syllable level (en-US only),<br>Word level,<br>Full Text level |
539528
| `FluencyScore` | Fluency of the given speech. Fluency indicates how closely the speech matches a native speaker's use of silent breaks between words. | Full Text level |
540529
| `ProsodyScore` | Prosody of the given speech. Prosody indicates how natural the given speech is, including stress, intonation, speaking speed, and rhythm. | Full Text level |
541-
| `VocabularyScore` | Proficiency in lexical usage. It evaluates the speaker's effective usage of words and their appropriateness within the given context to express ideas accurately, and the level of lexical complexity. | Full Text level |
542-
| `GrammarScore` | Correctness in using grammar and variety of sentence patterns. Lexical accuracy, grammatical accuracy, and diversity of sentence structures jointly elevate grammatical errors. | Full Text level|
543-
| `TopicScore` | Level of understanding and engagement with the topic, which provides insights into the speaker’s ability to express their thoughts and ideas effectively and the ability to engage with the topic. | Full Text level|
544530
| `PronScore` | Overall score of the pronunciation quality of the given speech. `PronScore` is calculated from `AccuracyScore`, `FluencyScore`, and `ProsodyScore` with weight, provided that `ProsodyScore` is available. If `ProsodyScore` isn't available, `PronScore` won't consider that score.| Full Text level |
545531
| `ErrorType` | A word is badly pronounced, improperly inserted with a break, or missing a break at punctuation. It also indicates whether a pronunciation is monotonically rising, falling, or flat on the utterance. Possible values are `None` for no error on this word, `Mispronunciation`, `UnexpectedBreak`, `MissingBreak`, and `Monotone`. | Word level |
546532

0 commit comments

Comments
 (0)