You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/speech-service/how-to-pronunciation-assessment.md
+18-18Lines changed: 18 additions & 18 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -203,7 +203,7 @@ let recognizer = try! SPXSpeechRecognizer(speechConfiguration: speechConfig, lan
203
203
> [!TIP]
204
204
> If you aren't sure which locale to set for a language that has multiple locales, try each locale separately. For instance, for Spanish, try `es-ES` and `es-MX`. Determine which locale scores higher for your scenario.
205
205
206
-
You must create a `PronunciationAssessmentConfig` object. You can set `EnableProsodyAssessment`and `EnableContentAssessmentWithTopic`to enable prosody and content assessment. For more information, see [configuration methods](#configuration-methods).
206
+
You must create a `PronunciationAssessmentConfig` object. You can set `EnableProsodyAssessment` to enable prosody assessment. For more information, see [configuration methods](#configuration-methods).
207
207
208
208
::: zone pivot="programming-language-csharp"
209
209
@@ -214,19 +214,21 @@ var pronunciationAssessmentConfig = new PronunciationAssessmentConfig(
We suggest to use LLM tools with recognized text for content assessment. The sample code for content assessment can be found on [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/csharp/sharedcontent/console/speech_recognition_samples.cs) under the function `PronunciationAssessmentWithContentAssessment`.
220
+
220
221
::: zone-end
221
222
222
223
::: zone pivot="programming-language-cpp"
223
224
224
225
```cpp
225
226
auto pronunciationConfig = PronunciationAssessmentConfig::Create("", PronunciationAssessmentGradingSystem::HundredMark, PronunciationAssessmentGranularity::Phoneme, false);
We suggest to use LLM tools with recognized text for content assessment. The sample code for content assessment can be found on [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/cpp/windows/console/samples/speech_recognition_samples.cpp) under the function `PronunciationAssessmentWithContentAssessment`.
We suggest to use LLM tools with recognized text for content assessment. The sample code for content assessment can be found on [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/java/jre/console/src/com/microsoft/cognitiveservices/speech/samples/console/SpeechRecognitionSamples.java) under the function `pronunciationAssessmentWithContentAssessment`.
We suggest to use LLM tools with recognized text for content assessment. The sample code for content assessment can be found on [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/python/console/speech_sample.py) under the function `pronunciation_assessment_with_content_assessment`.
259
+
255
260
::: zone-end
256
261
257
262
::: zone pivot="programming-language-javascript"
@@ -263,9 +268,10 @@ var pronunciationAssessmentConfig = new sdk.PronunciationAssessmentConfig(
We suggest to use LLM tools with recognized text for content assessment. The sample code for content assessment can be found on [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/js/node/pronunciationAssessmentGetContentResult.js).
We suggest to use LLM tools with recognized text for content assessment. The sample code for content assessment can be found on [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/objective-c/ios/speech-samples/speech-samples/ViewController.m) under the function `pronunciationAssessWithContentAssessment`.
286
+
280
287
::: zone-end
281
288
282
289
::: zone pivot="programming-language-swift"
@@ -287,9 +294,10 @@ let pronAssessmentConfig = try! SPXPronunciationAssessmentConfiguration("",
We suggest to use LLM tools with recognized text for content assessment. The sample code for content assessment can be found on [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/objective-c/ios/speech-samples/speech-samples/ViewController.m) under the function `pronunciationAssessmentWithContentAssessment`.
300
+
293
301
::: zone-end
294
302
295
303
::: zone pivot="programming-language-go"
@@ -311,16 +319,13 @@ This table lists some of the key configuration parameters for pronunciation asse
311
319
This table lists some of the optional methods you can set for the `PronunciationAssessmentConfig` object.
312
320
313
321
> [!NOTE]
314
-
> Content and prosody assessments are only available in the [en-US](./language-support.md?tabs=pronunciation-assessment) locale.
322
+
> Prosody assessments are only available in the [en-US](./language-support.md?tabs=pronunciation-assessment) locale.
315
323
>
316
-
> To explore the content and prosody assessments, upgrade to the SDK version 1.35.0 or later.
317
-
>
318
-
> There is no length limit for the topic parameter.
324
+
> To explore the prosody assessments, upgrade to the SDK version 1.35.0 or later.
319
325
320
326
| Method | Description |
321
327
|-----------|-------------|
322
328
|`EnableProsodyAssessment`| Enables prosody assessment for your pronunciation evaluation. This feature assesses aspects like stress, intonation, speaking speed, and rhythm. This feature provides insights into the naturalness and expressiveness of your speech.<br/><br/>Enabling prosody assessment is optional. If this method is called, the `ProsodyScore` result value is returned. |
323
-
|`EnableContentAssessmentWithTopic`| Enables content assessment. A content assessment is part of the [unscripted assessment](#unscripted-assessment-results) for the speaking language learning scenario. By providing a description, you can enhance the assessment's understanding of the specific topic being spoken about. For example, in C# call `pronunciationAssessmentConfig.EnableContentAssessmentWithTopic("greeting");`. You can replace 'greeting' with your desired text to describe a topic. The description has no length limit and currently only supports the `en-US` locale. |
324
329
325
330
## Get pronunciation assessment results
326
331
@@ -528,19 +533,14 @@ This table lists some of the key pronunciation assessment results for the script
528
533
529
534
This table lists some of the key pronunciation assessment results for the unscripted assessment, or speaking scenario.
530
535
531
-
`VocabularyScore`, `GrammarScore`, and `TopicScore` parameters roll up to the combined content assessment.
532
-
533
536
> [!NOTE]
534
-
> Content and prosody assessments are only available in the [en-US](./language-support.md?tabs=pronunciation-assessment) locale.
537
+
> Prosody assessments are only available in the [en-US](./language-support.md?tabs=pronunciation-assessment) locale.
|`AccuracyScore`| Pronunciation accuracy of the speech. Accuracy indicates how closely the phonemes match a native speaker's pronunciation. Syllable, word, and full text accuracy scores are aggregated from phoneme-level accuracy score, and refined with assessment objectives. | Phoneme level,<br>Syllable level (en-US only),<br>Word level,<br>Full Text level |
539
542
|`FluencyScore`| Fluency of the given speech. Fluency indicates how closely the speech matches a native speaker's use of silent breaks between words. | Full Text level |
540
543
|`ProsodyScore`| Prosody of the given speech. Prosody indicates how natural the given speech is, including stress, intonation, speaking speed, and rhythm. | Full Text level |
541
-
|`VocabularyScore`| Proficiency in lexical usage. It evaluates the speaker's effective usage of words and their appropriateness within the given context to express ideas accurately, and the level of lexical complexity. | Full Text level |
542
-
|`GrammarScore`| Correctness in using grammar and variety of sentence patterns. Lexical accuracy, grammatical accuracy, and diversity of sentence structures jointly elevate grammatical errors. | Full Text level|
543
-
|`TopicScore`| Level of understanding and engagement with the topic, which provides insights into the speaker’s ability to express their thoughts and ideas effectively and the ability to engage with the topic. | Full Text level|
544
544
|`PronScore`| Overall score of the pronunciation quality of the given speech. `PronScore` is calculated from `AccuracyScore`, `FluencyScore`, and `ProsodyScore` with weight, provided that `ProsodyScore` is available. If `ProsodyScore` isn't available, `PronScore` won't consider that score.| Full Text level |
545
545
|`ErrorType`| A word is badly pronounced, improperly inserted with a break, or missing a break at punctuation. It also indicates whether a pronunciation is monotonically rising, falling, or flat on the utterance. Possible values are `None` for no error on this word, `Mispronunciation`, `UnexpectedBreak`, `MissingBreak`, and `Monotone`. | Word level |
0 commit comments