Commit 928090b

update content assessment in PA
1 parent 4fb9e17 commit 928090b

1 file changed

+18
-18
lines changed

articles/ai-services/speech-service/how-to-pronunciation-assessment.md

Lines changed: 18 additions & 18 deletions
@@ -203,7 +203,7 @@ let recognizer = try! SPXSpeechRecognizer(speechConfiguration: speechConfig, lan
203203
> [!TIP]
204204
> If you aren't sure which locale to set for a language that has multiple locales, try each locale separately. For instance, for Spanish, try `es-ES` and `es-MX`. Determine which locale scores higher for your scenario.
205205
206-
You must create a `PronunciationAssessmentConfig` object. You can set `EnableProsodyAssessment` and `EnableContentAssessmentWithTopic` to enable prosody and content assessment. For more information, see [configuration methods](#configuration-methods).
206+
You must create a `PronunciationAssessmentConfig` object. You can set `EnableProsodyAssessment` to enable prosody assessment. For more information, see [configuration methods](#configuration-methods).
207207

208208
::: zone pivot="programming-language-csharp"
209209

@@ -214,19 +214,21 @@ var pronunciationAssessmentConfig = new PronunciationAssessmentConfig(
214214
granularity: Granularity.Phoneme,
215215
enableMiscue: false);
216216
pronunciationAssessmentConfig.EnableProsodyAssessment();
217-
pronunciationAssessmentConfig.EnableContentAssessmentWithTopic("greeting");
218217
```
219218

219+
We suggest using LLM tools with the recognized text for content assessment. Sample code for content assessment is available on [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/csharp/sharedcontent/console/speech_recognition_samples.cs) under the function `PronunciationAssessmentWithContentAssessment`.
220+
220221
::: zone-end
221222

222223
::: zone pivot="programming-language-cpp"
223224

224225
```cpp
225226
auto pronunciationConfig = PronunciationAssessmentConfig::Create("", PronunciationAssessmentGradingSystem::HundredMark, PronunciationAssessmentGranularity::Phoneme, false);
226227
pronunciationConfig->EnableProsodyAssessment();
227-
pronunciationConfig->EnableContentAssessmentWithTopic("greeting");
228228
```
229229

230+
We suggest using LLM tools with the recognized text for content assessment. Sample code for content assessment is available on [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/cpp/windows/console/samples/speech_recognition_samples.cpp) under the function `PronunciationAssessmentWithContentAssessment`.
231+
230232
::: zone-end
231233

232234
::: zone pivot="programming-language-java"
@@ -235,9 +237,10 @@ pronunciationConfig->EnableContentAssessmentWithTopic("greeting");
235237
PronunciationAssessmentConfig pronunciationConfig = new PronunciationAssessmentConfig("",
236238
PronunciationAssessmentGradingSystem.HundredMark, PronunciationAssessmentGranularity.Phoneme, false);
237239
pronunciationConfig.enableProsodyAssessment();
238-
pronunciationConfig.enableContentAssessmentWithTopic("greeting");
239240
```
240241

242+
We suggest using LLM tools with the recognized text for content assessment. Sample code for content assessment is available on [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/java/jre/console/src/com/microsoft/cognitiveservices/speech/samples/console/SpeechRecognitionSamples.java) under the function `pronunciationAssessmentWithContentAssessment`.
243+
241244
::: zone-end
242245

243246
::: zone pivot="programming-language-python"
@@ -252,6 +255,8 @@ pronunciation_config.enable_prosody_assessment()
252255
pronunciation_config.enable_content_assessment_with_topic("greeting")
253256
```
254257

258+
We suggest using LLM tools with the recognized text for content assessment. Sample code for content assessment is available on [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/python/console/speech_sample.py) under the function `pronunciation_assessment_with_content_assessment`.
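
As a rough illustration of that suggestion, the sketch below builds a content-assessment prompt from recognized text and parses a JSON reply. The helper names `build_content_assessment_prompt` and `parse_content_scores`, the prompt wording, and the JSON keys are assumptions made for illustration; they are not part of the Speech SDK or the linked sample. Sending the prompt to a model is left to whichever LLM client you use.

```python
import json


def build_content_assessment_prompt(recognized_text: str, topic: str) -> str:
    """Return a prompt asking an LLM to grade recognized speech text.

    Hypothetical helper: the prompt wording and the requested JSON keys
    are illustrative assumptions, not defined by the Speech SDK.
    """
    if not recognized_text.strip():
        raise ValueError("recognized_text must not be empty")

    # Describe the expected reply so it can be parsed with json.loads.
    reply_shape = json.dumps({"vocabulary": 0, "grammar": 0, "topic": 0})

    return (
        "You are grading a language learner's spoken response.\n"
        f"Topic: {topic}\n"
        f"Transcript: {recognized_text.strip()}\n"
        "Score vocabulary, grammar, and topic relevance from 0 to 100.\n"
        f"Reply with JSON only, shaped like: {reply_shape}"
    )


def parse_content_scores(reply: str) -> dict:
    """Parse the model's JSON reply and keep only the three expected scores."""
    data = json.loads(reply)
    return {key: float(data[key]) for key in ("vocabulary", "grammar", "topic")}
```

The recognized text would typically come from the recognition result of the assessed utterance; the three score names mirror the vocabulary, grammar, and topic dimensions that the removed content assessment reported.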
259+
255260
::: zone-end
256261

257262
::: zone pivot="programming-language-javascript"
@@ -263,9 +268,10 @@ var pronunciationAssessmentConfig = new sdk.PronunciationAssessmentConfig(
263268
granularity: sdk.PronunciationAssessmentGranularity.Phoneme,
264269
enableMiscue: false);
265270
pronunciationAssessmentConfig.enableProsodyAssessment();
266-
pronunciationAssessmentConfig.enableContentAssessmentWithTopic("greeting");
267271
```
268272

273+
We suggest using LLM tools with the recognized text for content assessment. Sample code for content assessment is available on [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/js/node/pronunciationAssessmentGetContentResult.js).
274+
269275
::: zone-end
270276

271277
::: zone pivot="programming-language-objectivec"
@@ -274,9 +280,10 @@ pronunciationAssessmentConfig.enableContentAssessmentWithTopic("greeting");
274280
SPXPronunciationAssessmentConfiguration *pronunicationConfig =
275281
[[SPXPronunciationAssessmentConfiguration alloc] init:@"" gradingSystem:SPXPronunciationAssessmentGradingSystem_HundredMark granularity:SPXPronunciationAssessmentGranularity_Phoneme enableMiscue:false];
276282
[pronunicationConfig enableProsodyAssessment];
277-
[pronunicationConfig enableContentAssessmentWithTopic:@"greeting"];
278283
```
279284
285+
We suggest using LLM tools with the recognized text for content assessment. Sample code for content assessment is available on [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/objective-c/ios/speech-samples/speech-samples/ViewController.m) under the function `pronunciationAssessWithContentAssessment`.
286+
280287
::: zone-end
281288
282289
::: zone pivot="programming-language-swift"
@@ -287,9 +294,10 @@ let pronAssessmentConfig = try! SPXPronunciationAssessmentConfiguration("",
287294
granularity: .phoneme,
288295
enableMiscue: false)
289296
pronAssessmentConfig.enableProsodyAssessment()
290-
pronAssessmentConfig.enableContentAssessment(withTopic: "greeting")
291297
```
292298

299+
We suggest using LLM tools with the recognized text for content assessment. Sample code for content assessment is available on [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/objective-c/ios/speech-samples/speech-samples/ViewController.m) under the function `pronunciationAssessmentWithContentAssessment`.
300+
293301
::: zone-end
294302

295303
::: zone pivot="programming-language-go"
@@ -311,16 +319,13 @@ This table lists some of the key configuration parameters for pronunciation asse
311319
This table lists some of the optional methods you can set for the `PronunciationAssessmentConfig` object.
312320

313321
> [!NOTE]
314-
> Content and prosody assessments are only available in the [en-US](./language-support.md?tabs=pronunciation-assessment) locale.
322+
> Prosody assessments are only available in the [en-US](./language-support.md?tabs=pronunciation-assessment) locale.
315323
>
316-
> To explore the content and prosody assessments, upgrade to the SDK version 1.35.0 or later.
317-
>
318-
> There is no length limit for the topic parameter.
324+
> To explore the prosody assessments, upgrade to the SDK version 1.35.0 or later.
319325
320326
| Method | Description |
321327
|-----------|-------------|
322328
| `EnableProsodyAssessment` | Enables prosody assessment for your pronunciation evaluation. This feature assesses aspects like stress, intonation, speaking speed, and rhythm. This feature provides insights into the naturalness and expressiveness of your speech.<br/><br/>Enabling prosody assessment is optional. If this method is called, the `ProsodyScore` result value is returned. |
323-
| `EnableContentAssessmentWithTopic` | Enables content assessment. A content assessment is part of the [unscripted assessment](#unscripted-assessment-results) for the speaking language learning scenario. By providing a description, you can enhance the assessment's understanding of the specific topic being spoken about. For example, in C# call `pronunciationAssessmentConfig.EnableContentAssessmentWithTopic("greeting");`. You can replace 'greeting' with your desired text to describe a topic. The description has no length limit and currently only supports the `en-US` locale. |
324329

325330
## Get pronunciation assessment results
326331

@@ -528,19 +533,14 @@ This table lists some of the key pronunciation assessment results for the script
528533

529534
This table lists some of the key pronunciation assessment results for the unscripted assessment, or speaking scenario.
530535

531-
`VocabularyScore`, `GrammarScore`, and `TopicScore` parameters roll up to the combined content assessment.
532-
533536
> [!NOTE]
534-
> Content and prosody assessments are only available in the [en-US](./language-support.md?tabs=pronunciation-assessment) locale.
537+
> Prosody assessments are only available in the [en-US](./language-support.md?tabs=pronunciation-assessment) locale.
535538
536539
| Response parameter | Description | Granularity |
537540
|:-------------------|:------------|:------------|
538541
| `AccuracyScore` | Pronunciation accuracy of the speech. Accuracy indicates how closely the phonemes match a native speaker's pronunciation. Syllable, word, and full text accuracy scores are aggregated from phoneme-level accuracy score, and refined with assessment objectives. | Phoneme level,<br>Syllable level (en-US only),<br>Word level,<br>Full Text level |
539542
| `FluencyScore` | Fluency of the given speech. Fluency indicates how closely the speech matches a native speaker's use of silent breaks between words. | Full Text level |
540543
| `ProsodyScore` | Prosody of the given speech. Prosody indicates how natural the given speech is, including stress, intonation, speaking speed, and rhythm. | Full Text level |
541-
| `VocabularyScore` | Proficiency in lexical usage. It evaluates the speaker's effective usage of words and their appropriateness within the given context to express ideas accurately, and the level of lexical complexity. | Full Text level |
542-
| `GrammarScore` | Correctness in using grammar and variety of sentence patterns. Lexical accuracy, grammatical accuracy, and diversity of sentence structures jointly elevate grammatical errors. | Full Text level|
543-
| `TopicScore` | Level of understanding and engagement with the topic, which provides insights into the speaker’s ability to express their thoughts and ideas effectively and the ability to engage with the topic. | Full Text level|
544544
| `PronScore` | Overall score of the pronunciation quality of the given speech. `PronScore` is calculated from `AccuracyScore`, `FluencyScore`, and `ProsodyScore` with weight, provided that `ProsodyScore` is available. If `ProsodyScore` isn't available, `PronScore` won't consider that score.| Full Text level |
545545
| `ErrorType` | A word is badly pronounced, improperly inserted with a break, or missing a break at punctuation. It also indicates whether a pronunciation is monotonically rising, falling, or flat on the utterance. Possible values are `None` for no error on this word, `Mispronunciation`, `UnexpectedBreak`, `MissingBreak`, and `Monotone`. | Word level |
546546

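The table describes `PronScore` as a weighted combination of `AccuracyScore`, `FluencyScore`, and `ProsodyScore`, with `ProsodyScore` left out when it isn't available. The sketch below shows only that shape of calculation. The weights are placeholder assumptions (the actual weights aren't stated here), and `combine_scores` is a hypothetical helper, not an SDK function.

```python
from typing import Optional

# Placeholder weights, assumed for illustration only; the service's real
# weighting is not documented in this section.
ASSUMED_WEIGHTS = {"accuracy": 0.5, "fluency": 0.3, "prosody": 0.2}


def combine_scores(accuracy: float, fluency: float,
                   prosody: Optional[float] = None) -> float:
    """Weighted average of the available scores (hypothetical helper).

    When prosody is None it is skipped and the remaining weights are
    renormalized so they still sum to 1.
    """
    scores = {"accuracy": accuracy, "fluency": fluency}
    if prosody is not None:
        scores["prosody"] = prosody

    total_weight = sum(ASSUMED_WEIGHTS[name] for name in scores)
    weighted_sum = sum(ASSUMED_WEIGHTS[name] * value
                       for name, value in scores.items())
    return weighted_sum / total_weight
```

Renormalizing the remaining weights is one reasonable reading of "won't consider that score"; the service may handle the missing value differently.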