Commit 68e76cf

Merge pull request #227519 from jimxiei/jimxie/update_cts_nr
[CTS] Add NR result note and correct python sample code error.
2 parents 0db6a1b + 80103a7 commit 68e76cf

File tree

3 files changed: +8 -2 lines changed

articles/cognitive-services/Speech-Service/includes/how-to/conversation-transcription/real-time-csharp.md

Lines changed: 2 additions & 0 deletions
@@ -108,6 +108,8 @@ This sample code does the following:
 > [!NOTE]
 > `AudioStreamReader` is a helper class you can get on [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/quickstart/csharp/dotnet/conversation-transcription/helloworld/AudioStreamReader.cs).
+
+If speaker identification or differentiation is enabled, the service continues to evaluate results against the accumulated audio even after you have received `Transcribed` results. If the service finds that a previous result was assigned an incorrect `UserId`, it sends a nearly identical `Transcribed` result again, in which only the `UserId` and `UtteranceId` differ. Because the `UtteranceId` format is `{index}_{UserId}_{Offset}`, when you receive a `Transcribed` result you can use the `UtteranceId` to determine whether it corrects a previous one. Your client or UI logic can then decide how to respond, for example by overwriting the previous output or ignoring the latest result.
+
 Call the function `TranscribeConversationsAsync()` to start conversation transcription.

 ```csharp
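The correction check described in the note above can be sketched in code. A minimal Python sketch follows (the helper functions and the `seen` bookkeeping dict are illustrative, not part of the Speech SDK; the same `UtteranceId` parsing applies in any SDK language):

```python
# Hypothetical helpers for detecting corrected transcription results.
# The UtteranceId format is "{index}_{speakerId}_{Offset}".

def parse_utterance_id(utterance_id):
    """Split "{index}_{speakerId}_{Offset}" into its three parts.

    In case the speaker ID contains separators of its own, split the
    index off the front and the offset off the back.
    """
    index, rest = utterance_id.split("_", 1)
    speaker_id, offset = rest.rsplit("_", 1)
    return int(index), speaker_id, int(offset)

def is_correction(utterance_id, seen):
    """Return True if this result revises an utterance already displayed.

    `seen` maps utterance index -> speaker attribution shown so far.
    """
    index, speaker_id, _offset = parse_utterance_id(utterance_id)
    corrected = index in seen and seen[index] != speaker_id
    seen[index] = speaker_id  # remember the latest attribution
    return corrected

seen = {}
print(is_correction("0_Guest-1_1700000", seen))  # False: first result for index 0
print(is_correction("0_Guest-2_1700000", seen))  # True: same index, new speaker
```

When the same index arrives with a different speaker attribution, the client can treat it as a correction and either overwrite the earlier output or ignore the new result, as the note suggests.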
articles/cognitive-services/Speech-Service/includes/how-to/conversation-transcription/real-time-javascript.md

Lines changed: 2 additions & 0 deletions
@@ -74,6 +74,8 @@ This sample code does the following:
 * Registers to events and begins transcription.
 * If you want to differentiate speakers without providing voice samples, please enable the `DifferentiateGuestSpeakers` feature as in [Conversation Transcription Overview](../../../conversation-transcription.md).
+
+If speaker identification or differentiation is enabled, the service continues to evaluate results against the accumulated audio even after you have received `transcribed` results. If the service finds that a previous result was assigned an incorrect `speakerId`, it sends a nearly identical `transcribed` result again, in which only the `speakerId` and `UtteranceId` differ. Because the `UtteranceId` format is `{index}_{speakerId}_{Offset}`, when you receive a `transcribed` result you can use the `UtteranceId` to determine whether it corrects a previous one. Your client or UI logic can then decide how to respond, for example by overwriting the previous output or ignoring the latest result.
+
 ```javascript
 (function() {
     "use strict";

articles/cognitive-services/Speech-Service/includes/how-to/conversation-transcription/real-time-python.md

Lines changed: 4 additions & 2 deletions
@@ -75,6 +75,8 @@ This sample code does the following:
 * Reads the whole wave file at once, streams it to the SDK, and begins transcription.
 * If you want to differentiate speakers without providing voice samples, please enable the `DifferentiateGuestSpeakers` feature as in [Conversation Transcription Overview](../../../conversation-transcription.md).
+
+If speaker identification or differentiation is enabled, the service continues to evaluate results against the accumulated audio even after you have received `transcribed` results. If the service finds that a previous result was assigned an incorrect `speakerId`, it sends a nearly identical `transcribed` result again, in which only the `speakerId` and `UtteranceId` differ. Because the `UtteranceId` format is `{index}_{speakerId}_{Offset}`, when you receive a `transcribed` result you can use the `UtteranceId` to determine whether it corrects a previous one. Your client or UI logic can then decide how to respond, for example by overwriting the previous output or ignoring the latest result.
+
 ```python
 import azure.cognitiveservices.speech as speechsdk
 import time
@@ -123,8 +125,8 @@ def conversation_transcription_differentiate_speakers():
     user1 = speechsdk.transcription.Participant("[email protected]", "en-us", voice_signature_user1)
     user2 = speechsdk.transcription.Participant("[email protected]", "en-us", voice_signature_user2)

-    conversation.add_participant_async(user1)
-    conversation.add_participant_async(user2)
+    conversation.add_participant_async(user1).get()
+    conversation.add_participant_async(user2).get()
     transcriber.join_conversation_async(conversation).get()
     transcriber.start_transcribing_async()
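The Python fix above appends `.get()` so the client blocks until each participant is actually added before joining the conversation and starting transcription; fired-and-forgotten async calls could otherwise race with the start of transcription. The general pattern can be sketched with the standard library's futures as a stand-in for the SDK's async calls (an illustrative analogy, not the Speech SDK API):

```python
# Sketch of why blocking on the async result matters: submit two
# "add participant" operations, then wait for both to complete before
# the next step. Future.result() plays the role the SDK's .get() plays.
from concurrent.futures import ThreadPoolExecutor
import time

participants = []

def add_participant(name):
    time.sleep(0.05)          # simulate the service round-trip
    participants.append(name)
    return name

with ThreadPoolExecutor() as pool:
    f1 = pool.submit(add_participant, "user1")
    f2 = pool.submit(add_participant, "user2")
    # Without blocking here, the code after this point could run before
    # either participant is registered.
    f1.result()
    f2.result()
    print(sorted(participants))  # ['user1', 'user2']
```

Both participants are guaranteed to be registered once the two `result()` calls return, which is exactly the ordering the `.get()` fix restores in the sample.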
