Commit 1d7350f

Merge pull request #213036 from eric-urban/eur/call-center-qs

Clarify language flexibility and output details

2 parents bafaa3b + 1d12853

5 files changed: +174 −50 lines changed
articles/cognitive-services/Speech-Service/includes/quickstarts/call-center/azure-prerequisites.md

Lines changed: 4 additions & 2 deletions
@@ -2,7 +2,7 @@
 author: eric-urban
 ms.service: cognitive-services
 ms.subservice: speech-service
-ms.date: 06/30/2022
+ms.date: 09/29/2022
 ms.topic: include
 ms.author: eur
 ---
@@ -13,4 +13,6 @@ ms.author: eur
 > * Get the resource key and region. After your Cognitive Services resource is deployed, select **Go to resource** to view and manage keys. For more information about Cognitive Services resources, see [Get the keys for your resource](~/articles/cognitive-services/cognitive-services-apis-create-account.md#get-the-keys-for-your-resource).
 
 > [!IMPORTANT]
-> This quickstart requires access to [conversation summarization](/azure/cognitive-services/language-service/summarization/how-to/conversation-summarization). To get access, you must submit an [online request](https://aka.ms/applyforconversationsummarization/) and have it approved.
+> This quickstart requires access to [conversation summarization](/azure/cognitive-services/language-service/summarization/how-to/conversation-summarization). To get access, you must submit an [online request](https://aka.ms/applyforconversationsummarization/) and have it approved.
+>
+> The `--languageKey` and `--languageEndpoint` values in this quickstart must correspond to a resource that's in one of the regions supported by the [conversation summarization API](https://aka.ms/convsumregions).

articles/cognitive-services/Speech-Service/includes/quickstarts/call-center/csharp.md

Lines changed: 12 additions & 44 deletions
@@ -31,58 +31,26 @@ Follow these steps to run post-call transcription analysis from an audio file.
    ```dotnetcli
    dotnet build
    ```
-1. Run the application with your preferred command line arguments. See [usage and arguments](#usage-and-arguments) for the available options. Here is an example:
+1. Run the application with your preferred command line arguments. See [usage and arguments](#usage-and-arguments) for the available options.
+
+   Here's an example that transcribes from an example audio file at [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/raw/master/scenarios/call-center/sampledata/Call1_separated_16k_health_insurance.wav):
    ```dotnetcli
-   dotnet run --languageKey YourResourceKey --languageEndpoint YourResourceEndpoint --speechKey YourResourceKey --speechRegion YourResourceRegion --input "https://github.com/Azure-Samples/cognitive-services-speech-sdk/raw/master/scenarios/call-center/sampledata/Call1_separated_16k_health_insurance.wav" --stereo --output summary.txt
+   dotnet run --languageKey YourResourceKey --languageEndpoint YourResourceEndpoint --speechKey YourResourceKey --speechRegion YourResourceRegion --input "https://github.com/Azure-Samples/cognitive-services-speech-sdk/raw/master/scenarios/call-center/sampledata/Call1_separated_16k_health_insurance.wav" --stereo --output summary.json
    ```
+
+   If you already have a transcription for input, here's an example that only requires a Language resource:
+   ```dotnetcli
+   dotnet run --languageKey YourResourceKey --languageEndpoint YourResourceEndpoint --jsonInput "YourTranscriptionFile.json" --stereo --output summary.json
+   ```
+
    Replace `YourResourceKey` with your Cognitive Services resource key, replace `YourResourceRegion` with your Cognitive Services resource [region](~/articles/cognitive-services/speech-service/regions.md) (such as `eastus`), and replace `YourResourceEndpoint` with your Cognitive Services endpoint. Make sure that the paths specified by `--input` and `--output` are valid. Otherwise you must change the paths.
-
    > [!IMPORTANT]
    > Remember to remove the key from your code when you're done, and never post it publicly. For production, use a secure way of storing and accessing your credentials like [Azure Key Vault](../../../../../key-vault/general/overview.md). See the Cognitive Services [security](../../../../cognitive-services-security.md) article for more information.
 
+
 ## Check results
 
-The console output shows the full conversation and summary. Here's an example of the overall summary:
-
-```output
-Conversation summary:
-Issue: Customer wants to sign up for insurance.
-Resolution: Helped customer to sign up for insurance.
-```
-
-If you specify `--output FILE`, a JSON version of the results are written to the file. The file output is a combination of the JSON responses from the [batch transcription](/azure/cognitive-services/speech-service/batch-transcription) (Speech), [sentiment](/azure/cognitive-services/language-service/sentiment-opinion-mining/overview) (Language), and [conversation summarization](/azure/cognitive-services/language-service/summarization/overview?tabs=conversation-summarization) (Language) APIs.
-
-The `transcription` property contains a JSON object with the results of sentiment analysis merged with batch transcription. Here's an example, with redactions for brevity:
-```json
-{
-    "source": "https://github.com/Azure-Samples/cognitive-services-speech-sdk/raw/master/scenarios/call-center/sampledata/Call1_separated_16k_health_insurance.wav",
-    // Example results redacted for brevity
-    "nBest": [
-        {
-            "confidence": 0.77464247,
-            "lexical": "hello thank you for calling contoso who am i speaking with today",
-            "itn": "hello thank you for calling contoso who am i speaking with today",
-            "maskedITN": "hello thank you for calling contoso who am i speaking with today",
-            "display": "Hello, thank you for calling Contoso. Who am I speaking with today?",
-            "sentiment": {
-                "positive": 0.78,
-                "neutral": 0.21,
-                "negative": 0.01
-            }
-        },
-    ]
-    // Example results redacted for brevity
-}
-```
-
-The `conversationAnalyticsResults` property contains a JSON object with the results of the conversation summarization analysis. Here's an example, with redactions for brevity:
-```json
-{
-    "conversationSummaryResults": {
-    }
-    // Example results redacted for brevity
-}
-```
+[!INCLUDE [Example output](example-output.md)]
 
 ## Usage and arguments
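The `transcription` output that this quickstart writes with `--output` is plain JSON, so the sentiment-merged phrases can be post-processed in any language. Here's a minimal, hypothetical Python sketch that picks the highest-confidence `nBest` alternative and its dominant sentiment label; the excerpt mirrors the redacted example from the docs, and a real output file may nest these fields more deeply:

```python
import json

# Excerpt shaped like the redacted `transcription` example in the docs;
# a real file comes from running the quickstart with `--output summary.json`.
transcription = json.loads("""
{
  "source": "https://github.com/Azure-Samples/cognitive-services-speech-sdk/raw/master/scenarios/call-center/sampledata/Call1_separated_16k_health_insurance.wav",
  "nBest": [
    {
      "confidence": 0.77464247,
      "display": "Hello, thank you for calling Contoso. Who am I speaking with today?",
      "sentiment": {"positive": 0.78, "neutral": 0.21, "negative": 0.01}
    }
  ]
}
""")

# Pick the highest-confidence alternative, then its dominant sentiment label.
best = max(transcription["nBest"], key=lambda alt: alt["confidence"])
label = max(best["sentiment"], key=best["sentiment"].get)
print(f"[{label}] {best['display']}")
# prints: [positive] Hello, thank you for calling Contoso. Who am I speaking with today?
```

The same traversal works on each recognized phrase when the file contains the full batch transcription response rather than this trimmed excerpt.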
articles/cognitive-services/Speech-Service/includes/quickstarts/call-center/example-output.md

Lines changed: 154 additions & 0 deletions

@@ -0,0 +1,154 @@
+---
+author: eric-urban
+ms.service: cognitive-services
+ms.subservice: speech-service
+ms.date: 09/29/2022
+ms.topic: include
+ms.author: eur
+---
+
+The console output shows the full conversation and summary. Here's an example of the overall summary, with redactions for brevity:
+
+```output
+Conversation summary:
+issue: Customer wants to sign up for insurance.
+resolution: Customer was advised that customer would be contacted by the insurance company.
+```
+
+If you specify the `--output FILE` optional [argument](/azure/cognitive-services/speech-service/call-center-quickstart#usage-and-arguments), a JSON version of the results is written to the file. The file output is a combination of the JSON responses from the [batch transcription](/azure/cognitive-services/speech-service/batch-transcription) (Speech), [sentiment](/azure/cognitive-services/language-service/sentiment-opinion-mining/overview) (Language), and [conversation summarization](/azure/cognitive-services/language-service/summarization/overview?tabs=conversation-summarization) (Language) APIs.
+
+The `transcription` property contains a JSON object with the results of sentiment analysis merged with batch transcription. Here's an example, with redactions for brevity:
+```json
+{
+    "source": "https://github.com/Azure-Samples/cognitive-services-speech-sdk/raw/master/scenarios/call-center/sampledata/Call1_separated_16k_health_insurance.wav",
+    // Example results redacted for brevity
+    "nBest": [
+        {
+            "confidence": 0.77464247,
+            "lexical": "hello thank you for calling contoso who am i speaking with today",
+            "itn": "hello thank you for calling contoso who am i speaking with today",
+            "maskedITN": "hello thank you for calling contoso who am i speaking with today",
+            "display": "Hello, thank you for calling Contoso. Who am I speaking with today?",
+            "sentiment": {
+                "positive": 0.78,
+                "neutral": 0.21,
+                "negative": 0.01
+            }
+        },
+    ]
+    // Example results redacted for brevity
+}
+```
+
+The `conversationAnalyticsResults` property contains a JSON object with the results of the conversation summarization analysis. Here's an example, with redactions for brevity:
+```json
+{
+    "conversationAnalyticsResults": {
+        "conversationSummaryResults": {
+            "conversations": [
+                {
+                    "id": "conversation1",
+                    "summaries": [
+                        {
+                            "aspect": "issue",
+                            "text": "Customer wants to sign up for insurance"
+                        },
+                        {
+                            "aspect": "resolution",
+                            "text": "Customer was advised that customer would be contacted by the insurance company"
+                        }
+                    ],
+                    "warnings": []
+                }
+            ],
+            "errors": [],
+            "modelVersion": "2022-05-15-preview"
+        },
+        "conversationPiiResults": {
+            "combinedRedactedContent": [
+                {
+                    "channel": "0",
+                    "display": "Hello, thank you for calling Contoso. Who am I speaking with today? Hi, ****. Uh, are you calling because you need health insurance?", // Example results redacted for brevity
+                    "itn": "hello thank you for calling contoso who am i speaking with today hi **** uh are you calling because you need health insurance", // Example results redacted for brevity
+                    "lexical": "hello thank you for calling contoso who am i speaking with today hi **** uh are you calling because you need health insurance" // Example results redacted for brevity
+                },
+                {
+                    "channel": "1",
+                    "display": "Hi, my name is **********. I'm trying to enroll myself with Contoso. Yes. Yeah, I'm calling to sign up for insurance.", // Example results redacted for brevity
+                    "itn": "hi my name is ********** i'm trying to enroll myself with contoso yes yeah i'm calling to sign up for insurance", // Example results redacted for brevity
+                    "lexical": "hi my name is ********** i'm trying to enroll myself with contoso yes yeah i'm calling to sign up for insurance" // Example results redacted for brevity
+                }
+            ],
+            "conversations": [
+                {
+                    "id": "conversation1",
+                    "conversationItems": [
+                        {
+                            "id": "0",
+                            "redactedContent": {
+                                "itn": "hello thank you for calling contoso who am i speaking with today",
+                                "lexical": "hello thank you for calling contoso who am i speaking with today",
+                                "text": "Hello, thank you for calling Contoso. Who am I speaking with today?"
+                            },
+                            "entities": [],
+                            "channel": "0",
+                            "offset": "PT0.77S"
+                        },
+                        {
+                            "id": "1",
+                            "redactedContent": {
+                                "itn": "hi my name is ********** i'm trying to enroll myself with contoso",
+                                "lexical": "hi my name is ********** i'm trying to enroll myself with contoso",
+                                "text": "Hi, my name is **********. I'm trying to enroll myself with Contoso."
+                            },
+                            "entities": [
+                                {
+                                    "text": "Mary Rondo",
+                                    "category": "Name",
+                                    "offset": 15,
+                                    "length": 10,
+                                    "confidenceScore": 0.97
+                                }
+                            ],
+                            "channel": "1",
+                            "offset": "PT4.55S"
+                        },
+                        {
+                            "id": "2",
+                            "redactedContent": {
+                                "itn": "hi **** uh are you calling because you need health insurance",
+                                "lexical": "hi **** uh are you calling because you need health insurance",
+                                "text": "Hi, ****. Uh, are you calling because you need health insurance?"
+                            },
+                            "entities": [
+                                {
+                                    "text": "Mary",
+                                    "category": "Name",
+                                    "offset": 4,
+                                    "length": 4,
+                                    "confidenceScore": 0.93
+                                }
+                            ],
+                            "channel": "0",
+                            "offset": "PT9.55S"
+                        },
+                        {
+                            "id": "3",
+                            "redactedContent": {
+                                "itn": "yes yeah i'm calling to sign up for insurance",
+                                "lexical": "yes yeah i'm calling to sign up for insurance",
+                                "text": "Yes. Yeah, I'm calling to sign up for insurance."
+                            },
+                            "entities": [],
+                            "channel": "1",
+                            "offset": "PT13.09S"
+                        },
+                        // Example results redacted for brevity
+                    ],
+                    "warnings": []
+                }
+            ]
+        }
+    }
+}
+```
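Because `conversationAnalyticsResults` is ordinary JSON, the summary aspects and redacted utterances above can be pulled out with a few lines of code. Here's a hypothetical Python sketch; the dictionary literal is trimmed from the example output, and the exact schema should be treated as illustrative, since it comes from a preview model version:

```python
# Trimmed to the fields used below; shaped like the example-output.md JSON.
analytics = {
    "conversationSummaryResults": {
        "conversations": [
            {
                "id": "conversation1",
                "summaries": [
                    {"aspect": "issue", "text": "Customer wants to sign up for insurance"},
                    {"aspect": "resolution", "text": "Customer was advised that customer would be contacted by the insurance company"},
                ],
            }
        ]
    },
    "conversationPiiResults": {
        "combinedRedactedContent": [
            {"channel": "0", "display": "Hello, thank you for calling Contoso. Who am I speaking with today? Hi, ****."},
            {"channel": "1", "display": "Hi, my name is **********. I'm trying to enroll myself with Contoso."},
        ]
    },
}

# Flatten the per-conversation summaries into an aspect -> text map.
summary = {
    s["aspect"]: s["text"]
    for conv in analytics["conversationSummaryResults"]["conversations"]
    for s in conv["summaries"]
}
print("issue:", summary["issue"])
print("resolution:", summary["resolution"])

# The PII results carry one redacted transcript per audio channel.
for item in analytics["conversationPiiResults"]["combinedRedactedContent"]:
    print(f"channel {item['channel']}: {item['display']}")
```

The aspect keys (`issue`, `resolution`) match the console summary shown at the top of the example output.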

articles/cognitive-services/Speech-Service/includes/quickstarts/call-center/intro.md

Lines changed: 1 addition & 1 deletion
@@ -2,7 +2,7 @@
 author: eric-urban
 ms.service: cognitive-services
 ms.topic: include
-ms.date: 09/18/2022
+ms.date: 09/29/2022
 ms.author: eur
 ---
 

articles/cognitive-services/Speech-Service/includes/quickstarts/call-center/usage-arguments.md

Lines changed: 3 additions & 3 deletions
@@ -2,7 +2,7 @@
 author: eric-urban
 ms.service: cognitive-services
 ms.topic: include
-ms.date: 09/18/2022
+ms.date: 09/29/2022
 ms.author: eur
 ---
 
@@ -20,7 +20,7 @@ Connection options include:
 Input options include:
 
 - `--input URL`: Input audio from URL. You must set either the `--input` or `--jsonInput` option.
-- `--jsonInput FILE`: Input an existing batch transcription JSON result from FILE. Use this option to process a transcription result that was previously generated by the Speech service. With this option, you don't need an audio file. Overrides `--input`, `--speechKey`, and `--speechRegion`. You must set either the `--input` or `--jsonInput` option.
+- `--jsonInput FILE`: Input an existing batch transcription JSON result from FILE. With this option, you only need a Language resource to process a transcription that you already have; you don't need an audio file or a Speech resource. Overrides `--input`. You must set either the `--input` or `--jsonInput` option.
 - `--stereo`: Use stereo audio format. If stereo isn't specified, then mono 16khz 16 bit PCM wav files are assumed. Diarization of mono files is used to separate multiple speakers. Diarization of stereo files isn't supported, since 2-channel stereo files should already have one speaker per channel.
 - `--certificate`: The PEM certificate file. Required for C++.
 
@@ -32,4 +32,4 @@ Language options include:
 Output options include:
 
 - `--help`: Show the usage help and stop
-- `--output FILE`: Output the transcription, sentiment, and conversation summaries in JSON format to a text file.
+- `--output FILE`: Output the transcription, sentiment, and conversation summaries in JSON format to a text file. For more information, see [output examples](/azure/cognitive-services/speech-service/call-center-quickstart#check-results).
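The input rules above (you must set either `--input` or `--jsonInput`, and `--jsonInput` overrides `--input`) can be captured with standard argument parsing. Here's a hypothetical Python sketch of just that contract; it covers only a subset of the documented options and is not the quickstart's actual C# implementation:

```python
import argparse

def build_parser() -> argparse.ArgumentParser:
    # Subset of the documented options; names mirror the quickstart's CLI.
    parser = argparse.ArgumentParser(description="Call center argument contract (sketch)")
    parser.add_argument("--input", metavar="URL", help="Input audio from URL.")
    parser.add_argument("--jsonInput", metavar="FILE",
                        help="Existing batch transcription JSON result; overrides --input.")
    parser.add_argument("--stereo", action="store_true",
                        help="Use stereo audio format; mono is assumed otherwise.")
    parser.add_argument("--output", metavar="FILE",
                        help="Write transcription, sentiment, and summaries as JSON.")
    return parser

def resolve_source(args: argparse.Namespace) -> tuple:
    # Enforce the documented rule: one of --input / --jsonInput is required.
    if not (args.input or args.jsonInput):
        raise SystemExit("You must set either the --input or --jsonInput option.")
    # --jsonInput wins when both are given, matching the documented override.
    return ("json", args.jsonInput) if args.jsonInput else ("audio", args.input)

if __name__ == "__main__":
    args = build_parser().parse_args(["--input", "call.wav", "--jsonInput", "transcript.json"])
    print(resolve_source(args))  # ('json', 'transcript.json')
```

Passing both options resolves to the JSON file, which is why a Speech resource isn't needed in the `--jsonInput` path.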
