Commit 879069c

Merge pull request #208973 from eric-urban/eur/jagoerge-callcenter-update-2
Eur/jagoerge callcenter update 2
2 parents 430bec7 + fe61683 commit 879069c

11 files changed

+172
-185
lines changed

articles/cognitive-services/.openpublishing.redirection.cognitive-services.json

Lines changed: 6 additions & 1 deletion
@@ -4242,7 +4242,7 @@
42424242
},
42434243
{
42444244
"source_path_from_root": "/articles/cognitive-services/Speech-Service/index-conversation-scenarios.yml",
4245-
"redirect_url": "/azure/cognitive-services/Speech-Service/call-center-transcription.md",
4245+
"redirect_url": "/azure/cognitive-services/Speech-Service/call-center-overview",
42464246
"redirect_document_id": false
42474247
},
42484248
{
@@ -4955,6 +4955,11 @@
49554955
"redirect_url": "/azure/cognitive-services/speech-service/overview",
49564956
"redirect_document_id": false
49574957
},
4958+
{
4959+
"source_path_from_root": "/articles/cognitive-services/Speech-Service/call-center-transcription.md ",
4960+
"redirect_url": "/azure/cognitive-services/speech-service/call-center-overview",
4961+
"redirect_document_id": false
4962+
},
49584963
{
49594964
"source_path_from_root": "/articles/cognitive-services/text-analytics/concepts/data-limits.md",
49604965
"redirect_url": "/azure/cognitive-services/language-service/overview",
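A reviewer's aside on the hunks above: the first hunk drops a `.md` suffix from a `redirect_url`, and the added entry's `source_path_from_root` appears to end with a space before the closing quote. The following is a minimal sketch of a check that would flag both patterns. The function name `lint_redirects` and the rule that a `redirect_url` should not end in `.md` are assumptions inferred from this diff, not documented rules of the publishing system.

```python
from typing import Dict, List


def lint_redirects(entries: List[Dict[str, object]]) -> List[str]:
    """Return human-readable problems found in redirect entries.

    The two rules are assumptions drawn from this commit, not an official schema:
    paths and URLs carry no surrounding whitespace, and redirect URLs are
    site-relative paths without a ".md" suffix.
    """
    problems: List[str] = []
    for index, entry in enumerate(entries):
        source = str(entry.get("source_path_from_root", ""))
        target = str(entry.get("redirect_url", ""))
        if source != source.strip():
            problems.append(f"entry {index}: source_path_from_root has surrounding whitespace")
        if target != target.strip():
            problems.append(f"entry {index}: redirect_url has surrounding whitespace")
        if target.strip().endswith(".md"):
            problems.append(f"entry {index}: redirect_url ends with .md")
    return problems


if __name__ == "__main__":
    sample = [
        {
            "source_path_from_root": "/articles/cognitive-services/Speech-Service/call-center-transcription.md ",
            "redirect_url": "/azure/cognitive-services/speech-service/call-center-overview",
            "redirect_document_id": False,
        }
    ]
    for problem in lint_redirects(sample):
        print(problem)
```

Run against the added entry, the sketch reports only the whitespace issue; whether the publishing system tolerates that trailing space is not something this diff shows.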
Lines changed: 73 additions & 0 deletions
@@ -0,0 +1,73 @@
1+
---
2+
title: Azure Cognitive Services for Call Center Overview
3+
titleSuffix: Azure Cognitive Services
4+
description: Azure Cognitive Services for Language and Speech can help you realize partial or full automation of telephony-based customer interactions, and provide accessibility across multiple channels.
5+
services: cognitive-services
6+
author: goergenj
7+
ms.author: jagoerge
8+
ms.service: cognitive-services
9+
ms.subservice: speech-service
10+
ms.topic: conceptual
11+
ms.date: 08/10/2022
12+
---
13+
14+
# Call Center Overview
15+
16+
Azure Cognitive Services for Language and Speech can help you realize partial or full automation of telephony-based customer interactions, and provide accessibility across multiple channels. With the Language and Speech services, you can further analyze call center transcriptions, extract and redact conversation personally identifiable information (PII), summarize the transcription, and detect the sentiment.
17+
18+
Some example scenarios for the implementation of Azure Cognitive Services in call and contact centers are:
19+
- Virtual agents: Conversational AI-based telephony-integrated voicebots and voice-enabled chatbots
20+
- Agent-assist: Real-time transcription and analysis of a call to improve the customer experience by providing insights and suggesting actions to agents
21+
- Post-call analytics: Analysis of completed calls to create insights into customer conversations, improving understanding and supporting continuous improvement of call handling, quality assurance, compliance control, and other insight-driven optimizations
22+
23+
> [!TIP]
24+
> To deploy a call center transcription solution to Azure with a no-code approach, try the [Ingestion Client](/azure/cognitive-services/speech-service/ingestion-client).
25+
26+
## Cognitive Services features for call centers
27+
28+
A holistic call center implementation typically incorporates technologies from the Language and Speech services.
29+
30+
Call center audio, which is typically generated through landlines, mobile phones, and radios, is often narrowband, in the range of 8 kHz, which can create challenges when you're converting speech to text. The Speech service recognition models are trained to ensure that you can get high-quality transcriptions, however you choose to capture the audio.
31+
32+
Once you've transcribed your audio with the Speech service, you can use the Language service to perform analytics on your call center data, such as sentiment analysis, summarization of the reasons for customer calls and how they were resolved, extraction and redaction of conversation PII, and more.
33+
34+
### Speech service
35+
36+
The Speech service offers the following features that can be used for call center use cases:
37+
38+
- [Real-time speech-to-text](/azure/cognitive-services/speech-service/how-to-recognize-speech): Recognize and transcribe audio in real time from multiple inputs. For example, with virtual agents or agent-assist, you can continuously recognize audio input and control how to process results based on multiple events.
39+
- [Batch speech-to-text](/azure/cognitive-services/speech-service/batch-transcription): Transcribe large volumes of audio files asynchronously, including speaker diarization. Batch transcription is typically used in post-call analytics scenarios. Diarization is the process of recognizing and separating speakers in mono channel audio data.
40+
- [Text-to-speech](/azure/cognitive-services/speech-service/text-to-speech): Text-to-speech enables your applications, tools, or devices to convert text into humanlike synthesized speech.
41+
- [Speaker identification](/azure/cognitive-services/speech-service/speaker-recognition-overview): Helps you determine an unknown speaker’s identity within a group of enrolled speakers and is typically used for call center customer verification scenarios or fraud detection.
42+
- [Language identification](/azure/cognitive-services/speech-service/language-identification): Identify the languages spoken in audio. It can be used in real-time and post-call analysis to gain insights or to control the environment (such as the output language of a virtual agent).
43+
44+
The Speech service works well with prebuilt models. However, you might want to further customize and tune the experience for your product or environment. Typical examples for Speech customization include:
45+
46+
| Speech customization | Description |
47+
| -------------- | ----------- |
48+
| [Custom Speech](/azure/cognitive-services/speech-service/custom-speech-overview) | A speech-to-text feature used to evaluate and improve the speech recognition accuracy of use-case-specific entities (such as alphanumeric customer, case, and contract IDs, license plates, and names). You can also train a custom model with your own product names and industry terminology. |
49+
| [Custom Neural Voice](/azure/cognitive-services/speech-service/custom-neural-voice) | A text-to-speech feature that lets you create a one-of-a-kind, customized, synthetic voice for your applications. |
50+
51+
### Language service
52+
53+
The Language service offers the following features that can be used for call center use cases:
54+
55+
- [Personally Identifiable Information (PII) extraction and redaction](/azure/cognitive-services/language-service/personally-identifiable-information/how-to-call-for-conversations): Identify, categorize, and redact sensitive information in conversation transcription.
56+
- [Conversation summarization](/azure/cognitive-services/language-service/summarization/overview?tabs=conversation-summarization): Summarize in abstract text what each conversation participant said about the issues and resolutions. For example, a call center can group product issues that have a high volume.
57+
- [Sentiment analysis and opinion mining](/azure/cognitive-services/language-service/sentiment-opinion-mining/overview): Analyze transcriptions and associate positive, neutral, or negative sentiment at the utterance and conversation levels.
58+
59+
While the Language service works well with prebuilt models, you might want to further customize and tune models to extract more information from your data. Typical examples for Language customization include:
60+
61+
| Language customization | Description |
62+
| -------------- | ----------- |
63+
| [Custom NER (named entity recognition)](/azure/cognitive-services/language-service/custom-named-entity-recognition/overview) | Improve the detection and extraction of entities in transcriptions. |
64+
| [Custom text classification](/azure/cognitive-services/language-service/custom-text-classification/overview) | Classify and label transcribed utterances with either single or multiple classifications. |
65+
66+
You can find an overview of all Language service features and customization options [here](/azure/cognitive-services/language-service/overview#available-features).
67+
68+
## Next steps
69+
70+
* [Try out the Language Studio](https://language.cognitive.azure.com)
71+
* [Explore the Language service features](/azure/cognitive-services/language-service/overview#available-features)
72+
* [Try out the Speech Studio](https://speech.microsoft.com)
73+
* [Explore the Speech service features](/azure/cognitive-services/speech-service/overview)
Lines changed: 36 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,36 @@
1+
---
2+
title: Call Center Telephony Integration - Speech service
3+
titleSuffix: Azure Cognitive Services
4+
description: A common scenario for speech-to-text is transcribing large volumes of telephony data in real time from various systems, such as interactive voice response (IVR). This requires an integration with the telephony system in use.
5+
services: cognitive-services
6+
author: goergenj
7+
ms.author: jagoerge
8+
ms.service: cognitive-services
9+
ms.subservice: speech-service
10+
ms.topic: conceptual
11+
ms.date: 08/10/2022
12+
ms.custom: template-concept
13+
---
14+
15+
# Telephony Integration
16+
17+
To support real-time scenarios, like Virtual Agent and Agent Assist in call centers, an integration with the call center's telephony system is required.
18+
19+
Typically, the integration with the Speech service is handled by a telephony client connected to the customer's SIP/RTP processor, for example, to a Session Border Controller (SBC).
20+
21+
Usually, the telephony client handles the incoming audio stream from the SIP/RTP processor, converts it to PCM, and connects the streams to the Speech service using continuous recognition. It also triages the processing of the results, for example, analyzing speech transcripts for Agent Assist or connecting to a dialog processing engine (for example, Azure Bot Framework or Power Virtual Agents) for Virtual Agent.
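As an illustration of the "conversion to PCM" step, here is a minimal sketch that expands G.711 mu-law bytes into 16-bit little-endian PCM. It assumes the telephony leg delivers mu-law payloads, which is common on narrowband lines but is not stated by the article. The function names are hypothetical, and the expansion follows the widely used reference formula (bias `0x84`), not any Speech SDK API.

```python
import struct

_BIAS = 0x84  # bias constant used by the common G.711 mu-law reference decoder


def mulaw_byte_to_pcm16(code: int) -> int:
    """Expand one 8-bit mu-law code word to a signed 16-bit sample."""
    code = ~code & 0xFF  # mu-law bytes are transmitted bit-inverted
    mantissa = code & 0x0F
    exponent = (code & 0x70) >> 4
    magnitude = ((mantissa << 3) + _BIAS) << exponent
    # After inversion, a set high bit marks a negative sample.
    return _BIAS - magnitude if code & 0x80 else magnitude - _BIAS


def mulaw_to_pcm16(payload: bytes) -> bytes:
    """Convert a mu-law payload (one byte per sample) to 16-bit little-endian PCM."""
    samples = [mulaw_byte_to_pcm16(b) for b in payload]
    return struct.pack(f"<{len(samples)}h", *samples)


if __name__ == "__main__":
    frame = b"\xff" * 160  # 20 ms of silence at 8 kHz, one byte per sample
    pcm = mulaw_to_pcm16(frame)
    print(len(frame), "mu-law bytes ->", len(pcm), "PCM bytes")
```

A production client would more likely rely on an optimized codec library; the pure-Python loop is only meant to show what the conversion does.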
22+
23+
For easier integration, the Speech service also supports "ALAW in WAV container" and "MULAW in WAV container" for audio streaming.
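To make "MULAW in WAV container" concrete, the sketch below wraps raw G.711 bytes in a RIFF/WAVE header using only the standard library. The format tags (`0x0006` for A-law, `0x0007` for mu-law) come from the WAVE format registry. The function name and the choice to include a `fact` chunk are assumptions, and the exact header layout the Speech service expects should be confirmed against its documentation.

```python
import struct

WAVE_FORMAT_ALAW = 0x0006
WAVE_FORMAT_MULAW = 0x0007


def wrap_g711_in_wav(payload: bytes,
                     format_tag: int = WAVE_FORMAT_MULAW,
                     sample_rate: int = 8000,
                     channels: int = 1) -> bytes:
    """Return a WAV file image containing the given A-law or mu-law payload."""
    bits_per_sample = 8                      # G.711 uses one byte per sample
    block_align = channels * bits_per_sample // 8
    byte_rate = sample_rate * block_align
    # Non-PCM formats use the 18-byte fmt chunk with a trailing cbSize of 0.
    fmt_chunk = struct.pack("<4sIHHIIHHH", b"fmt ", 18, format_tag, channels,
                            sample_rate, byte_rate, block_align,
                            bits_per_sample, 0)
    # The fact chunk records the number of sample frames in the data chunk.
    fact_chunk = struct.pack("<4sII", b"fact", 4, len(payload) // block_align)
    pad = b"\x00" if len(payload) % 2 else b""   # RIFF chunks are word-aligned
    data_chunk = struct.pack("<4sI", b"data", len(payload)) + payload + pad
    body = b"WAVE" + fmt_chunk + fact_chunk + data_chunk
    return struct.pack("<4sI", b"RIFF", len(body)) + body


if __name__ == "__main__":
    wav = wrap_g711_in_wav(b"\xff" * 160)    # placeholder mu-law bytes
    with open("sample_mulaw.wav", "wb") as handle:
        handle.write(wav)
```

For live streaming the total length is not known up front, so a client would typically send a header once and then push raw frames; how the service treats the size fields in that case is not covered here.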
24+
25+
To build this integration, we recommend using the [Speech SDK](/azure/cognitive-services/speech-service/speech-sdk).
26+
27+
28+
> [!TIP]
29+
> For guidance on reducing text-to-speech latency, see the **[How to lower speech synthesis latency](/azure/cognitive-services/speech-service/how-to-lower-speech-synthesis-latency?pivots=programming-language-csharp)** guide.
30+
>
31+
> In addition, consider implementing a text-to-speech cache that stores all synthesized audio and plays back from the cache whenever a string has already been synthesized.
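The caching idea in the tip can be sketched as follows. This is a minimal in-memory version under stated assumptions: `SpeechCache` and its methods are hypothetical names, and `synthesize` stands in for whatever function actually calls the text-to-speech service and returns audio bytes. No Speech SDK call is shown.

```python
import hashlib
from typing import Callable, Dict


class SpeechCache:
    """Cache synthesized audio keyed by voice and text (in-memory sketch)."""

    def __init__(self, synthesize: Callable[[str, str], bytes]) -> None:
        # synthesize(voice, text) -> audio bytes; a placeholder for the real TTS call
        self._synthesize = synthesize
        self._store: Dict[str, bytes] = {}
        self.hits = 0
        self.misses = 0

    @staticmethod
    def _key(voice: str, text: str) -> str:
        # Hash voice and text together so the same prompt in another voice is distinct.
        return hashlib.sha256(f"{voice}\x00{text}".encode("utf-8")).hexdigest()

    def get_audio(self, voice: str, text: str) -> bytes:
        key = self._key(voice, text)
        audio = self._store.get(key)
        if audio is None:
            self.misses += 1
            audio = self._synthesize(voice, text)  # only reached on a cache miss
            self._store[key] = audio
        else:
            self.hits += 1
        return audio


if __name__ == "__main__":
    def fake_synthesize(voice: str, text: str) -> bytes:
        return f"{voice}:{text}".encode("utf-8")  # stand-in for real audio

    cache = SpeechCache(fake_synthesize)
    cache.get_audio("example-voice", "Thank you for calling.")
    cache.get_audio("example-voice", "Thank you for calling.")
    print("hits:", cache.hits, "misses:", cache.misses)
```

A real deployment would probably persist the audio (for example, in blob storage) and include the output format and any SSML settings in the key, since those also change the synthesized result.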
32+
33+
## Next steps
34+
35+
* [Learn about Speech SDK](/azure/cognitive-services/speech-service/speech-sdk)
36+
* [How to lower speech synthesis latency](/azure/cognitive-services/speech-service/how-to-lower-speech-synthesis-latency)
