You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/communication-services/concepts/ai.md
+36-26Lines changed: 36 additions & 26 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,46 +6,56 @@ author: chpalm
6
6
manager: sundraman
7
7
services: azure-communication-services
8
8
ms.author: chpalm
9
-
ms.date: 07/10/2024
9
+
ms.date: 2/3/2025
10
10
ms.topic: conceptual
11
11
ms.service: azure-communication-services
12
12
---
13
13
14
14
# Artificial intelligence overview
15
15
16
-
AI technologies are useful for various communication experiences. This article summarizes the availability of AI and AI-adjacent features in Azure Communication Services. AI features can be split into three categories:
17
-
18
-
-**Accessors**: APIs that you can use to access Azure Communication Services data for the purposes of integrating your own separate transformations and bots.
19
-
-**Transformers**: APIs that provide a built-in transformation of communication data by using a machine learning or language model.
20
-
-**Bots**: APIs that implement bots that directly communicate with users. Typically, they blend structured programming with language models.
21
-
22
-
Typical communication scenarios integrate these capabilities:
16
+
AI technologies are useful for many communication experiences. AI can help humans communicate better and accomplish their mission more efficiently, for example, a banking employee may use an AI generated meeting summary to help them follow up. AI can reduce human workloads and enable more flexible customer engagement, such as operating a 24/7 phone bot that customers call to check their account balance.
23
17
18
+
More examples include:
19
+
- Operate a chat or voice bot that responds to human conversation.
24
20
- Transform audio speech content into text transcriptions.
25
21
- Transform a video feed to blur the user's background.
26
-
-Operate a chat or voice bot that responds to human conversation.
22
+
-Annotate and analyze conversations to identify trends and opportunities to improve service.
27
23
- Transform a corpus of text chat and meeting transcriptions into summaries. This experience might involve a generative AI interface in which a user asks, "Summarize all conversations between me and user Joe."
28
24
25
+
This article summarizes the availability of AI and AI-adjacent features in Azure Communication Services. There are two broad categories of AI functionality:
26
+
27
+
-**Integrated AI**: Azure Communication Services is directly integrated with Azure AI and Microsoft Copilot Studio. Generally these features require you to create and link both Azure Communication Services and Azure AI resources in the Azure portal. After this one-time linking, using these AI features is as straightforward as a single API call.
28
+
-**Accessors**: Azure Communication Services provides various APIs that give you raw and transformed access to your communication data making it easy for you to internal your own external services and AI systems.
29
+
29
30
## Messaging: SMS, chat, email, WhatsApp
30
31
31
-
Azure Communication Services capabilities for asynchronous messaging share common patterns for integrating AI listed here.
32
+
This section summarizes features for integrating AI into Azure Communication messaging.
| REST APIs and SDKs| ✅ ||| The messaging services center around REST APIs and server-oriented SDKs. You can use these SDKs to export content to an external datastore and attach a language model to summarize conversations. Or you can use the SDKs to integrate a bot that directly engages with human users. |
36
-
| WhatsApp message analysis || ✅ || The Azure Communication Services messaging APIs for WhatsApp provide a built-in integration with Azure OpenAI that analyzes and annotates messages. This integration can detect the user's language, recognize their intent, and extract key phrases. |
37
-
|[Azure Bot Service: Chat channel integration](../quickstarts/chat/quickstart-botframework-integration.md)||| ✅ | The Azure Communication Services chat system is directly integrated with Azure Bot Service. This integration simplifies creating chat bots that engage with human users.|
34
+
### Direct Integrations
35
+
36
+
-**[Advanced message analysis](../concepts/advanced-messaging/message-analysis/message-analysis.md)** The Azure Communication Services messaging APIs for WhatsApp provide a built-in integration with Azure OpenAI that analyzes and annotates messages. This integration can detect the user's language, recognize their intent, and extract key phrases.
37
+
-**[Azure Bot Service: Chat channel integration](../quickstarts/chat/quickstart-botframework-integration.md)** - The Azure Communication Services chat system is directly integrated with Azure Bot Service. This integration simplifies creating chat bots that engage with human users.
38
+
39
+
### Accessors
40
+
All Azure Communication Services messaging capabilities are accessible through REST APIs, server-oriented SDKs, and Event Grid notifications. You can use these SDKs to export content to an external datastore and attach a language model to summarize conversations. Or you can use the SDKs to integrate a bot that directly engages with human users. For example, this [GitHub sample](https://github.com/Azure-Samples/communication-services-javascript-quickstarts/tree/main/chat-nlp-analysis) shows how Azure Communication Services APIs for chat can be accessed through REST APIs and then analyzed by Azure OpenAI.
38
41
39
42
## Voice, video, and telephony
40
43
41
-
The patterns for integrating AI into the voice and video system are summarized here.
|[Call Automation REST APIs and SDKs](../concepts/call-automation/call-automation.md)| ✅ | ✅ || Call Automation APIs include both accessors and transformers, with REST APIs for playing audio files and recognizing a user’s response. The `recognize` APIs integrate Azure Bot Service to transform users' audio content into text for easier processing by your service. The most common scenario for these APIs is implementing voice bots, which is sometimes called interactive voice response (IVR). |
46
-
|[Microsoft Copilot Studio](/microsoft-copilot-studio/voice-overview)|| ✅ | ✅ | Copilot Studio is directly integrated with Azure Communication Services telephony. This integration is designed for voice bots and IVR. |
47
-
|[Copilot in the Azure portal](/azure/communication-services/concepts/voice-video-calling/call-diagnostics#copilot-in-azure-for-call-diagnostics)|| ✅ | ✅ | You can use Copilot in the Azure portal to ask questions about Azure Communication Services. Currently, Copilot answers questions by using information solely from Azure technical documentation. It's best used for asking questions about error codes and API behavior. |
48
-
|[Client raw audio and video](../concepts/voice-video-calling/media-access.md)| ✅ ||| The Calling client SDK provides APIs for accessing and modifying the raw audio and video feed. An example scenario is taking the video feed, detecting the human speaker and their background, and customizing that background. |
49
-
|[Client background effects](../quickstarts/voice-video-calling/get-started-video-effects.md?pivots=platform-web)|| ✅ || The Calling client SDKs provide APIs for blurring or replacing a user's background. |
50
-
|[Client captions](../concepts/voice-video-calling/closed-captions.md)|| ✅ || The Calling client SDK provides APIs for real-time closed captions. These APIs internally integrate Azure Cognitive Services to transform audio content from the call into text in real time. |
51
-
|[Client noise enhancement and effects](../tutorials/audio-quality-enhancements/add-noise-supression.md?pivots=platform-web)|| ✅ || The Calling client SDK integrates a [DeepVQE](https://arxiv.org/abs/2306.03177) machine learning model to improve audio quality through echo cancellation and background noise suppression. This transformation is toggled on and off by using the client SDK. |
44
+
This section summarizes features for integrating AI into Azure Communication voice and video calling.
45
+
46
+
### Direct Integrations
47
+
48
+
-**[Call Automation REST APIs and SDKs](../concepts/call-automation/call-automation.md)**- Azure Communication Services has simple APIs for [synthesizing](../concepts/call-automation/play-action.md) and [recognizing](../concepts/call-automation/recognize-action.md) speech. The most common scenario for these APIs is implementing voice bots, which is sometimes called interactive voice response (IVR).
49
+
-**[Microsoft Copilot Studio](/microsoft-copilot-studio/voice-overview)** - Copilot Studio is directly integrated with Azure Communication Services telephony. This integration is designed for voice bots and IVR.
50
+
-**[Client captions](../concepts/voice-video-calling/closed-captions.md)** The Calling client SDK provides APIs for real-time closed captions, optimized for accessibility.
51
+
-**[Copilot in the Azure portal](/azure/communication-services/concepts/voice-video-calling/call-diagnostics#copilot-in-azure-for-call-diagnostics)** - You can use Copilot in the Azure portal to ask questions about Azure Communication Services. Copilot uses Azure technical documentation to answer your questions and is best used for asking questions about error codes and API behavior.
52
+
-**[Client background effects](../quickstarts/voice-video-calling/get-started-video-effects.md?pivots=platform-web)** - The Calling client SDKs provide APIs for blurring or replacing a user's background.
53
+
-**[Client noise enhancement and effects](../tutorials/audio-quality-enhancements/add-noise-supression.md?pivots=platform-web)** - The Calling client SDK integrates a [DeepVQE](https://arxiv.org/abs/2306.03177) machine learning model to improve audio quality through echo cancellation and background noise suppression. This transformation is toggled on and off by using the client SDK.
54
+
55
+
### Accessors
56
+
Similar to Azure Communication Services messaging, there are REST APIs for many voice and video calling features. However the real-time nature of calling requires closed source SDKs and more complex APIs such as websocket streaming.
57
+
58
+
-**[Call Automation REST APIs and SDKs](../concepts/call-automation/call-automation.md)** - Services and AI applications use Call Automation REST APIs to answer, route, and manage all types of Azure voice and video calls.
59
+
-**[Service-to-service audio streaming](../concepts/call-automation/audio-streaming-concept.md)** - AI applications use Azure's service-to-service WebSockets API to stream audio data. This works in both directions, your AI can listen to a call, and speak.
60
+
-**[Service-to-service real-time transcription](../concepts/call-automation/real-time-transcription.md)** - AI applications use Azure's service-to-service WebSockets API to stream a real-time, Azure-generated transcription. Compared to audio or video content, transcript data is often easier for AI models to reason upon.
61
+
-**[Client raw audio and video](../concepts/voice-video-calling/media-access.md)** - The Calling client SDK provides APIs for accessing and modifying the raw audio and video feed. An example scenario is taking the video feed, using computer vision to distinguish the human speaker from their background, and customizing that background.
0 commit comments