Skip to content

Commit 51efd99

Browse files
authored
Update audio-streaming-concept.md
add content for bidirectional streaming
1 parent 40bb6c8 commit 51efd99

File tree

1 file changed

+14
-9
lines changed

1 file changed

+14
-9
lines changed

articles/communication-services/concepts/call-automation/audio-streaming-concept.md

Lines changed: 14 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ description: Conceptual information about using Audio Streaming APIs with Call A
55
author: Alvin
66
ms.service: azure-communication-services
77
ms.topic: overview
8-
ms.date: 07/17/2024
8+
ms.date: 11/24/2024
99
ms.author: alvinhan
1010
ms.custom: public_prview
1111
---
@@ -14,20 +14,25 @@ ms.custom: public_prview
1414

1515
[!INCLUDE [Public Preview Disclaimer](../../includes/public-preview-include-document.md)]
1616

17-
Azure Communication Services provides developers with Audio Streaming capabilities to get real-time access to audio streams to capture, analyze, and process audio content during active calls. In today's world consumption of live audio and video is prevalent, this content could be in the forms of online meetings, online conferences, customer support, etc. With audio streaming access, developers can now build server applications to capture and analyze audio streams for each of the participants on the call in real-time. Developers can also combine audio streaming with other call automation actions or use their own AI models to analyze audio streams. Use cases include NLP for conversation analysis or providing real-time insights and suggestions to agents while they are in an active interaction with end users.
17+
Azure Communication Services provides bidirectional audio streaming capabilities, offering developers powerful tools to capture, analyze, and process audio content during active calls. This paves the way for new possibilities in real-time communication for developers and businesses alike.
1818

19-
This public preview supports the ability for developers to get access to real-time audio streams over a WebSocket to analyze the call's audio in mixed and unmixed formats.
19+
By integrating bidirectional audio streaming with services like Azure Open AI and other real-time voice APIs, businesses can achieve seamless, low-latency communication. This significantly enhances the development and deployment of conversational AI solutions, allowing for more engaging and efficient interactions.
2020

21-
## Common use cases
22-
Audio streams can be used in many ways. Some examples of how developers may wish to use the audio streams in their applications include:
21+
With bidirectional streaming, businesses can now elevate their voice solutions to low-latency, human-like, interactive conversational AI agents. Our bidirectional streaming APIs enable developers to stream audio from an ongoing call on Azure Communication Services to their web servers in real-time and stream audio back into the call. While the initial focus of these features is to help businesses create conversational AI agents, other use cases include Natural Language Processing for conversation analysis or providing real-time insights and suggestions to agents while they are in active interaction with end users.
22+
23+
This public preview supports the ability for developers to access real-time audio streams over a WebSocket from Azure Communication Services and stream audio back into the call.
2324

2425
### Real-time call assistance
2526

26-
**Improved AI powered suggestions** - Use real-time audio streams of active interactions between agents and customers to gauge the intent of the call and how your agents can provide a better experience to their customer through active suggestions using your own AI model to analyze the call.
27+
- **Leverage conversational AI Solutions:** Develop sophisticated customer support virtual agents that can interact with customers in real-time, providing immediate responses and solutions.
28+
29+
- **Personalized customer experiences:** By harnessing real-time data, businesses can offer more personalized and dynamic customer interactions in real-time, leading to increased satisfaction and loyalty.
30+
31+
- **Reduce wait times for customers:** By using bidirectional audio streams in combination with Large Language Models (LLMs) you can build virtual agents that can be the first point of contact for customers reducing the need for customers waiting for a human agent being available.
2732

2833
### Authentication
2934

30-
**Biometric authentication** – Use the audio streams to carry out voice authentication, by running the audio from the call through your voice recognition/matching engine/tool.
35+
- **Biometric authentication** – Use the audio streams to carry out voice authentication, by running the audio from the call through your voice recognition/matching engine/tool.
3136

3237
## Sample architecture for subscribing to audio streams from an ongoing call - live agent scenario
3338

@@ -46,8 +51,8 @@ The table below describes information that will help developers convert the audi
4651
- Framerate: 50 frames per second
4752
- Packet stream rate: 20 ms rate
4853
- Data packet: 64 Kbytes
49-
- Audio metric: 16-bit PCM mono at 16000 hz
50-
- Public string data is a base64 string that should be converted into a byte array to create raw PCM file.\
54+
- Audio metric: 16-bit PCM mono at 16000 hz and 24000 hz
55+
- Public string data is a base64 string that should be converted into a byte array to create raw PCM file.
5156

5257
## Billing
5358
See the [Azure Communication Services pricing page](https://azure.microsoft.com/pricing/details/communication-services/?msockid=3b3359f3828f6cfe30994a9483c76d50) for information on how audio streaming is billed. Prices can be found in the calling category under audio streaming.

0 commit comments

Comments
 (0)