You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/communication-services/concepts/call-automation/audio-streaming-concept.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -34,7 +34,7 @@ This public preview supports the ability for developers to access real-time audi
34
34
35
35
-**Biometric authentication** – Use the audio streams to carry out voice authentication, by running the audio from the call through your voice recognition/matching engine/tool.
36
36
37
-
## Sample architecture for subscribing to audio streams from an ongoing call - live agent scenario
37
+
## Sample architecture showing how bidirectional audio streaming can be used for conversational AI agents
38
38
39
39
[](./media/bidirectional-streaming.png#lightbox)
40
40
@@ -47,10 +47,10 @@ Contains mixed audio of all participants on the call. All audio is flattened int
47
47
Contains audio per participant per channel, with support for up to four channels for the four most dominant speakers at any point in a call. You also get a participantRawID that you can use to determine the speaker.
48
48
49
49
## Additional information
50
-
The following describes information that helps developers convert the audio packets into audible content that can be used by their applications.
50
+
The following describes information about audio that is sent from Azure Communication Services, that developers can use to convert the audio packets into audible content that can be used by their applications.
51
51
- Framerate: 50 frames per second
52
52
- Packet stream rate: 20-ms rate
53
-
- Data packet: 64 Kbytes
53
+
- Data packet size: 640 bytes for 16000 hz and 960 bytes for 24000 hz
54
54
- Audio metric: 16-bit PCM mono at 16000 hz and 24000 hz
55
55
- Public string data is a base64 string that should be converted into a byte array to create raw PCM file.
0 commit comments