MicrosoftDocs
diff --git a/‎articles/communication-services/.openpublishing.redirection.communication-services.json
Lines changed: 10 additions & 0 deletions b/‎articles/communication-services/.openpublishing.redirection.communication-services.json
Lines changed: 10 additions & 0 deletions
diff --git a/‎articles/communication-services/concepts/call-automation/audio-streaming-concept.md
Lines changed: 53 additions & 0 deletions b/‎articles/communication-services/concepts/call-automation/audio-streaming-concept.md
Lines changed: 53 additions & 0 deletions
diff --git a/‎articles/communication-services/concepts/call-automation/media/audio-streaming-diagram.png
98.9 KB b/‎articles/communication-services/concepts/call-automation/media/audio-streaming-diagram.png
98.9 KB
diff --git a/‎articles/communication-services/concepts/voice-video-calling/media-streaming.md
Lines changed: 0 additions & 57 deletions b/‎articles/communication-services/concepts/voice-video-calling/media-streaming.md
Lines changed: 0 additions & 57 deletions
diff --git a/‎articles/communication-services/how-tos/call-automation/audio-streaming-quickstart.md
Lines changed: 64 additions & 0 deletions b/‎articles/communication-services/how-tos/call-automation/audio-streaming-quickstart.md
Lines changed: 64 additions & 0 deletions
diff --git a/‎articles/communication-services/how-tos/call-automation/includes/audio-streaming-quickstart-csharp.md
Lines changed: 139 additions & 0 deletions b/‎articles/communication-services/how-tos/call-automation/includes/audio-streaming-quickstart-csharp.md
Lines changed: 139 additions & 0 deletions
@@ -440,6 +440,16 @@
             "source_path_from_root": "/articles/communication-services/quickstarts/media-composition/get-started-media-composition.md",
             "redirect_url": "/azure/communication-services/quickstarts/voice-video-calling/getting-started-with-calling",
             "redirect_document_id": false
+        },
+        {
+            "source_path_from_root": "/articles/communication-services/quickstarts/voice-video-calling/media-streaming.md",
+            "redirect_url": "/azure/communication-services/how-tos/call-automation/audio-streaming-quickstart",
+            "redirect_document_id": false
+        },
+        {
+            "source_path_from_root": "/articles/communication-services/concepts/voice-video-calling/media-streaming.md",
+            "redirect_url": "/azure/communication-services/concepts/call-automation/audio-streaming-concept",
+            "redirect_document_id": false
         }
     ]
 }
@@ -0,0 +1,53 @@
+---
+title: Audio streaming overview
+titleSuffix: An Azure Communication Services concept document
+description: Conceptual information about using Audio Streaming APIs with Call Automation.
+author: Alvin
+ms.service: azure-communication-services
+ms.topic: overview
+ms.date: 07/17/2024
+ms.author: alvinhan
+ms.custom: public_prview
+---
+
+# Audio streaming overview - audio subscription
+
+[!INCLUDE [Public Preview Disclaimer](../../includes/public-preview-include-document.md)]
+
+Azure Communication Services provides developers with Audio Streaming capabilities to get real-time access to audio streams to capture, analyze, and process audio content during active calls. In today's world consumption of live audio and video is prevalent, this content could be in the forms of online meetings, online conferences, customer support, etc.  With audio streaming access, developers can now build server applications to capture and analyze audio streams for each of the participants on the call in real-time. Developers can also combine audio streaming with other call automation actions or use their own AI models to analyze audio streams. Use cases include NLP for conversation analysis or providing real-time insights and suggestions to agents while they are in an active interaction with end users. 
+
+This public preview supports the ability for developers to get access to real-time audio streams over a WebSocket to analyze the call's audio in mixed and unmixed formats.
+
+## Common use cases
+Audio streams can be used in many ways. Some examples of how developers may wish to use the audio streams in their applications include:
+
+### Real-time call assistance
+
+**Improved AI powered suggestions** - Use real-time audio streams of active interactions between agents and customers to gauge the intent of the call and how your agents can provide a better experience to their customer through active suggestions using your own AI model to analyze the call.
+
+### Authentication
+
+**Biometric authentication** – Use the audio streams to carry out voice authentication, by running the audio from the call through your voice recognition/matching engine/tool.
+
+## Sample architecture for subscribing to audio streams from an ongoing call - live agent scenario
+
+[![Screenshot of architecture diagram for audio streaming.](./media/audio-streaming-diagram.png)](./media/audio-streaming-diagram.png#lightbox)
+
+## Supported formats
+
+### Mixed format
+Contains mixed audio of all participants on the call.  All audio is flattened into one stream.
+	
+### Unmixed
+Contains audio per participant per channel, with support for up to four channels for the four most dominant speakers at any point in a call. You'll also get a participantRawID that you can use to determine the speaker. 
+
+## Additional information
+The table below describes information that will help developers convert the audio packets into audible content that can be used by their applications.
+- Framerate: 50 frames per second
+- Packet stream rate: 20 ms rate
+- Data packet: 64 Kbytes
+- Audio metric: 16-bit PCM mono at 16000 hz
+- Public string data is a base64 string that should be converted into a byte array to create raw PCM file.
+
+## Next Steps
+Check out the [audio streaming quickstart](../../how-tos/call-automation/audio-streaming-quickstart.md) to learn more.
@@ -0,0 +1,64 @@
+---
+title: Audio streaming quickstart
+titleSuffix: An Azure Communication Services quickstart document
+description: Provides a quick start for developers to get audio streams through audio streaming APIs from Azure Communication Services calls.
+author: alvin
+ms.service: azure-communication-services
+ms.topic: include
+ms.date: 7/15/2024
+ms.author: alvinhan
+ms.custom: 
+services: azure-communication-services
+zone_pivot_groups: acs-js-csharp-java-python
+---
+
+# Quickstart: Server-side Audio Streaming
+
+[!INCLUDE [Public Preview Disclaimer](../../includes/public-preview-include-document.md)]
+
+Get started with using audio streams through Azure Communication Services Audio Streaming API. This quickstart assumes you're already familiar with Call Automation APIs to build an automated call routing solution. 
+
+Functionality described in this quickstart is currently in public preview.
+
+::: zone pivot="programming-language-csharp"
+[!INCLUDE [Audio Streaming with .NET](./includes//audio-streaming-quickstart-csharp.md)]
+::: zone-end
+
+::: zone pivot="programming-language-java"
+[!INCLUDE [Audio Streaming with Java](./includes/audio-streaming-quickstart-java.md)]
+::: zone-end
+
+::: zone pivot="programming-language-javascript"
+[!INCLUDE [Audio Streaming with JavaScript](./includes/audio-streaming-quickstart-js.md)]
+::: zone-end
+
+::: zone pivot="programming-language-python"
+[!INCLUDE [Audio Streaming with Python](./includes/audio-streaming-quickstart-python.md)]
+::: zone-end
+
+
+## Audio streaming schema
+After sending through the metadata packet, Azure Communication Services will start streaming audio media to your WebSocket server. Below is an example of what the media object your server will receive looks like. 
+
+``` code
+{
+    "kind": <string>, // What kind of data this is, e.g. AudioMetadata, AudioData.
+    "audioData":{
+        "data": <string>, // Base64 Encoded audio buffer data
+        "timestamp": <string>, // In ISO 8601 format (yyyy-mm-ddThh:mm:ssZ) 
+        "participantRawID": <string>, 
+        "silent": <boolean> // Indicates if the received audio buffer contains only silence.
+    }
+}
+```
+
+
+## Clean up resources
+
+If you want to clean up and remove a Communication Services subscription, you can delete the resource or resource group. Deleting the resource group also deletes any other resources associated with it. Learn more about [cleaning up resources](../../quickstarts/create-communication-resource.md#clean-up-resources).
+
+## Next steps
+- Learn more about [Audio Streaming](../../concepts/call-automation/audio-streaming-concept.md).
+- Learn more about [Call Automation](../../concepts/call-automation/call-automation.md) and its features. 
+- Learn more about [Play action](../../concepts/call-automation/play-action.md).
+- Learn more about [Recognize action](../../concepts/call-automation/recognize-action.md).
@@ -0,0 +1,139 @@
+---
+title: Include file - C#
+description: C# Audio Streaming quickstart
+services: azure-communication-services
+author: Alvin
+ms.service: azure-communication-services
+ms.subservice: call-automation
+ms.date: 07/15/2024
+ms.topic: include
+ms.topic: Include file
+ms.author: alvinhan
+---
+
+## Prerequisites
+- An Azure account with an active subscription, for details see [Create an account for free.](https://azure.microsoft.com/free/)
+- An Azure Communication Services resource. See [Create an Azure Communication Services resource](../../../quickstarts/create-communication-resource.md?tabs=windows&pivots=platform-azp).
+- A new web service application created using the [Call Automation SDK](../../../quickstarts/call-automation/callflows-for-customer-interactions.md).
+- The latest [.NET library](https://dotnet.microsoft.com/download/dotnet-core) for your operating system.
+- A websocket server that can receive media streams.
+
+## Set up a websocket server
+Azure Communication Services requires your server application to set up a WebSocket server to stream audio in real-time. WebSocket is a standardized protocol that provides a full-duplex communication channel over a single TCP connection. 
+You can optionally use Azure services Azure WebApps that allows you to create an application to receive audio streams over a websocket connection. Follow this [quickstart](https://azure.microsoft.com/blog/introduction-to-websockets-on-windows-azure-web-sites/).
+
+## Establish a call
+Establish a call and provide streaming details
+
+``` C#
+MediaStreamingOptions mediaStreamingOptions = new MediaStreamingOptions( 
+    new Uri("<WEBSOCKET URL>"), 
+    MediaStreamingContent.Audio, 
+    MediaStreamingAudioChannel.Mixed, 
+    MediaStreamingTransport.Websocket, 
+    false); 
+
+ var createCallOptions = new CreateCallOptions(callInvite, callbackUri) 
+ { 
+     CallIntelligenceOptions = new CallIntelligenceOptions() { CognitiveServicesEndpoint = new Uri(cognitiveServiceEndpoint) }, 
+     MediaStreamingOptions = mediaStreamingOptions, 
+ }; 
+
+ CreateCallResult createCallResult = await callAutomationClient.CreateCallAsync(createCallOptions); 
+```
+
+## Start audio streaming
+How to start audio streaming:
+``` C#
+StartMediaStreamingOptions options = new StartMediaStreamingOptions() 
+    { 
+        OperationCallbackUri = new Uri(callbackUriHost), 
+        OperationContext = "startMediaStreamingContext" 
+    };
+    await callMedia.StartMediaStreamingAsync(options); 
+```
+When Azure Communication Services receives the URL for your WebSocket server, it creates a connection to it. Once Azure Communication Services successfully connects to your WebSocket server and streaming is started, it will send through the first data packet, which contains metadata about the incoming media packets. 
+
+The metadata packet will look like this:
+``` code
+{ 
+    "kind": <string> // What kind of data this is, e.g. AudioMetadata, AudioData. 
+    "audioMetadata": { 
+        "subscriptionId": <string>, // unique identifier for a subscription request 
+        "encoding":<string>, // PCM only supported 
+        "sampleRate": <int>, // 16000 default 
+        "channels": <int>, // 1 default 
+        "length": <int> // 640 default 
+    } 
+} 
+```
+
+
+## Stop audio streaming
+How to stop audio streaming
+``` C#
+StopMediaStreamingOptions stopOptions = new StopMediaStreamingOptions() 
+    { 
+        OperationCallbackUri = new Uri(callbackUriHost) 
+    }; 
+    await callMedia.StopMediaStreamingAsync(stopOptions); 
+```
+
+## Handling audio streams in your websocket server
+The sample below demonstrates how to listen to audio streams using your websocket server.
+
+``` C#
+HttpListener httpListener = new HttpListener(); 
+httpListener.Prefixes.Add("http://localhost:80/"); 
+httpListener.Start(); 
+
+while (true) 
+{ 
+    HttpListenerContext httpListenerContext = await httpListener.GetContextAsync(); 
+    if (httpListenerContext.Request.IsWebSocketRequest) 
+    { 
+        WebSocketContext websocketContext; 
+        try 
+        { 
+            websocketContext = await httpListenerContext.AcceptWebSocketAsync(subProtocol: null); 
+        } 
+        catch (Exception ex) 
+        { 
+            return; 
+        } 
+        WebSocket webSocket = websocketContext.WebSocket; 
+        try 
+        { 
+            while (webSocket.State == WebSocketState.Open || webSocket.State == WebSocketState.CloseSent) 
+            { 
+                byte[] receiveBuffer = new byte[2048]; 
+                var cancellationToken = new CancellationTokenSource(TimeSpan.FromSeconds(60)).Token; 
+                WebSocketReceiveResult receiveResult = await webSocket.ReceiveAsync(new ArraySegment<byte>(receiveBuffer), cancellationToken); 
+                if (receiveResult.MessageType != WebSocketMessageType.Close) 
+                { 
+                    var data = Encoding.UTF8.GetString(receiveBuffer).TrimEnd('\0'); 
+                    try 
+                    { 
+                        var eventData = JsonConvert.DeserializeObject<AudioBaseClass>(data); 
+                        if (eventData != null) 
+                        { 
+                            if(eventData.kind == "AudioMetadata") 
+                            { 
+                                //Process audio metadata 
+                            } 
+                            else if(eventData.kind == "AudioData")  
+                            { 
+                                //Process audio data 
+                                var byteArray = eventData.audioData.data; 
+                               //use audio byteArray as you want 
+                            } 
+                        } 
+                    } 
+                    catch { } 
+                } 
+            } 
+        } 
+        catch (Exception ex) { } 
+    } 
+} 
+```
Original file line number	Diff line number	Diff line change
`@@ -440,6 +440,16 @@`
`440`	`440`	`"source_path_from_root": "/articles/communication-services/quickstarts/media-composition/get-started-media-composition.md",`
`441`	`441`	`"redirect_url": "/azure/communication-services/quickstarts/voice-video-calling/getting-started-with-calling",`
`442`	`442`	`"redirect_document_id": false`
	`443`	`+ },`
	`444`	`+ {`
	`445`	`+ "source_path_from_root": "/articles/communication-services/quickstarts/voice-video-calling/media-streaming.md",`
	`446`	`+ "redirect_url": "/azure/communication-services/how-tos/call-automation/audio-streaming-quickstart",`
	`447`	`+ "redirect_document_id": false`
	`448`	`+ },`
	`449`	`+ {`
	`450`	`+ "source_path_from_root": "/articles/communication-services/concepts/voice-video-calling/media-streaming.md",`
	`451`	`+ "redirect_url": "/azure/communication-services/concepts/call-automation/audio-streaming-concept",`
	`452`	`+ "redirect_document_id": false`
`443`	`453`	`}`
`444`	`454`	`]`
`445`	`455`	`}`