---
manager: nitinme
author: eric-urban
ms.author: eur
ms.service: azure-ai-openai
ms.topic: include
ms.date: 12/26/2024
---

## Prerequisites

- An Azure subscription - <a href="https://azure.microsoft.com/free/cognitive-services" target="_blank">Create one for free</a>
- <a href="https://nodejs.org/" target="_blank">Node.js LTS</a> with ECMAScript module (ESM) support.
- An Azure OpenAI resource created in the East US 2 or Sweden Central regions. See [Region availability](/azure/ai-services/openai/concepts/models#model-summary-table-and-region-availability).
- A `gpt-4o-realtime-preview` model deployment on your Azure OpenAI resource. For more information, see [Create a resource and deploy a model with Azure OpenAI](../how-to/create-resource.md).

## Microsoft Entra ID prerequisites

For the recommended keyless authentication with Microsoft Entra ID, you need to:
- Install the [Azure CLI](/cli/azure/install-azure-cli) used for keyless authentication with Microsoft Entra ID.
- Assign the `Cognitive Services User` role to your user account. You can assign roles in the Azure portal under **Access control (IAM)** > **Add role assignment**, or with the Azure CLI as shown after this list.

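For example, the following Azure CLI command assigns the role to your user account. The principal name and scope values are placeholders for your own account, subscription, resource group, and Azure OpenAI resource name:

```shell
az role assignment create --role "Cognitive Services User" --assignee "<your-user-principal-name>" --scope "/subscriptions/<subscription-id>/resourceGroups/<resource-group-name>/providers/Microsoft.CognitiveServices/accounts/<azure-openai-resource-name>"
```
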
## Deploy a model for real-time audio

[!INCLUDE [Deploy model](realtime-deploy-model.md)]

## Set up

1. Create a new folder `realtime-audio-quickstart` to contain the application and open Visual Studio Code in that folder with the following command:

    ```shell
    mkdir realtime-audio-quickstart && code realtime-audio-quickstart
    ```

1. Create the `package.json` with the following command:

    ```shell
    npm init -y
    ```

1. Update the `package.json` to use ECMAScript modules with the following command:

    ```shell
    npm pkg set type=module
    ```

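    This command adds `"type": "module"` to `package.json`, which tells Node.js to treat the project's `.js` files as ECMAScript modules (required for the `import` statements used later in this quickstart). The added entry looks like this:

    ```json
    {
      "type": "module"
    }
    ```
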
1. Install the real-time audio client library for JavaScript with:

    ```console
    npm install https://github.com/Azure-Samples/aoai-realtime-audio-sdk/releases/download/js/v0.5.2/rt-client-0.5.2.tgz
    ```

1. For the **recommended** keyless authentication with Microsoft Entra ID, install the `@azure/identity` package with:

    ```console
    npm install @azure/identity
    ```

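1. The quickstart code loads its settings from a `.env` file by using the `dotenv` package, so install it as well:

    ```console
    npm install dotenv
    ```
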
## Retrieve resource information

#### [Microsoft Entra ID](#tab/javascript-keyless)

[!INCLUDE [keyless-environment-variables](env-var-without-key.md)]

#### [API key](#tab/javascript-key)

[!INCLUDE [key-environment-variables](env-var-key.md)]

---

> [!CAUTION]
> To use the recommended keyless authentication with the SDK, make sure that the `AZURE_OPENAI_API_KEY` environment variable isn't set.

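Because the quickstart code calls `dotenv.config()`, you can optionally keep these values in a `.env` file in the project folder instead of setting environment variables in your shell. For example, with a placeholder endpoint value (add `AZURE_OPENAI_API_KEY` only if you use API key authentication):

```text
AZURE_OPENAI_ENDPOINT=https://your-resource-name.openai.azure.com
```
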
## Text in audio out

#### [Microsoft Entra ID](#tab/javascript-keyless)

1. Create the `text-in-audio-out.js` file with the following code:

    ```javascript
    import { DefaultAzureCredential } from "@azure/identity";
    import { LowLevelRTClient } from "rt-client";
    import dotenv from "dotenv";
    dotenv.config();

    async function text_in_audio_out() {
        // Set environment variables or edit the corresponding values here.
        const endpoint = process.env["AZURE_OPENAI_ENDPOINT"] || "yourEndpoint";
        const deployment = "gpt-4o-realtime-preview";
        if (!endpoint || !deployment) {
            throw new Error("You didn't set the environment variables.");
        }
        const client = new LowLevelRTClient(new URL(endpoint), new DefaultAzureCredential(), { deployment: deployment });
        try {
            // Ask the model for a response that includes both audio and a text transcript.
            await client.send({
                type: "response.create",
                response: {
                    modalities: ["audio", "text"],
                    instructions: "Please assist the user."
                }
            });
            // Stream server events until the response finishes or an error occurs.
            for await (const message of client.messages()) {
                switch (message.type) {
                    case "response.done": {
                        break;
                    }
                    case "error": {
                        console.error(message.error);
                        break;
                    }
                    case "response.audio_transcript.delta": {
                        console.log(`Received text delta: ${message.delta}`);
                        break;
                    }
                    case "response.audio.delta": {
                        const buffer = Buffer.from(message.delta, "base64");
                        console.log(`Received ${buffer.length} bytes of audio data.`);
                        break;
                    }
                }
                if (message.type === "response.done" || message.type === "error") {
                    break;
                }
            }
        }
        finally {
            client.close();
        }
    }

    await text_in_audio_out();
    ```

1. Sign in to Azure with the following command:

    ```shell
    az login
    ```

1. Run the JavaScript file.

    ```shell
    node text-in-audio-out.js
    ```

#### [API key](#tab/javascript-key)

1. Create the `text-in-audio-out.js` file with the following code:

    ```javascript
    import { AzureKeyCredential } from "@azure/core-auth";
    import { LowLevelRTClient } from "rt-client";
    import dotenv from "dotenv";
    dotenv.config();

    async function text_in_audio_out() {
        // Set environment variables or edit the corresponding values here.
        const apiKey = process.env["AZURE_OPENAI_API_KEY"] || "yourKey";
        const endpoint = process.env["AZURE_OPENAI_ENDPOINT"] || "yourEndpoint";
        const deployment = "gpt-4o-realtime-preview";
        if (!endpoint || !deployment) {
            throw new Error("You didn't set the environment variables.");
        }
        const client = new LowLevelRTClient(new URL(endpoint), new AzureKeyCredential(apiKey), { deployment: deployment });
        try {
            // Ask the model for a response that includes both audio and a text transcript.
            await client.send({
                type: "response.create",
                response: {
                    modalities: ["audio", "text"],
                    instructions: "Please assist the user."
                }
            });
            // Stream server events until the response finishes or an error occurs.
            for await (const message of client.messages()) {
                switch (message.type) {
                    case "response.done": {
                        break;
                    }
                    case "error": {
                        console.error(message.error);
                        break;
                    }
                    case "response.audio_transcript.delta": {
                        console.log(`Received text delta: ${message.delta}`);
                        break;
                    }
                    case "response.audio.delta": {
                        const buffer = Buffer.from(message.delta, "base64");
                        console.log(`Received ${buffer.length} bytes of audio data.`);
                        break;
                    }
                }
                if (message.type === "response.done" || message.type === "error") {
                    break;
                }
            }
        }
        finally {
            client.close();
        }
    }

    await text_in_audio_out();
    ```

1. Run the JavaScript file.

    ```shell
    node text-in-audio-out.js
    ```

---

Wait a few moments to get the response.

## Output

The script gets a response from the model and prints the transcript and audio data received.

The output will look similar to the following:

```console
Received text delta: Hello
Received text delta: !
Received text delta: How
Received text delta: can
Received text delta: I
Received 4800 bytes of audio data.
Received 7200 bytes of audio data.
Received text delta: help
Received 12000 bytes of audio data.
Received text delta: you
Received text delta: today
Received text delta: ?
Received 12000 bytes of audio data.
Received 12000 bytes of audio data.
Received 12000 bytes of audio data.
Received 24000 bytes of audio data.
```

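The audio deltas are base64-encoded PCM chunks. If you'd like to play back the audio rather than only counting bytes, one option is to collect the decoded chunks and wrap them in a WAV header after the response completes. The following is a minimal sketch that assumes the default `pcm16` output format (24-kHz, 16-bit, mono); the `writeWavFile` helper is illustrative and isn't part of the quickstart code:

```javascript
import { writeFileSync } from "node:fs";

// Illustrative helper: wrap raw 16-bit mono PCM in a minimal WAV header so the
// collected audio deltas can be played with a standard media player.
function writeWavFile(path, pcmBuffer, sampleRate = 24000) {
    const header = Buffer.alloc(44);
    header.write("RIFF", 0);
    header.writeUInt32LE(36 + pcmBuffer.length, 4);
    header.write("WAVE", 8);
    header.write("fmt ", 12);
    header.writeUInt32LE(16, 16);             // fmt chunk size
    header.writeUInt16LE(1, 20);              // audio format: PCM
    header.writeUInt16LE(1, 22);              // channels: mono
    header.writeUInt32LE(sampleRate, 24);     // sample rate
    header.writeUInt32LE(sampleRate * 2, 28); // byte rate for 16-bit mono
    header.writeUInt16LE(2, 32);              // block align
    header.writeUInt16LE(16, 34);             // bits per sample
    header.write("data", 36);
    header.writeUInt32LE(pcmBuffer.length, 40);
    writeFileSync(path, Buffer.concat([header, pcmBuffer]));
}

// Usage idea: push Buffer.from(message.delta, "base64") into an array in the
// "response.audio.delta" case, then call this once after "response.done":
// writeWavFile("response.wav", Buffer.concat(audioChunks));
```
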
## Web application sample

Our JavaScript web sample [on GitHub](https://github.com/azure-samples/aoai-realtime-audio-sdk) demonstrates how to use the GPT-4o Realtime API to interact with the model in real time. The sample code includes a simple web interface that captures audio from the user's microphone and sends it to the model for processing. The model responds with text and audio, which the sample code renders in the web interface.

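To give a rough idea of what the microphone capture side involves, here's a minimal browser-side sketch (not the sample's actual implementation) that uses the Web Audio API to deliver 16-bit PCM chunks to a callback, which an app could then base64-encode and stream to the Realtime API:

```javascript
// Minimal sketch, not the sample's code: capture microphone audio and pass
// 16-bit PCM chunks to a callback supplied by the caller.
async function captureMicrophone(onPcmChunk) {
    const stream = await navigator.mediaDevices.getUserMedia({ audio: true });
    const audioContext = new AudioContext({ sampleRate: 24000 });
    const source = audioContext.createMediaStreamSource(stream);
    // ScriptProcessorNode keeps the sketch short; production code would
    // typically use an AudioWorklet instead.
    const processor = audioContext.createScriptProcessor(4096, 1, 1);
    processor.onaudioprocess = (event) => {
        const float32 = event.inputBuffer.getChannelData(0);
        const pcm16 = new Int16Array(float32.length);
        for (let i = 0; i < float32.length; i++) {
            const s = Math.max(-1, Math.min(1, float32[i]));
            pcm16[i] = s < 0 ? s * 0x8000 : s * 0x7fff;
        }
        onPcmChunk(pcm16.buffer);
    };
    source.connect(processor);
    processor.connect(audioContext.destination);
}
```
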
You can run the sample code locally on your machine by following these steps. Refer to the [repository on GitHub](https://github.com/azure-samples/aoai-realtime-audio-sdk) for the most up-to-date instructions.

1. If you don't have Node.js installed, download and install the [LTS version of Node.js](https://nodejs.org/).

1. Clone the repository to your local machine:

    ```bash
    git clone https://github.com/Azure-Samples/aoai-realtime-audio-sdk.git
    ```

1. Go to the `javascript/samples` folder in your preferred code editor.

    ```bash
    cd ./javascript/samples
    ```

1. Run `download-pkg.ps1` or `download-pkg.sh` to download the required packages.

1. Go to the `web` folder from the `./javascript/samples` folder.

    ```bash
    cd ./web
    ```

1. Run `npm install` to install package dependencies.

1. Run `npm run dev` to start the web server, accepting any firewall permission prompts as needed.
1. Go to any of the provided URIs from the console output (such as `http://localhost:5173/`) in a browser.
1. Enter the following information in the web interface:
    - **Endpoint**: The resource endpoint of an Azure OpenAI resource. You don't need to append the `/realtime` path. An example structure might be `https://my-azure-openai-resource-from-portal.openai.azure.com`.
    - **API Key**: A corresponding API key for the Azure OpenAI resource.
    - **Deployment**: The name of the `gpt-4o-realtime-preview` model that [you deployed in the previous section](#deploy-a-model-for-real-time-audio).
    - **System Message**: Optionally, you can provide a system message such as "You always talk like a friendly pirate."
    - **Temperature**: Optionally, you can provide a custom temperature.
    - **Voice**: Optionally, you can select a voice.
1. Select the **Record** button to start the session. Accept permissions to use your microphone if prompted.
1. You should see a `<< Session Started >>` message in the main output. Then you can speak into the microphone to start a chat.
1. You can interrupt the chat at any time by speaking. You can end the chat by selecting the **Stop** button.