You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/openai/how-to/audio-real-time.md
+8-3Lines changed: 8 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -5,7 +5,7 @@ description: Learn how to use GPT-4o Realtime API for speech and audio with Azur
5
5
manager: nitinme
6
6
ms.service: azure-ai-openai
7
7
ms.topic: how-to
8
-
ms.date: 10/1/2024
8
+
ms.date: 10/3/2024
9
9
author: eric-urban
10
10
ms.author: eur
11
11
ms.custom: references_regions
@@ -57,9 +57,10 @@ You can deploy the model from the [Azure AI Studio model catalog](../../../ai-st
57
57
58
58
Now that you have a deployment of the `gpt-4o-realtime-preview` model, you can use the AI Studio **Real-time audio** playground or Realtime API to interact with it in real time.
59
59
60
-
## Use the GPT-4o Realtime API
60
+
## Use the GPT-4o real-time audio
61
61
62
-
Right now, the fastest way to get started with the GPT-4o Realtime API is to download the sample code from the [Azure OpenAI GPT-4o real-time audio repository on GitHub](https://github.com/azure-samples/aoai-realtime-audio-sdk).
62
+
> [!TIP]
63
+
> Right now, the fastest way to get started development with the GPT-4o Realtime API is to download the sample code from the [Azure OpenAI GPT-4o real-time audio repository on GitHub](https://github.com/azure-samples/aoai-realtime-audio-sdk).
63
64
64
65
::: zone pivot="programming-language-ai-studio"
65
66
@@ -68,12 +69,16 @@ To chat with your deployed `gpt-4o-realtime-preview` model in the [Azure AI Stud
68
69
1. Go to your project in [Azure AI Studio](https://ai.azure.com).
69
70
1. Select **Playgrounds** > **Real-time audio** from the left pane.
70
71
1. Select your deployed `gpt-4o-realtime-preview` model from the **Deployment** dropdown.
72
+
1. Select **Enable microphone** to allow the browser to access your microphone. If you already granted permission, you can skip this step.
71
73
72
74
:::image type="content" source="../media/how-to/real-time/real-time-playground.png" alt-text="Screenshot of the real-time audio playground with the deployed model selected." lightbox="../media/how-to/real-time/real-time-playground.png":::
73
75
74
76
1. Optionally you can edit contents in the **Give the model instructions and context** text box. Give the model instructions about how it should behave and any context it should reference when generating a response. You can describe the assistant's personality, tell it what it should and shouldn't answer, and tell it how to format responses.
75
77
1. Optionally, change settings such as threshold, prefix padding, and silence duration.
76
78
1. Select **Start listening** to start the session. You can speak into the microphone to start a chat.
79
+
80
+
:::image type="content" source="../media/how-to/real-time/real-time-playground-start-listening.png" alt-text="Screenshot of the real-time audio playground with the start listening button and microphone access enabled." lightbox="../media/how-to/real-time/real-time-playground-start-listening.png":::
81
+
77
82
1. You can interrupt the chat at any time by speaking. You can end the chat by selecting the **Stop listening** button.
0 commit comments