Commit 8b6dfac

gpt-4o real-time audio preview
1 parent f766e0c commit 8b6dfac

1 file changed

+16
-13
lines changed

articles/ai-services/openai/how-to/audio-real-time.md

Lines changed: 16 additions & 13 deletions
@@ -1,7 +1,7 @@
 ---
-title: 'How to use GPT-4o real-time audio with Azure OpenAI Service'
+title: 'How to use GPT-4o Realtime API for speech and audio with Azure OpenAI Service'
 titleSuffix: Azure OpenAI
-description: Learn how to use GPT-4o real-time audio with Azure OpenAI Service.
+description: Learn how to use GPT-4o Realtime API for speech and audio with Azure OpenAI Service.
 manager: nitinme
 ms.service: azure-ai-openai
 ms.topic: how-to
@@ -12,11 +12,11 @@ ms.custom: references_regions
 recommendations: false
 ---
 
-# GPT-4o real-time audio
+# GPT-4o Realtime API for speech and audio
 
-Azure OpenAI GPT-4o audio is part of the GPT-4o model family that supports low-latency, "speech in, speech out" conversational interactions. The GPT-4o audio `realtime` API is designed to handle real-time, low-latency conversational interactions, making it a great fit for use cases involving live interactions between a user and a model, such as customer support agents, voice assistants, and real-time translators.
+Azure OpenAI GPT-4o Realtime API for speech and audio is part of the GPT-4o model family that supports low-latency, "speech in, speech out" conversational interactions. The GPT-4o audio `realtime` API is designed to handle real-time, low-latency conversational interactions, making it a great fit for use cases involving live interactions between a user and a model, such as customer support agents, voice assistants, and real-time translators.
 
-Most users of this API need to deliver and receive audio from an end-user in real time, including applications that use WebRTC or a telephony system. The real-time API isn't designed to connect directly to end user devices and relies on client integrations to terminate end user audio streams.
+Most users of this API need to deliver and receive audio from an end-user in real time, including applications that use WebRTC or a telephony system. The Realtime API isn't designed to connect directly to end user devices and relies on client integrations to terminate end user audio streams.
 
 ## Supported models
 
@@ -29,7 +29,7 @@ The `gpt-4o-realtime-preview` model is available for global deployments in [East
 
 ## API support
 
-Support for real-time audio was first added in API version `2024-10-01-preview`.
+Support for the Realtime API was first added in API version `2024-10-01-preview`.
 
 > [!NOTE]
 > For more information about the API and architecture, see the [Azure OpenAI GPT-4o real-time audio repository on GitHub](https://github.com/azure-samples/aoai-realtime-audio-sdk).
@@ -56,15 +56,18 @@ You can deploy the model from the Azure OpenAI model catalog or from your projec
 
 Now that you have a deployment of the `gpt-4o-realtime-preview` model, you can use the playground to interact with the model in real time. Select **Early access playground** from the list of playgrounds in the left pane.
 
-## Use the GPT-4o real-time audio API
+## Use the GPT-4o Realtime API
 
 > [!TIP]
 > A playground for GPT-4o real-time audio is coming soon to [Azure AI Studio](https://ai.azure.com). You can already use the API directly in your application.
 
-Right now, the fastest way to get started with GPT-4o real-time audio is to download the sample code from the [Azure OpenAI GPT-4o real-time audio repository on GitHub](https://github.com/azure-samples/aoai-realtime-audio-sdk).
+Right now, the fastest way to get started with the GPT-4o Realtime API is to download the sample code from the [Azure OpenAI GPT-4o real-time audio repository on GitHub](https://github.com/azure-samples/aoai-realtime-audio-sdk).
+
+The JavaScript web sample demonstrates how to use the GPT-4o Realtime API to interact with the model in real time. The sample code includes a simple web interface that captures audio from the user's microphone and sends it to the model for processing. The model responds with text and audio, which the sample code renders in the web interface.
+
+You can run the sample code locally on your machine by following these steps. Refer to the [repository on GitHub](https://github.com/azure-samples/aoai-realtime-audio-sdk) for the most up-to-date instructions.
+1. If you don't have Node.js installed, download and install the [LTS version of Node.js](https://nodejs.org/).
 
-The JavaScript web sample demonstrates how to use the GPT-4o real-time audio API to interact with the model in real time. The sample code includes a simple web interface that captures audio from the user's microphone and sends it to the model for processing. The model responds with text and audio, which the sample code renders in the web interface.
-
 1. Clone the repository to your local machine:
 
 ```bash
@@ -74,12 +77,12 @@ The JavaScript web sample demonstrates how to use the GPT-4o real-time audio API
 1. Go to the `javascript/samples/web` folder in your preferred code editor.
 
 ```bash
-cd .\javascript\samples\web\
+cd ./javascript/samples
 ```
 
-1. If you don't have Node.js installed, download and install the [LTS version of Node.js](https://nodejs.org/).
+1. Run `download-pkg.ps1` or `download-pkg.sh` to download the required packages.
 
-1. Run `npm install` to download a few dependency packages. For more information, see the `package.json` file in the same `web` folder.
+1. Run `npm install` to install package dependencies.
 
 1. Run `npm run dev` to start the web server, navigating any firewall permissions prompts as needed.
 1. Go to any of the provided URIs from the console output (such as `http://localhost:5173/`) in a browser.
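
For reference, the setup steps as they read after this commit can be sketched as a small dry-run script. This is an illustration only, not part of the commit. It assumes that Node.js is already installed, that the clone URL is the repository link from the article with `.git` appended (the actual clone command falls outside the hunks shown), and that `download-pkg.sh` is run from `javascript/samples`. `DRY_RUN`, `run`, and `setup_sample` are hypothetical names introduced here.

```bash
#!/usr/bin/env bash
# Dry-run sketch of the setup steps listed in the updated article.
# Assumptions (not confirmed by the diff): the clone URL is derived from the
# repository link in the article, and download-pkg.sh sits in javascript/samples.

DRY_RUN="${DRY_RUN:-1}"   # 1 = only print the commands, 0 = execute them
REPO_URL="https://github.com/azure-samples/aoai-realtime-audio-sdk.git"

# Print a command, and run it only when DRY_RUN is 0.
run() {
  echo "+ $*"
  if [ "$DRY_RUN" = "0" ]; then
    "$@"
  fi
}

# Walk through the steps in order, stopping at the first failure.
setup_sample() {
  run git clone "$REPO_URL" &&
    run cd aoai-realtime-audio-sdk/javascript/samples &&
    run bash ./download-pkg.sh &&
    run npm install &&
    run npm run dev
}

setup_sample
```

By default the script only prints the commands it would run; setting `DRY_RUN=0` executes them. The order mirrors the steps as they appear in the new version of the article, and the repository's own instructions remain the authoritative source if the two differ.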
