Commit 3f75345

Merge pull request #743 from eric-urban/eur/realtime-qs
real-time quickstart
2 parents 55a0908 + b849db7 commit 3f75345

5 files changed: 16 additions & 11 deletions

articles/ai-services/.openpublishing.redirection.ai-services.json

Lines changed: 5 additions & 0 deletions
@@ -135,6 +135,11 @@
         "redirect_url": "/azure/ai-services/openai/concepts/fine-tuning-considerations",
         "redirect_document_id": true
     },
+    {
+        "source_path_from_root": "/articles/ai-services/openai/how-to/audio-real-time.md",
+        "redirect_url": "/azure/ai-services/openai/realtime-audio-quickstart",
+        "redirect_document_id": false
+    },
     {
         "source_path_from_root": "/articles/ai-services/content-safety/concepts/incident-response.md",
         "redirect_url": "/azure/ai-services/content-safety/concepts/custom-categories-rapid",

articles/ai-services/openai/concepts/models.md

Lines changed: 1 addition & 1 deletion
@@ -229,7 +229,7 @@ The `gpt-4o-realtime-preview` model is part of the GPT-4o model family and suppo

 GPT-4o audio is available in the East US 2 (`eastus2`) and Sweden Central (`swedencentral`) regions. To use GPT-4o audio, you need to [create](../how-to/create-resource.md) or use an existing resource in one of the supported regions.

-When your resource is created, you can [deploy](../how-to/create-resource.md#deploy-a-model) the GPT-4o audio model. If you are performing a programmatic deployment, the **model** name is `gpt-4o-realtime-preview`. For more information on how to use GPT-4o audio, see the [GPT-4o audio documentation](../how-to/audio-real-time.md).
+When your resource is created, you can [deploy](../how-to/create-resource.md#deploy-a-model) the GPT-4o audio model. If you are performing a programmatic deployment, the **model** name is `gpt-4o-realtime-preview`. For more information on how to use GPT-4o audio, see the [GPT-4o audio documentation](../realtime-audio-quickstart.md).

 Details about maximum request tokens and training data are available in the following table.

articles/ai-services/openai/how-to/audio-real-time.md renamed to articles/ai-services/openai/realtime-audio-quickstart.md

Lines changed: 7 additions & 7 deletions
@@ -23,7 +23,7 @@ Most users of the Realtime API need to deliver and receive audio from an end-use

 Currently only `gpt-4o-realtime-preview` version: `2024-10-01-preview` supports real-time audio.

-The `gpt-4o-realtime-preview` model is available for global deployments in [East US 2 and Sweden Central regions](../concepts/models.md#global-standard-model-availability).
+The `gpt-4o-realtime-preview` model is available for global deployments in [East US 2 and Sweden Central regions](./concepts/models.md#global-standard-model-availability).

 > [!IMPORTANT]
 > The system stores your prompts and completions as described in the "Data Use and Access for Abuse Monitoring" section of the service-specific Product Terms for Azure OpenAI Service, except that the Limited Exception does not apply. Abuse monitoring will be turned on for use of the `gpt-4o-realtime-preview` API even for customers who otherwise are approved for modified abuse monitoring.
@@ -38,13 +38,13 @@ Support for the Realtime API was first added in API version `2024-10-01-preview`
 ## Prerequisites

 - An Azure subscription - <a href="https://azure.microsoft.com/free/cognitive-services" target="_blank">Create one for free</a>.
-- An Azure OpenAI resource created in a [supported region](#supported-models). For more information, see [Create a resource and deploy a model with Azure OpenAI](../how-to/create-resource.md).
+- An Azure OpenAI resource created in a [supported region](#supported-models). For more information, see [Create a resource and deploy a model with Azure OpenAI](./how-to/create-resource.md).

 ## Deploy a model for real-time audio

 Before you can use GPT-4o real-time audio, you need a deployment of the `gpt-4o-realtime-preview` model in a supported region as described in the [supported models](#supported-models) section.

-You can deploy the model from the [Azure AI Studio model catalog](../../../ai-studio/how-to/model-catalog-overview.md) or from your project in AI Studio. Follow these steps to deploy a `gpt-4o-realtime-preview` model from the model catalog:
+You can deploy the model from the [Azure AI Studio model catalog](../../ai-studio/how-to/model-catalog-overview.md) or from your project in AI Studio. Follow these steps to deploy a `gpt-4o-realtime-preview` model from the model catalog:

 1. Sign in to [AI Studio](https://ai.azure.com) and go to the **Home** page.
 1. Select **Model catalog** from the left sidebar.
@@ -71,13 +71,13 @@ To chat with your deployed `gpt-4o-realtime-preview` model in the [Azure AI Stud
 1. Select your deployed `gpt-4o-realtime-preview` model from the **Deployment** dropdown.
 1. Select **Enable microphone** to allow the browser to access your microphone. If you already granted permission, you can skip this step.

-:::image type="content" source="../media/how-to/real-time/real-time-playground.png" alt-text="Screenshot of the real-time audio playground with the deployed model selected." lightbox="../media/how-to/real-time/real-time-playground.png":::
+:::image type="content" source="./media/how-to/real-time/real-time-playground.png" alt-text="Screenshot of the real-time audio playground with the deployed model selected." lightbox="./media/how-to/real-time/real-time-playground.png":::

 1. Optionally you can edit contents in the **Give the model instructions and context** text box. Give the model instructions about how it should behave and any context it should reference when generating a response. You can describe the assistant's personality, tell it what it should and shouldn't answer, and tell it how to format responses.
 1. Optionally, change settings such as threshold, prefix padding, and silence duration.
 1. Select **Start listening** to start the session. You can speak into the microphone to start a chat.

-:::image type="content" source="../media/how-to/real-time/real-time-playground-start-listening.png" alt-text="Screenshot of the real-time audio playground with the start listening button and microphone access enabled." lightbox="../media/how-to/real-time/real-time-playground-start-listening.png":::
+:::image type="content" source="./media/how-to/real-time/real-time-playground-start-listening.png" alt-text="Screenshot of the real-time audio playground with the start listening button and microphone access enabled." lightbox="./media/how-to/real-time/real-time-playground-start-listening.png":::

 1. You can interrupt the chat at any time by speaking. You can end the chat by selecting the **Stop listening** button.

@@ -129,5 +129,5 @@ You can run the sample code locally on your machine by following these steps. Re

 ## Related content

-* Learn more about Azure OpenAI [deployment types](./deployment-types.md)
-* Learn more about Azure OpenAI [quotas and limits](../quotas-limits.md)
+* Learn more about Azure OpenAI [deployment types](./how-to/deployment-types.md)
+* Learn more about Azure OpenAI [quotas and limits](quotas-limits.md)
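
The link edits in this renamed file all follow from moving it up one directory, from `openai/how-to/` to `openai/`. As a minimal sketch (not part of the commit), the Python check below resolves each old and new relative link against its containing file and confirms that both point at the same target. The helper names `resolve` and `mismatches` are hypothetical and introduced only for illustration; the paths are copied from the hunks above.

```python
import posixpath

# Old and new locations of the renamed article, as shown in the diff.
OLD = "articles/ai-services/openai/how-to/audio-real-time.md"
NEW = "articles/ai-services/openai/realtime-audio-quickstart.md"

# (link as written in the old file, link as written in the new file)
LINK_PAIRS = [
    ("../concepts/models.md#global-standard-model-availability",
     "./concepts/models.md#global-standard-model-availability"),
    ("../how-to/create-resource.md", "./how-to/create-resource.md"),
    ("../../../ai-studio/how-to/model-catalog-overview.md",
     "../../ai-studio/how-to/model-catalog-overview.md"),
    ("../media/how-to/real-time/real-time-playground.png",
     "./media/how-to/real-time/real-time-playground.png"),
    ("./deployment-types.md", "./how-to/deployment-types.md"),
    ("../quotas-limits.md", "quotas-limits.md"),
]


def resolve(doc_path: str, link: str) -> str:
    """Resolve a relative link against the directory of the file containing it.

    Hypothetical helper: drops any #fragment and normalizes '.' and '..'.
    """
    target = link.split("#", 1)[0]
    base = posixpath.dirname(doc_path)
    return posixpath.normpath(posixpath.join(base, target))


def mismatches(pairs):
    """Return the link pairs whose old and new forms resolve to different files."""
    return [(old, new) for old, new in pairs
            if resolve(OLD, old) != resolve(NEW, new)]


if __name__ == "__main__":
    for old, new in LINK_PAIRS:
        print(f"{old} -> {new}: {resolve(NEW, new)}")
    print("mismatches:", mismatches(LINK_PAIRS))
```

An empty `mismatches` list indicates that every rewritten link still targets the file it pointed to before the move. The check is purely textual: it does not verify that the target files exist in the repository.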

articles/ai-services/openai/toc.yml

Lines changed: 2 additions & 2 deletions
@@ -38,6 +38,8 @@ items:
   href: dall-e-quickstart.md
 - name: Use your data
   href: use-your-data-quickstart.md
+- name: Realtime API for speech and audio (preview)
+  href: realtime-audio-quickstart.md
 - name: Whisper model
   href: whisper-quickstart.md
 - name: Text to speech (preview)
@@ -162,8 +164,6 @@ items:
 - name: Function calling
   href: ./how-to/fine-tuning-functions.md
   displayName: fine-tuning, finetuning
-- name: Realtime API for speech and audio (Preview)
-  href: ./how-to/audio-real-time.md
 - name: Use your data
   items:
   - name: Text data

articles/ai-services/openai/whats-new.md

Lines changed: 1 addition & 1 deletion
@@ -44,7 +44,7 @@ Azure OpenAI GPT-4o audio is part of the GPT-4o model family that supports low-l

 The `gpt-4o-realtime-preview` model is available for global deployments in [East US 2 and Sweden Central regions](./concepts/models.md#global-standard-model-availability).

-For more information, see the [GPT-4o real-time audio documentation](./how-to/audio-real-time.md).
+For more information, see the [GPT-4o real-time audio documentation](realtime-audio-quickstart.md).

 ### Global batch support updates
