Skip to content

Commit e361c2c

Browse files
Merge pull request #7519 from PatrickFarley/aoai-audio
add realtime-mini mentions
2 parents 8d8ed59 + 1b1ce9d commit e361c2c

File tree

8 files changed

+16
-9
lines changed

8 files changed

+16
-9
lines changed

articles/ai-foundry/openai/how-to/prompt-caching.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -72,9 +72,9 @@ Prompt caching is supported for:
7272

7373
|**Caching supported**|**Description**|**Supported models**|
7474
|--------|--------|--------|
75-
| **Messages** | The complete messages array: system, developer, user, and assistant content | `gpt-4o`<br/>`gpt-4o-mini`<br/>`gpt-4o-realtime-preview` (version 2024-12-17)<br/>`gpt-4o-mini-realtime-preview` (version 2024-12-17)<br>`gpt-realtime` (version 2025-08-28)<br> `o1` (version 2024-12-17) <br> `o3-mini` (version 2025-01-31) |
75+
| **Messages** | The complete messages array: system, developer, user, and assistant content | `gpt-4o`<br/>`gpt-4o-mini`<br/>`gpt-4o-realtime-preview` (version 2024-12-17)<br/>`gpt-4o-mini-realtime-preview` (version 2024-12-17)<br>`gpt-realtime` (version 2025-08-28)<br> `gpt-realtime-mini` (version 2025-10-06)<br>`o1` (version 2024-12-17) <br> `o3-mini` (version 2025-01-31) |
7676
| **Images** | Images included in user messages, both as links or as base64-encoded data. The detail parameter must be set the same across requests. | `gpt-4o`<br/>`gpt-4o-mini` <br> `o1` (version 2024-12-17) |
77-
| **Tool use** | Both the messages array and tool definitions. | `gpt-4o`<br/>`gpt-4o-mini`<br/>`gpt-4o-realtime-preview` (version 2024-12-17)<br/>`gpt-4o-mini-realtime-preview` (version 2024-12-17)<br>`gpt-realtime` (version 2025-08-28)<br> `o1` (version 2024-12-17) <br> `o3-mini` (version 2025-01-31) |
77+
| **Tool use** | Both the messages array and tool definitions. | `gpt-4o`<br/>`gpt-4o-mini`<br/>`gpt-4o-realtime-preview` (version 2024-12-17)<br/>`gpt-4o-mini-realtime-preview` (version 2024-12-17)<br>`gpt-realtime` (version 2025-08-28)<br>`gpt-realtime-mini` (version 2025-10-06)<br> `o1` (version 2024-12-17) <br> `o3-mini` (version 2025-01-31) |
7878
| **Structured outputs** | Structured output schema is appended as a prefix to the system message. | `gpt-4o`<br/>`gpt-4o-mini` <br> `o1` (version 2024-12-17) <br> `o3-mini` (version 2025-01-31) |
7979

8080
To improve the likelihood of cache hits occurring, you should structure your requests such that repetitive content occurs at the beginning of the messages array.

articles/ai-foundry/openai/how-to/realtime-audio-webrtc.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -34,6 +34,7 @@ The GPT real-time models are available for global deployments in [East US 2 and
3434
- `gpt-4o-mini-realtime-preview` (2024-12-17)
3535
- `gpt-4o-realtime-preview` (2024-12-17)
3636
- `gpt-realtime` (version 2025-08-28)
37+
- `gpt-realtime-mini` (version 2025-10-06)
3738

3839
You should use API version `2025-04-01-preview` in the URL for the Realtime API. The API version is included in the sessions URL.
3940

@@ -45,7 +46,7 @@ Before you can use GPT real-time audio, you need:
4546

4647
- An Azure subscription - <a href="https://azure.microsoft.com/free/cognitive-services" target="_blank">Create one for free</a>.
4748
- An Azure OpenAI resource created in a [supported region](#supported-models). For more information, see [Create a resource and deploy a model with Azure OpenAI](create-resource.md).
48-
- You need a deployment of the `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, or `gpt-realtime` model in a supported region as described in the [supported models](#supported-models) section in this article. You can deploy the model from the [Azure AI Foundry model catalog](../../../ai-foundry/how-to/model-catalog-overview.md) or from your project in Azure AI Foundry portal.
49+
- You need a deployment of the `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, `gpt-realtime`, or `gpt-realtime-mini` model in a supported region as described in the [supported models](#supported-models) section in this article. You can deploy the model from the [Azure AI Foundry model catalog](../../../ai-foundry/how-to/model-catalog-overview.md) or from your project in Azure AI Foundry portal.
4950

5051
## Connection and authentication
5152

articles/ai-foundry/openai/how-to/realtime-audio-websockets.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -31,6 +31,7 @@ The GPT real-time models are available for global deployments in [East US 2 and
3131
- `gpt-4o-mini-realtime-preview` (2024-12-17)
3232
- `gpt-4o-realtime-preview` (2024-12-17)
3333
- `gpt-realtime` (version 2025-08-28)
34+
- `gpt-realtime-mini` (version 2025-10-06)
3435

3536
You should use API version `2025-04-01-preview` in the URL for the Realtime API.
3637

@@ -42,7 +43,7 @@ Before you can use GPT real-time audio, you need:
4243

4344
- An Azure subscription - <a href="https://azure.microsoft.com/free/cognitive-services" target="_blank">Create one for free</a>.
4445
- An Azure OpenAI resource created in a [supported region](#supported-models). For more information, see [Create a resource and deploy a model with Azure OpenAI](create-resource.md).
45-
- You need a deployment of the `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, or `gpt-realtime` model in a supported region as described in the [supported models](#supported-models) section. You can deploy the model from the [Azure AI Foundry portal model catalog](../../../ai-foundry/how-to/model-catalog-overview.md) or from your project in Azure AI Foundry portal.
46+
- You need a deployment of the `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, `gpt-realtime`, or `gpt-realtime-mini` model in a supported region as described in the [supported models](#supported-models) section. You can deploy the model from the [Azure AI Foundry portal model catalog](../../../ai-foundry/how-to/model-catalog-overview.md) or from your project in Azure AI Foundry portal.
4647

4748
## Connection and authentication
4849

articles/ai-foundry/openai/how-to/realtime-audio.md

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -29,6 +29,7 @@ The GPT real-time models are available for global deployments in [East US 2 and
2929
- `gpt-4o-mini-realtime-preview` (2024-12-17)
3030
- `gpt-4o-realtime-preview` (2024-12-17)
3131
- `gpt-realtime` (version 2025-08-28)
32+
- `gpt-realtime-mini` (version 2025-10-06)
3233

3334
You should use API version `2025-04-01-preview` in the URL for the Realtime API.
3435

@@ -40,10 +41,10 @@ Before you can use GPT real-time audio, you need:
4041

4142
- An Azure subscription - <a href="https://azure.microsoft.com/free/cognitive-services" target="_blank">Create one for free</a>.
4243
- An Azure OpenAI resource created in a [supported region](#supported-models). For more information, see [Create a resource and deploy a model with Azure OpenAI](create-resource.md).
43-
- You need a deployment of the `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, or `gpt-realtime` model in a supported region as described in the [supported models](#supported-models) section. You can deploy the model from the [Azure AI Foundry portal model catalog](../../../ai-foundry/how-to/model-catalog-overview.md) or from your project in Azure AI Foundry portal.
44+
- You need a deployment of the `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, `gpt-realtime`, or `gpt-realtime-mini` model in a supported region as described in the [supported models](#supported-models) section. You can deploy the model from the [Azure AI Foundry portal model catalog](../../../ai-foundry/how-to/model-catalog-overview.md) or from your project in Azure AI Foundry portal.
4445

4546
Here are some of the ways you can get started with the GPT Realtime API for speech and audio:
46-
- For steps to deploy and use the `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, or `gpt-realtime` model, see [the real-time audio quickstart](../realtime-audio-quickstart.md).
47+
- For steps to deploy and use the `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, `gpt-realtime`, or `gpt-realtime-mini` model, see [the real-time audio quickstart](../realtime-audio-quickstart.md).
4748
- Try the [WebRTC via HTML and JavaScript example](./realtime-audio-webrtc.md#webrtc-example-via-html-and-javascript) to get started with the Realtime API via WebRTC.
4849
- [The Azure-Samples/aisearch-openai-rag-audio repo](https://github.com/Azure-Samples/aisearch-openai-rag-audio) contains an example of how to implement RAG support in applications that use voice as their user interface, powered by the GPT realtime API for audio.
4950

@@ -281,7 +282,7 @@ A user might want to interrupt the assistant's response or ask the assistant to
281282

282283
## Image input
283284

284-
The `gpt-realtime` model supports image input as part of the conversation. The model can ground responses in what the user is currently seeing. You can send images to the model as part of a conversation item. The model can then generate responses that reference the images.
285+
The `gpt-realtime` and `gpt-realtime-mini` models support image input as part of the conversation. The model can ground responses in what the user is currently seeing. You can send images to the model as part of a conversation item. The model can then generate responses that reference the images.
285286

286287
The following example json body adds an image to the conversation:
287288

articles/ai-foundry/openai/includes/models-azure-direct-openai.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -313,7 +313,7 @@ Details about maximum request tokens and training data are available in the foll
313313
|`gpt-4o-realtime-preview` (2025-06-03) <br> GPT-4o audio | Audio model for real-time audio processing. |Input: 128,000 <br> Output: 4,096 | October 2023 |
314314
|`gpt-4o-realtime-preview` (2024-12-17) <br> GPT-4o audio | Audio model for real-time audio processing. |Input: 128,000 <br> Output: 4,096 | October 2023 |
315315
|`gpt-4o-mini-realtime-preview` (2024-12-17) <br> GPT-4o audio | Audio model for real-time audio processing. |Input: 128,000 <br> Output: 4,096 | October 2023 |
316-
|`gpt-realtime` (2025-08-28) (GA)<br> GPT-4o audio | Audio model for real-time audio processing. |Input: 28,672 <br> Output: 4,096 | October 2023 |
316+
|`gpt-realtime` (2025-08-28) (GA)<br>`gpt-realtime-mini` (2025-10-06)<br> `gpt-audio`(2025-08-28)<br>`gpt-audio-mini`(2025-10-06) | Audio model for real-time audio processing. |Input: 28,672 <br> Output: 4,096 | October 2023 |
317317

318318
To compare the availability of GPT-4o audio models across all regions, refer to the [models table](#global-standard-model-availability).
319319

articles/ai-foundry/openai/includes/realtime-python.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ ms.date: 3/20/2025
1313
- An Azure subscription. <a href="https://azure.microsoft.com/free/ai-services" target="_blank">Create one for free</a>.
1414
- <a href="https://www.python.org/" target="_blank">Python 3.8 or later version</a>. We recommend using Python 3.10 or later, but having at least Python 3.8 is required. If you don't have a suitable version of Python installed, you can follow the instructions in the [VS Code Python Tutorial](https://code.visualstudio.com/docs/python/python-tutorial#_install-a-python-interpreter) for the easiest way of installing Python on your operating system.
1515
- An Azure OpenAI resource created in one of the supported regions. For more information about region availability, see the [models and versions documentation](../concepts/models.md#global-standard-model-availability).
16-
- Then, you need to deploy a `gpt-realtime` model with your Azure OpenAI resource. For more information, see [Create a resource and deploy a model with Azure OpenAI](../how-to/create-resource.md).
16+
- Then, you need to deploy a `gpt-realtime` or `gpt-realtime-mini` model with your Azure OpenAI resource. For more information, see [Create a resource and deploy a model with Azure OpenAI](../how-to/create-resource.md).
1717

1818
## Microsoft Entra ID prerequisites
1919

articles/ai-foundry/openai/quotas-limits.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -252,7 +252,10 @@ During the preview, the rate limits for each `gpt-4o` realtime model deployment
252252
|`gpt-4o-mini-audio-preview` | Default | 2M | 1K |
253253
|`gpt-4o-mini-realtime-preview` | Default | 800K | 1K |
254254
|`gpt-audio` | Default | 100K | 30 |
255+
|`gpt-audio-mini` | Default | 100K | 30 |
255256
|`gpt-realtime` | Default | 100K | 30 |
257+
|`gpt-realtime-mini` | Default | 100K | 30 |
258+
256259

257260
## GPT-image-1 rate limits
258261

articles/ai-foundry/openai/realtime-audio-quickstart.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -32,6 +32,7 @@ The GPT real-time models are available for global deployments.
3232
- `gpt-4o-realtime-preview` (version `2024-12-17`)
3333
- `gpt-4o-mini-realtime-preview` (version `2024-12-17`)
3434
- `gpt-realtime` (version `2025-08-28`)
35+
- `gpt-realtime-mini` (version `2025-10-06`)
3536

3637
See the [models and versions documentation](./concepts/models.md#audio-models) for more information.
3738

0 commit comments

Comments
 (0)