Skip to content

Commit 88ecb95

Browse files
committed
Revert "add realtime-mini mentions"
This reverts commit 1b1ce9d.
1 parent 1b1ce9d commit 88ecb95

File tree

8 files changed

+9
-16
lines changed

8 files changed

+9
-16
lines changed

articles/ai-foundry/openai/how-to/prompt-caching.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -72,9 +72,9 @@ Prompt caching is supported for:
7272

7373
|**Caching supported**|**Description**|**Supported models**|
7474
|--------|--------|--------|
75-
| **Messages** | The complete messages array: system, developer, user, and assistant content | `gpt-4o`<br/>`gpt-4o-mini`<br/>`gpt-4o-realtime-preview` (version 2024-12-17)<br/>`gpt-4o-mini-realtime-preview` (version 2024-12-17)<br>`gpt-realtime` (version 2025-08-28)<br> `gpt-realtime-mini` (version 2025-10-06)<br>`o1` (version 2024-12-17) <br> `o3-mini` (version 2025-01-31) |
75+
| **Messages** | The complete messages array: system, developer, user, and assistant content | `gpt-4o`<br/>`gpt-4o-mini`<br/>`gpt-4o-realtime-preview` (version 2024-12-17)<br/>`gpt-4o-mini-realtime-preview` (version 2024-12-17)<br>`gpt-realtime` (version 2025-08-28)<br> `o1` (version 2024-12-17) <br> `o3-mini` (version 2025-01-31) |
7676
| **Images** | Images included in user messages, both as links or as base64-encoded data. The detail parameter must be set the same across requests. | `gpt-4o`<br/>`gpt-4o-mini` <br> `o1` (version 2024-12-17) |
77-
| **Tool use** | Both the messages array and tool definitions. | `gpt-4o`<br/>`gpt-4o-mini`<br/>`gpt-4o-realtime-preview` (version 2024-12-17)<br/>`gpt-4o-mini-realtime-preview` (version 2024-12-17)<br>`gpt-realtime` (version 2025-08-28)<br>`gpt-realtime-mini` (version 2025-10-06)<br> `o1` (version 2024-12-17) <br> `o3-mini` (version 2025-01-31) |
77+
| **Tool use** | Both the messages array and tool definitions. | `gpt-4o`<br/>`gpt-4o-mini`<br/>`gpt-4o-realtime-preview` (version 2024-12-17)<br/>`gpt-4o-mini-realtime-preview` (version 2024-12-17)<br>`gpt-realtime` (version 2025-08-28)<br> `o1` (version 2024-12-17) <br> `o3-mini` (version 2025-01-31) |
7878
| **Structured outputs** | Structured output schema is appended as a prefix to the system message. | `gpt-4o`<br/>`gpt-4o-mini` <br> `o1` (version 2024-12-17) <br> `o3-mini` (version 2025-01-31) |
7979

8080
To improve the likelihood of cache hits occurring, you should structure your requests such that repetitive content occurs at the beginning of the messages array.

articles/ai-foundry/openai/how-to/realtime-audio-webrtc.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -34,7 +34,6 @@ The GPT real-time models are available for global deployments in [East US 2 and
3434
- `gpt-4o-mini-realtime-preview` (2024-12-17)
3535
- `gpt-4o-realtime-preview` (2024-12-17)
3636
- `gpt-realtime` (version 2025-08-28)
37-
- `gpt-realtime-mini` (version 2025-10-06)
3837

3938
You should use API version `2025-04-01-preview` in the URL for the Realtime API. The API version is included in the sessions URL.
4039

@@ -46,7 +45,7 @@ Before you can use GPT real-time audio, you need:
4645

4746
- An Azure subscription - <a href="https://azure.microsoft.com/free/cognitive-services" target="_blank">Create one for free</a>.
4847
- An Azure OpenAI resource created in a [supported region](#supported-models). For more information, see [Create a resource and deploy a model with Azure OpenAI](create-resource.md).
49-
- You need a deployment of the `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, `gpt-realtime`, or `gpt-realtime-mini` model in a supported region as described in the [supported models](#supported-models) section in this article. You can deploy the model from the [Azure AI Foundry model catalog](../../../ai-foundry/how-to/model-catalog-overview.md) or from your project in Azure AI Foundry portal.
48+
- You need a deployment of the `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, or `gpt-realtime` model in a supported region as described in the [supported models](#supported-models) section in this article. You can deploy the model from the [Azure AI Foundry model catalog](../../../ai-foundry/how-to/model-catalog-overview.md) or from your project in Azure AI Foundry portal.
5049

5150
## Connection and authentication
5251

articles/ai-foundry/openai/how-to/realtime-audio-websockets.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,6 @@ The GPT real-time models are available for global deployments in [East US 2 and
3131
- `gpt-4o-mini-realtime-preview` (2024-12-17)
3232
- `gpt-4o-realtime-preview` (2024-12-17)
3333
- `gpt-realtime` (version 2025-08-28)
34-
- `gpt-realtime-mini` (version 2025-10-06)
3534

3635
You should use API version `2025-04-01-preview` in the URL for the Realtime API.
3736

@@ -43,7 +42,7 @@ Before you can use GPT real-time audio, you need:
4342

4443
- An Azure subscription - <a href="https://azure.microsoft.com/free/cognitive-services" target="_blank">Create one for free</a>.
4544
- An Azure OpenAI resource created in a [supported region](#supported-models). For more information, see [Create a resource and deploy a model with Azure OpenAI](create-resource.md).
46-
- You need a deployment of the `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, `gpt-realtime`, or `gpt-realtime-mini` model in a supported region as described in the [supported models](#supported-models) section. You can deploy the model from the [Azure AI Foundry portal model catalog](../../../ai-foundry/how-to/model-catalog-overview.md) or from your project in Azure AI Foundry portal.
45+
- You need a deployment of the `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, or `gpt-realtime` model in a supported region as described in the [supported models](#supported-models) section. You can deploy the model from the [Azure AI Foundry portal model catalog](../../../ai-foundry/how-to/model-catalog-overview.md) or from your project in Azure AI Foundry portal.
4746

4847
## Connection and authentication
4948

articles/ai-foundry/openai/how-to/realtime-audio.md

Lines changed: 3 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -29,7 +29,6 @@ The GPT real-time models are available for global deployments in [East US 2 and
2929
- `gpt-4o-mini-realtime-preview` (2024-12-17)
3030
- `gpt-4o-realtime-preview` (2024-12-17)
3131
- `gpt-realtime` (version 2025-08-28)
32-
- `gpt-realtime-mini` (version 2025-10-06)
3332

3433
You should use API version `2025-04-01-preview` in the URL for the Realtime API.
3534

@@ -41,10 +40,10 @@ Before you can use GPT real-time audio, you need:
4140

4241
- An Azure subscription - <a href="https://azure.microsoft.com/free/cognitive-services" target="_blank">Create one for free</a>.
4342
- An Azure OpenAI resource created in a [supported region](#supported-models). For more information, see [Create a resource and deploy a model with Azure OpenAI](create-resource.md).
44-
- You need a deployment of the `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, `gpt-realtime`, or `gpt-realtime-mini` model in a supported region as described in the [supported models](#supported-models) section. You can deploy the model from the [Azure AI Foundry portal model catalog](../../../ai-foundry/how-to/model-catalog-overview.md) or from your project in Azure AI Foundry portal.
43+
- You need a deployment of the `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, or `gpt-realtime` model in a supported region as described in the [supported models](#supported-models) section. You can deploy the model from the [Azure AI Foundry portal model catalog](../../../ai-foundry/how-to/model-catalog-overview.md) or from your project in Azure AI Foundry portal.
4544

4645
Here are some of the ways you can get started with the GPT Realtime API for speech and audio:
47-
- For steps to deploy and use the `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, `gpt-realtime`, or `gpt-realtime-mini` model, see [the real-time audio quickstart](../realtime-audio-quickstart.md).
46+
- For steps to deploy and use the `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, or `gpt-realtime` model, see [the real-time audio quickstart](../realtime-audio-quickstart.md).
4847
- Try the [WebRTC via HTML and JavaScript example](./realtime-audio-webrtc.md#webrtc-example-via-html-and-javascript) to get started with the Realtime API via WebRTC.
4948
- [The Azure-Samples/aisearch-openai-rag-audio repo](https://github.com/Azure-Samples/aisearch-openai-rag-audio) contains an example of how to implement RAG support in applications that use voice as their user interface, powered by the GPT realtime API for audio.
5049

@@ -282,7 +281,7 @@ A user might want to interrupt the assistant's response or ask the assistant to
282281

283282
## Image input
284283

285-
The `gpt-realtime` and `gpt-realtime-mini` models support image input as part of the conversation. The model can ground responses in what the user is currently seeing. You can send images to the model as part of a conversation item. The model can then generate responses that reference the images.
284+
The `gpt-realtime` model supports image input as part of the conversation. The model can ground responses in what the user is currently seeing. You can send images to the model as part of a conversation item. The model can then generate responses that reference the images.
286285

287286
The following example json body adds an image to the conversation:
288287

articles/ai-foundry/openai/includes/models-azure-direct-openai.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -313,7 +313,7 @@ Details about maximum request tokens and training data are available in the foll
313313
|`gpt-4o-realtime-preview` (2025-06-03) <br> GPT-4o audio | Audio model for real-time audio processing. |Input: 128,000 <br> Output: 4,096 | October 2023 |
314314
|`gpt-4o-realtime-preview` (2024-12-17) <br> GPT-4o audio | Audio model for real-time audio processing. |Input: 128,000 <br> Output: 4,096 | October 2023 |
315315
|`gpt-4o-mini-realtime-preview` (2024-12-17) <br> GPT-4o audio | Audio model for real-time audio processing. |Input: 128,000 <br> Output: 4,096 | October 2023 |
316-
|`gpt-realtime` (2025-08-28) (GA)<br>`gpt-realtime-mini` (2025-10-06)<br> `gpt-audio`(2025-08-28)<br>`gpt-audio-mini`(2025-10-06) | Audio model for real-time audio processing. |Input: 28,672 <br> Output: 4,096 | October 2023 |
316+
|`gpt-realtime` (2025-08-28) (GA)<br> GPT-4o audio | Audio model for real-time audio processing. |Input: 28,672 <br> Output: 4,096 | October 2023 |
317317

318318
To compare the availability of GPT-4o audio models across all regions, refer to the [models table](#global-standard-model-availability).
319319

articles/ai-foundry/openai/includes/realtime-python.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ ms.date: 3/20/2025
1313
- An Azure subscription. <a href="https://azure.microsoft.com/free/ai-services" target="_blank">Create one for free</a>.
1414
- <a href="https://www.python.org/" target="_blank">Python 3.8 or later version</a>. We recommend using Python 3.10 or later, but having at least Python 3.8 is required. If you don't have a suitable version of Python installed, you can follow the instructions in the [VS Code Python Tutorial](https://code.visualstudio.com/docs/python/python-tutorial#_install-a-python-interpreter) for the easiest way of installing Python on your operating system.
1515
- An Azure OpenAI resource created in one of the supported regions. For more information about region availability, see the [models and versions documentation](../concepts/models.md#global-standard-model-availability).
16-
- Then, you need to deploy a `gpt-realtime` or `gpt-realtime-mini` model with your Azure OpenAI resource. For more information, see [Create a resource and deploy a model with Azure OpenAI](../how-to/create-resource.md).
16+
- Then, you need to deploy a `gpt-realtime` model with your Azure OpenAI resource. For more information, see [Create a resource and deploy a model with Azure OpenAI](../how-to/create-resource.md).
1717

1818
## Microsoft Entra ID prerequisites
1919

articles/ai-foundry/openai/quotas-limits.md

Lines changed: 0 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -252,10 +252,7 @@ During the preview, the rate limits for each `gpt-4o` realtime model deployment
252252
|`gpt-4o-mini-audio-preview` | Default | 2M | 1K |
253253
|`gpt-4o-mini-realtime-preview` | Default | 800K | 1K |
254254
|`gpt-audio` | Default | 100K | 30 |
255-
|`gpt-audio-mini` | Default | 100K | 30 |
256255
|`gpt-realtime` | Default | 100K | 30 |
257-
|`gpt-realtime-mini` | Default | 100K | 30 |
258-
259256

260257
## GPT-image-1 rate limits
261258

articles/ai-foundry/openai/realtime-audio-quickstart.md

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -32,7 +32,6 @@ The GPT real-time models are available for global deployments.
3232
- `gpt-4o-realtime-preview` (version `2024-12-17`)
3333
- `gpt-4o-mini-realtime-preview` (version `2024-12-17`)
3434
- `gpt-realtime` (version `2025-08-28`)
35-
- `gpt-realtime-mini` (version `2025-10-06`)
3635

3736
See the [models and versions documentation](./concepts/models.md#audio-models) for more information.
3837

0 commit comments

Comments
 (0)