Skip to content

Commit ea2575f

Browse files
authored
Merge pull request #2743 from MicrosoftDocs/main
2/5/2025 AM Publish
2 parents 3bc0655 + 30fcb77 commit ea2575f

25 files changed

+128
-110
lines changed

articles/ai-services/openai/concepts/model-retirements.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -109,7 +109,7 @@ These models are currently available for use in Azure OpenAI Service.
109109
| `gpt-4` | vision-preview | To be upgraded to `gpt-4` version: `turbo-2024-04-09`, starting no sooner than January 27, 2025 **<sup>1</sup>** | `gpt-4o`|
110110
| `gpt-4o` | 2024-05-13 | No earlier than May 20, 2025 <br><br>Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `2024-08-06`, starting on February 13, 2025. | |
111111
| `gpt-4o-mini` | 2024-07-18 | No earlier than July 18, 2025 | |
112-
| `gpt-4o-realtime-preview` | 2024-10-01 | No earlier than September 30, 2025 | `gpt-4o-realtime-preview` (version 2024-12-17) |
112+
| `gpt-4o-realtime-preview` | 2024-10-01 | No earlier than September 30, 2025 | `gpt-4o-realtime-preview` (version 2024-12-17) or `gpt-4o-mini-realtime-preview` (version 2024-12-17) |
113113
| `gpt-3.5-turbo-instruct` | 0914 | No earlier than April 1, 2025 | |
114114
| `o1` | 2024-12-17 | No earlier than December 17, 2025 | |
115115
| `text-embedding-ada-002` | 2 | No earlier than October 3, 2025 | `text-embedding-3-small` or `text-embedding-3-large` |

articles/ai-services/openai/concepts/models.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -72,6 +72,8 @@ Details about maximum request tokens and training data are available in the foll
7272

7373
| Model ID | Description | Max Request (tokens) | Training Data (up to) |
7474
|---|---|---|---|
75+
|`gpt-4o-mini-audio-preview` (2024-12-17) <br> **GPT-4o audio** | **Audio model** for audio and text generation. |Input: 128,000 <br> Output: 4,096 | Oct 2023 |
76+
|`gpt-4o-mini-realtime-preview` (2024-12-17) <br> **GPT-4o audio** | **Audio model** for real-time audio processing. |Input: 128,000 <br> Output: 4,096 | Oct 2023 |
7577
|`gpt-4o-audio-preview` (2024-12-17) <br> **GPT-4o audio** | **Audio model** for audio and text generation. |Input: 128,000 <br> Output: 4,096 | Oct 2023 |
7678
|`gpt-4o-realtime-preview` (2024-12-17) <br> **GPT-4o audio** | **Audio model** for real-time audio processing. |Input: 128,000 <br> Output: 4,096 | Oct 2023 |
7779
|`gpt-4o-realtime-preview` (2024-10-01) <br> **GPT-4o audio** | **Audio model** for real-time audio processing. |Input: 128,000 <br> Output: 4,096 | Oct 2023 |

articles/ai-services/openai/how-to/prompt-caching.md

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -28,7 +28,8 @@ Currently only the following models support prompt caching with Azure OpenAI:
2828
- `gpt-4o-2024-11-20`
2929
- `gpt-4o-2024-08-06`
3030
- `gpt-4o-mini-2024-07-18`
31-
- `gpt-4o-realtime-preview` (version 2024-12-17)`
31+
- `gpt-4o-realtime-preview` (version 2024-12-17)
32+
- `gpt-4o-mini-realtime-preview` (version 2024-12-17)
3233

3334
> [!NOTE]
3435
> Prompt caching is now also available as part of model fine-tuning for `gpt-4o` and `gpt-4o-mini`. Refer to the fine-tuning section of the [pricing page](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) for details.
@@ -81,9 +82,9 @@ Prompt caching is supported for:
8182

8283
|**Caching supported**|**Description**|**Supported models**|
8384
|--------|--------|--------|
84-
| **Messages** | The complete messages array: system, developer, user, and assistant content | `gpt-4o`<br/>`gpt-4o-mini`<br/>`gpt-4o-realtime-preview` (version 2024-12-17) <br> `o1` (version 2024-12-17) |
85+
| **Messages** | The complete messages array: system, developer, user, and assistant content | `gpt-4o`<br/>`gpt-4o-mini`<br/>`gpt-4o-realtime-preview` (version 2024-12-17)<br/>`gpt-4o-mini-realtime-preview` (version 2024-12-17)<br> `o1` (version 2024-12-17) |
8586
| **Images** | Images included in user messages, both as links or as base64-encoded data. The detail parameter must be set the same across requests. | `gpt-4o`<br/>`gpt-4o-mini` <br> `o1` (version 2024-12-17) |
86-
| **Tool use** | Both the messages array and tool definitions. | `gpt-4o`<br/>`gpt-4o-mini`<br/>`gpt-4o-realtime-preview` (version 2024-12-17) <br> `o1` (version 2024-12-17) |
87+
| **Tool use** | Both the messages array and tool definitions. | `gpt-4o`<br/>`gpt-4o-mini`<br/>`gpt-4o-realtime-preview` (version 2024-12-17)<br/>`gpt-4o-mini-realtime-preview` (version 2024-12-17)<br> `o1` (version 2024-12-17) |
8788
| **Structured outputs** | Structured output schema is appended as a prefix to the system message. | `gpt-4o`<br/>`gpt-4o-mini` <br> `o1` (version 2024-12-17) |
8889

8990
To improve the likelihood of cache hits occurring, you should structure your requests such that repetitive content occurs at the beginning of the messages array.

articles/ai-services/openai/how-to/realtime-audio.md

Lines changed: 9 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -23,6 +23,7 @@ Most users of the Realtime API need to deliver and receive audio from an end-use
2323
## Supported models
2424

2525
The GPT 4o real-time models are available for global deployments in [East US 2 and Sweden Central regions](../concepts/models.md#global-standard-model-availability).
26+
- `gpt-4o-mini-realtime-preview` (2024-12-17)
2627
- `gpt-4o-realtime-preview` (2024-12-17)
2728
- `gpt-4o-realtime-preview` (2024-10-01)
2829

@@ -34,10 +35,10 @@ Before you can use GPT-4o real-time audio, you need:
3435

3536
- An Azure subscription - <a href="https://azure.microsoft.com/free/cognitive-services" target="_blank">Create one for free</a>.
3637
- An Azure OpenAI resource created in a [supported region](#supported-models). For more information, see [Create a resource and deploy a model with Azure OpenAI](create-resource.md).
37-
- You need a deployment of the `gpt-4o-realtime-preview` model in a supported region as described in the [supported models](#supported-models) section. You can deploy the model from the [Azure AI Foundry portal model catalog](../../../ai-studio/how-to/model-catalog-overview.md) or from your project in Azure AI Foundry portal.
38+
- You need a deployment of the `gpt-4o-realtime-preview` or `gpt-4o-mini-realtime-preview` model in a supported region as described in the [supported models](#supported-models) section. You can deploy the model from the [Azure AI Foundry portal model catalog](../../../ai-studio/how-to/model-catalog-overview.md) or from your project in Azure AI Foundry portal.
3839

3940
Here are some of the ways you can get started with the GPT-4o Realtime API for speech and audio:
40-
- For steps to deploy and use the `gpt-4o-realtime-preview` model, see [the real-time audio quickstart](../realtime-audio-quickstart.md).
41+
- For steps to deploy and use the `gpt-4o-realtime-preview` or `gpt-4o-mini-realtime-preview` model, see [the real-time audio quickstart](../realtime-audio-quickstart.md).
4142
- Download the sample code from the [Azure OpenAI GPT-4o real-time audio repository on GitHub](https://github.com/azure-samples/aoai-realtime-audio-sdk).
4243
- [The Azure-Samples/aisearch-openai-rag-audio repo](https://github.com/Azure-Samples/aisearch-openai-rag-audio) contains an example of how to implement RAG support in applications that use voice as their user interface, powered by the GPT-4o realtime API for audio.
4344

@@ -52,16 +53,16 @@ The Realtime API is accessed via a secure WebSocket connection to the `/realtime
5253

5354
You can construct a full request URI by concatenating:
5455

55-
- The secure WebSocket (`wss://`) protocol
56+
- The secure WebSocket (`wss://`) protocol.
5657
- Your Azure OpenAI resource endpoint hostname, for example, `my-aoai-resource.openai.azure.com`
57-
- The `openai/realtime` API path
58-
- An `api-version` query string parameter for a supported API version such as `2024-10-01-preview`
59-
- A `deployment` query string parameter with the name of your `gpt-4o-realtime-preview` model deployment
58+
- The `openai/realtime` API path.
59+
- An `api-version` query string parameter for a supported API version such as `2024-12-17`
60+
- A `deployment` query string parameter with the name of your `gpt-4o-realtime-preview` or `gpt-4o-mini-realtime-preview` model deployment.
6061

6162
The following example is a well-constructed `/realtime` request URI:
6263

6364
```http
64-
wss://my-eastus2-openai-resource.openai.azure.com/openai/realtime?api-version=2024-10-01-preview&deployment=gpt-4o-realtime-preview-deployment-name
65+
wss://my-eastus2-openai-resource.openai.azure.com/openai/realtime?api-version=2024-12-17&deployment=gpt-4o-mini-realtime-preview-deployment-name
6566
```
6667

6768
To authenticate:
@@ -346,7 +347,7 @@ When you connect to the `/realtime` endpoint, the server responds with a [`sessi
346347
"session": {
347348
"id": "REDACTED",
348349
"object": "realtime.session",
349-
"model": "gpt-4o-realtime-preview-2024-10-01",
350+
"model": "gpt-4o-mini-realtime-preview-2024-12-17",
350351
"expires_at": 1734626723,
351352
"modalities": [
352353
"audio",

articles/ai-services/openai/how-to/structured-outputs.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,7 @@ Structured outputs make a model follow a [JSON Schema](https://json-schema.org/o
2020
> Currently structured outputs are not supported with:
2121
> - [Bring your own data](../concepts/use-your-data.md) scenarios.
2222
> - [Assistants](../how-to/assistant.md) or [Azure AI Agents Service](../../agents/overview.md).
23-
> - `gpt-4o-audio-preview` version: `2024-12-17`.
23+
> - `gpt-4o-audio-preview` and `gpt-4o-mini-audio-preview` version: `2024-12-17`.
2424
2525
## Supported models
2626

articles/ai-services/openai/includes/audio-completions-ai-foundry.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -15,11 +15,11 @@ ms.date: 1/7/2025
1515

1616
## Use GPT-4o audio generation
1717

18-
To chat with your deployed `gpt-4o-audio-preview` model in the **Chat** playground of [Azure AI Foundry portal](https://ai.azure.com), follow these steps:
18+
To chat with your deployed `gpt-4o-mini-audio-preview` model in the **Chat** playground of [Azure AI Foundry portal](https://ai.azure.com), follow these steps:
1919

20-
1. Go to the [Azure OpenAI Service page](https://ai.azure.com/resource/overview) in Azure AI Foundry portal. Make sure you're signed in with the Azure subscription that has your Azure OpenAI Service resource and the deployed `gpt-4o-audio-preview` model.
20+
1. Go to the [Azure OpenAI Service page](https://ai.azure.com/resource/overview) in Azure AI Foundry portal. Make sure you're signed in with the Azure subscription that has your Azure OpenAI Service resource and the deployed `gpt-4o-mini-audio-preview` model.
2121
1. Select the **Chat** playground from under **Resource playground** in the left pane.
22-
1. Select your deployed `gpt-4o-audio-preview` model from the **Deployment** dropdown.
22+
1. Select your deployed `gpt-4o-mini-audio-preview` model from the **Deployment** dropdown.
2323
1. Start chatting with the model and listen to the audio responses.
2424

2525
:::image type="content" source="../media/quickstarts/audio-completions-chat-playground.png" alt-text="Screenshot of the Chat playground page." lightbox="../media/quickstarts/audio-completions-chat-playground.png":::

articles/ai-services/openai/includes/audio-completions-deploy-model.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -7,12 +7,12 @@ ms.topic: include
77
ms.date: 1/21/2025
88
---
99

10-
To deploy the `gpt-4o-audio-preview` model in the Azure AI Foundry portal:
11-
1. Go to the [Azure OpenAI Service page](https://ai.azure.com/resource/overview) in Azure AI Foundry portal. Make sure you're signed in with the Azure subscription that has your Azure OpenAI Service resource and the deployed `gpt-4o-audio-preview` model.
10+
To deploy the `gpt-4o-mini-audio-preview` model in the Azure AI Foundry portal:
11+
1. Go to the [Azure OpenAI Service page](https://ai.azure.com/resource/overview) in Azure AI Foundry portal. Make sure you're signed in with the Azure subscription that has your Azure OpenAI Service resource and the deployed `gpt-4o-mini-audio-preview` model.
1212
1. Select the **Chat** playground from under **Playgrounds** in the left pane.
1313
1. Select **+ Create new deployment** > **From base models** to open the deployment window.
14-
1. Search for and select the `gpt-4o-audio-preview` model and then select **Deploy to selected resource**.
14+
1. Search for and select the `gpt-4o-mini-audio-preview` model and then select **Deploy to selected resource**.
1515
1. In the deployment wizard, select the `2024-12-17` model version.
1616
1. Follow the wizard to finish deploying the model.
1717

18-
Now that you have a deployment of the `gpt-4o-audio-preview` model, you can interact with it in the Azure AI Foundry portal **Chat** playground or chat completions API.
18+
Now that you have a deployment of the `gpt-4o-mini-audio-preview` model, you can interact with it in the Azure AI Foundry portal **Chat** playground or chat completions API.

articles/ai-services/openai/includes/audio-completions-intro.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ ms.topic: include
77
ms.date: 1/21/2025
88
---
99

10-
The `gpt-4o-audio-preview` model introduces the audio modality into the existing `/chat/completions` API. The audio model expands the potential for AI applications in text and voice-based interactions and audio analysis. Modalities supported in `gpt-4o-audio-preview` model include:  text, audio, and text + audio.
10+
The `gpt-4o-audio-preview` and `gpt-4o-mini-audio-preview` models introduce the audio modality into the existing `/chat/completions` API. The audio model expands the potential for AI applications in text and voice-based interactions and audio analysis. Modalities supported in `gpt-4o-audio-preview` and `gpt-4o-mini-audio-preview` models include:  text, audio, and text + audio.
1111

1212
Here's a table of the supported modalities with example use cases:
1313

@@ -23,9 +23,9 @@ By using audio generation capabilities, you can achieve more dynamic and interac
2323

2424
## Supported models
2525

26-
Currently only `gpt-4o-audio-preview` version: `2024-12-17` supports audio generation.
26+
Currently only `gpt-4o-audio-preview` and `gpt-4o-mini-audio-preview` version: `2024-12-17` supports audio generation.
2727

28-
The `gpt-4o-audio-preview` model is available for global deployments in [East US 2 and Sweden Central regions](../concepts/models.md#global-standard-model-availability).
28+
The `gpt-4o-audio-preview` and and `gpt-4o-mini-audio-preview` models are available for global deployments in [East US 2 and Sweden Central regions](../concepts/models.md#global-standard-model-availability).
2929

3030
Currently the following voices are supported for audio out: Alloy, Echo, and Shimmer.
3131

0 commit comments

Comments
 (0)