You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/openai/concepts/model-retirements.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -109,7 +109,7 @@ These models are currently available for use in Azure OpenAI Service.
109
109
|`gpt-4`| vision-preview | To be upgraded to `gpt-4` version: `turbo-2024-04-09`, starting no sooner than January 27, 2025 **<sup>1</sup>**|`gpt-4o`|
110
110
|`gpt-4o`| 2024-05-13 | No earlier than May 20, 2025 <br><br>Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `2024-08-06`, starting on February 13, 2025. ||
111
111
|`gpt-4o-mini`| 2024-07-18 | No earlier than July 18, 2025 ||
112
-
|`gpt-4o-realtime-preview`| 2024-10-01 | No earlier than September 30, 2025 |`gpt-4o-realtime-preview` (version 2024-12-17) |
112
+
|`gpt-4o-realtime-preview`| 2024-10-01 | No earlier than September 30, 2025 |`gpt-4o-realtime-preview` (version 2024-12-17) or `gpt-4o-mini-realtime-preview` (version 2024-12-17) |
113
113
|`gpt-3.5-turbo-instruct`| 0914 | No earlier than April 1, 2025 ||
114
114
|`o1`| 2024-12-17 | No earlier than December 17, 2025 ||
115
115
|`text-embedding-ada-002`| 2 | No earlier than October 3, 2025 |`text-embedding-3-small` or `text-embedding-3-large`|
Copy file name to clipboardExpand all lines: articles/ai-services/openai/concepts/models.md
+2Lines changed: 2 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -72,6 +72,8 @@ Details about maximum request tokens and training data are available in the foll
72
72
73
73
| Model ID | Description | Max Request (tokens) | Training Data (up to) |
74
74
|---|---|---|---|
75
+
|`gpt-4o-mini-audio-preview` (2024-12-17) <br> **GPT-4o audio**|**Audio model** for audio and text generation. |Input: 128,000 <br> Output: 4,096 | Oct 2023 |
76
+
|`gpt-4o-mini-realtime-preview` (2024-12-17) <br> **GPT-4o audio**|**Audio model** for real-time audio processing. |Input: 128,000 <br> Output: 4,096 | Oct 2023 |
75
77
|`gpt-4o-audio-preview` (2024-12-17) <br> **GPT-4o audio**|**Audio model** for audio and text generation. |Input: 128,000 <br> Output: 4,096 | Oct 2023 |
76
78
|`gpt-4o-realtime-preview` (2024-12-17) <br> **GPT-4o audio**|**Audio model** for real-time audio processing. |Input: 128,000 <br> Output: 4,096 | Oct 2023 |
77
79
|`gpt-4o-realtime-preview` (2024-10-01) <br> **GPT-4o audio**|**Audio model** for real-time audio processing. |Input: 128,000 <br> Output: 4,096 | Oct 2023 |
> Prompt caching is now also available as part of model fine-tuning for `gpt-4o` and `gpt-4o-mini`. Refer to the fine-tuning section of the [pricing page](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) for details.
@@ -81,9 +82,9 @@ Prompt caching is supported for:
|**Messages**| The complete messages array: system, developer, user, and assistant content |`gpt-4o`<br/>`gpt-4o-mini`<br/>`gpt-4o-realtime-preview` (version 2024-12-17)<br> `o1` (version 2024-12-17) |
85
+
|**Messages**| The complete messages array: system, developer, user, and assistant content |`gpt-4o`<br/>`gpt-4o-mini`<br/>`gpt-4o-realtime-preview` (version 2024-12-17)<br/>`gpt-4o-mini-realtime-preview` (version 2024-12-17)<br> `o1` (version 2024-12-17) |
85
86
|**Images**| Images included in user messages, both as links or as base64-encoded data. The detail parameter must be set the same across requests. |`gpt-4o`<br/>`gpt-4o-mini` <br> `o1` (version 2024-12-17) |
86
-
|**Tool use**| Both the messages array and tool definitions. |`gpt-4o`<br/>`gpt-4o-mini`<br/>`gpt-4o-realtime-preview` (version 2024-12-17)<br> `o1` (version 2024-12-17) |
87
+
|**Tool use**| Both the messages array and tool definitions. |`gpt-4o`<br/>`gpt-4o-mini`<br/>`gpt-4o-realtime-preview` (version 2024-12-17)<br/>`gpt-4o-mini-realtime-preview` (version 2024-12-17)<br> `o1` (version 2024-12-17) |
87
88
|**Structured outputs**| Structured output schema is appended as a prefix to the system message. |`gpt-4o`<br/>`gpt-4o-mini` <br> `o1` (version 2024-12-17) |
88
89
89
90
To improve the likelihood of cache hits occurring, you should structure your requests such that repetitive content occurs at the beginning of the messages array.
Copy file name to clipboardExpand all lines: articles/ai-services/openai/how-to/realtime-audio.md
+9-8Lines changed: 9 additions & 8 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -23,6 +23,7 @@ Most users of the Realtime API need to deliver and receive audio from an end-use
23
23
## Supported models
24
24
25
25
The GPT 4o real-time models are available for global deployments in [East US 2 and Sweden Central regions](../concepts/models.md#global-standard-model-availability).
26
+
-`gpt-4o-mini-realtime-preview` (2024-12-17)
26
27
-`gpt-4o-realtime-preview` (2024-12-17)
27
28
-`gpt-4o-realtime-preview` (2024-10-01)
28
29
@@ -34,10 +35,10 @@ Before you can use GPT-4o real-time audio, you need:
34
35
35
36
- An Azure subscription - <ahref="https://azure.microsoft.com/free/cognitive-services"target="_blank">Create one for free</a>.
36
37
- An Azure OpenAI resource created in a [supported region](#supported-models). For more information, see [Create a resource and deploy a model with Azure OpenAI](create-resource.md).
37
-
- You need a deployment of the `gpt-4o-realtime-preview` model in a supported region as described in the [supported models](#supported-models) section. You can deploy the model from the [Azure AI Foundry portal model catalog](../../../ai-studio/how-to/model-catalog-overview.md) or from your project in Azure AI Foundry portal.
38
+
- You need a deployment of the `gpt-4o-realtime-preview`or `gpt-4o-mini-realtime-preview`model in a supported region as described in the [supported models](#supported-models) section. You can deploy the model from the [Azure AI Foundry portal model catalog](../../../ai-studio/how-to/model-catalog-overview.md) or from your project in Azure AI Foundry portal.
38
39
39
40
Here are some of the ways you can get started with the GPT-4o Realtime API for speech and audio:
40
-
- For steps to deploy and use the `gpt-4o-realtime-preview` model, see [the real-time audio quickstart](../realtime-audio-quickstart.md).
41
+
- For steps to deploy and use the `gpt-4o-realtime-preview`or `gpt-4o-mini-realtime-preview`model, see [the real-time audio quickstart](../realtime-audio-quickstart.md).
41
42
- Download the sample code from the [Azure OpenAI GPT-4o real-time audio repository on GitHub](https://github.com/azure-samples/aoai-realtime-audio-sdk).
42
43
-[The Azure-Samples/aisearch-openai-rag-audio repo](https://github.com/Azure-Samples/aisearch-openai-rag-audio) contains an example of how to implement RAG support in applications that use voice as their user interface, powered by the GPT-4o realtime API for audio.
43
44
@@ -52,16 +53,16 @@ The Realtime API is accessed via a secure WebSocket connection to the `/realtime
52
53
53
54
You can construct a full request URI by concatenating:
54
55
55
-
- The secure WebSocket (`wss://`) protocol
56
+
- The secure WebSocket (`wss://`) protocol.
56
57
- Your Azure OpenAI resource endpoint hostname, for example, `my-aoai-resource.openai.azure.com`
57
-
- The `openai/realtime` API path
58
-
- An `api-version` query string parameter for a supported API version such as `2024-10-01-preview`
59
-
- A `deployment` query string parameter with the name of your `gpt-4o-realtime-preview` model deployment
58
+
- The `openai/realtime` API path.
59
+
- An `api-version` query string parameter for a supported API version such as `2024-12-17`
60
+
- A `deployment` query string parameter with the name of your `gpt-4o-realtime-preview`or `gpt-4o-mini-realtime-preview`model deployment.
60
61
61
62
The following example is a well-constructed `/realtime` request URI:
Copy file name to clipboardExpand all lines: articles/ai-services/openai/includes/audio-completions-ai-foundry.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -15,11 +15,11 @@ ms.date: 1/7/2025
15
15
16
16
## Use GPT-4o audio generation
17
17
18
-
To chat with your deployed `gpt-4o-audio-preview` model in the **Chat** playground of [Azure AI Foundry portal](https://ai.azure.com), follow these steps:
18
+
To chat with your deployed `gpt-4o-mini-audio-preview` model in the **Chat** playground of [Azure AI Foundry portal](https://ai.azure.com), follow these steps:
19
19
20
-
1. Go to the [Azure OpenAI Service page](https://ai.azure.com/resource/overview) in Azure AI Foundry portal. Make sure you're signed in with the Azure subscription that has your Azure OpenAI Service resource and the deployed `gpt-4o-audio-preview` model.
20
+
1. Go to the [Azure OpenAI Service page](https://ai.azure.com/resource/overview) in Azure AI Foundry portal. Make sure you're signed in with the Azure subscription that has your Azure OpenAI Service resource and the deployed `gpt-4o-mini-audio-preview` model.
21
21
1. Select the **Chat** playground from under **Resource playground** in the left pane.
22
-
1. Select your deployed `gpt-4o-audio-preview` model from the **Deployment** dropdown.
22
+
1. Select your deployed `gpt-4o-mini-audio-preview` model from the **Deployment** dropdown.
23
23
1. Start chatting with the model and listen to the audio responses.
24
24
25
25
:::image type="content" source="../media/quickstarts/audio-completions-chat-playground.png" alt-text="Screenshot of the Chat playground page." lightbox="../media/quickstarts/audio-completions-chat-playground.png":::
Copy file name to clipboardExpand all lines: articles/ai-services/openai/includes/audio-completions-deploy-model.md
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,12 +7,12 @@ ms.topic: include
7
7
ms.date: 1/21/2025
8
8
---
9
9
10
-
To deploy the `gpt-4o-audio-preview` model in the Azure AI Foundry portal:
11
-
1. Go to the [Azure OpenAI Service page](https://ai.azure.com/resource/overview) in Azure AI Foundry portal. Make sure you're signed in with the Azure subscription that has your Azure OpenAI Service resource and the deployed `gpt-4o-audio-preview` model.
10
+
To deploy the `gpt-4o-mini-audio-preview` model in the Azure AI Foundry portal:
11
+
1. Go to the [Azure OpenAI Service page](https://ai.azure.com/resource/overview) in Azure AI Foundry portal. Make sure you're signed in with the Azure subscription that has your Azure OpenAI Service resource and the deployed `gpt-4o-mini-audio-preview` model.
12
12
1. Select the **Chat** playground from under **Playgrounds** in the left pane.
13
13
1. Select **+ Create new deployment** > **From base models** to open the deployment window.
14
-
1. Search for and select the `gpt-4o-audio-preview` model and then select **Deploy to selected resource**.
14
+
1. Search for and select the `gpt-4o-mini-audio-preview` model and then select **Deploy to selected resource**.
15
15
1. In the deployment wizard, select the `2024-12-17` model version.
16
16
1. Follow the wizard to finish deploying the model.
17
17
18
-
Now that you have a deployment of the `gpt-4o-audio-preview` model, you can interact with it in the Azure AI Foundry portal **Chat** playground or chat completions API.
18
+
Now that you have a deployment of the `gpt-4o-mini-audio-preview` model, you can interact with it in the Azure AI Foundry portal **Chat** playground or chat completions API.
Copy file name to clipboardExpand all lines: articles/ai-services/openai/includes/audio-completions-intro.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@ ms.topic: include
7
7
ms.date: 1/21/2025
8
8
---
9
9
10
-
The `gpt-4o-audio-preview`model introduces the audio modality into the existing `/chat/completions` API. The audio model expands the potential for AI applications in text and voice-based interactions and audio analysis. Modalities supported in `gpt-4o-audio-preview`model include: text, audio, and text + audio.
10
+
The `gpt-4o-audio-preview`and `gpt-4o-mini-audio-preview` models introduce the audio modality into the existing `/chat/completions` API. The audio model expands the potential for AI applications in text and voice-based interactions and audio analysis. Modalities supported in `gpt-4o-audio-preview`and `gpt-4o-mini-audio-preview` models include: text, audio, and text + audio.
11
11
12
12
Here's a table of the supported modalities with example use cases:
13
13
@@ -23,9 +23,9 @@ By using audio generation capabilities, you can achieve more dynamic and interac
23
23
24
24
## Supported models
25
25
26
-
Currently only `gpt-4o-audio-preview` version: `2024-12-17` supports audio generation.
26
+
Currently only `gpt-4o-audio-preview`and `gpt-4o-mini-audio-preview`version: `2024-12-17` supports audio generation.
27
27
28
-
The `gpt-4o-audio-preview`model is available for global deployments in [East US 2 and Sweden Central regions](../concepts/models.md#global-standard-model-availability).
28
+
The `gpt-4o-audio-preview`and and `gpt-4o-mini-audio-preview` models are available for global deployments in [East US 2 and Sweden Central regions](../concepts/models.md#global-standard-model-availability).
29
29
30
30
Currently the following voices are supported for audio out: Alloy, Echo, and Shimmer.
0 commit comments