Skip to content

Commit 20af0cd

Browse files
authored
Merge pull request #4172 from MicrosoftDocs/release-2025-openai-o1-pro
[Azure OpenAI] Release branch to main tracking branch
2 parents 2a558e5 + 8af67e0 commit 20af0cd

File tree

7 files changed

+49
-26
lines changed

7 files changed

+49
-26
lines changed

articles/ai-services/openai/concepts/models.md

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about the different model capabilities that are available with Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 04/15/2025
7+
ms.date: 04/16/2025
88
ms.custom: references_regions, build-2023, build-2023-dataai, refefences_regions
99
manager: nitinme
1010
author: mrbullwinkle #ChrisHMSFT
@@ -43,7 +43,7 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
4343

4444
| Model ID | Description | Context Window | Max Output Tokens | Training Data (up to) |
4545
| --- | :--- |:--- |:---|:---: |
46-
| `gpt-4.1` (2025-04-14) <br> <br> **Latest model from Azure OpenAI** | - Text & image input <br> - Text output <br> - Chat completions API <br>- Responses API <br> - Streaming <br> - Function calling <br> Structured outputs (chat completions) | 1,047,576 | 32,768 | May 31, 2024 |
46+
| `gpt-4.1` (2025-04-14) | - Text & image input <br> - Text output <br> - Chat completions API <br>- Responses API <br> - Streaming <br> - Function calling <br> Structured outputs (chat completions) | 1,047,576 | 32,768 | May 31, 2024 |
4747
| `gpt-4.1-nano` (2025-04-14) <br><br> **Fastest 4.1 model** | - Text & image input <br> - Text output <br> - Chat completions API <br>- Responses API <br> - Streaming <br> - Function calling <br> Structured outputs (chat completions) | 1,047,576 | 32,768 | May 31, 2024 |
4848
| `gpt-4.1-mini` (2025-04-14) | - Text & image input <br> - Text output <br> - Chat completions API <br>- Responses API <br> - Streaming <br> - Function calling <br> Structured outputs (chat completions) | 1,047,576 | 32,768 | May 31, 2024 |
4949

@@ -99,8 +99,10 @@ The Azure OpenAI o<sup>&#42;</sup> series models are specifically designed to ta
9999

100100
| Model ID | Description | Max Request (tokens) | Training Data (up to) |
101101
| --- | :--- |:--- |:---: |
102-
| `o3-mini` (2025-01-31) | The latest reasoning model, offering [enhanced reasoning abilities](../how-to/reasoning.md). <br> - Structured outputs<br> - Text-only processing <br> - Functions/Tools | Input: 200,000 <br> Output: 100,000 | Oct 2023 |
103-
| `o1` (2024-12-17) | The most capable model in the o1 series, offering [enhanced reasoning abilities](../how-to/reasoning.md). <br> - Structured outputs<br> - Text, image processing <br> - Functions/Tools | Input: 200,000 <br> Output: 100,000 | Oct 2023 |
102+
| `o4-mini` (2025-04-16) | - **NEW** reasoning model, offering [enhanced reasoning abilities](../how-to/reasoning.md). <br><br> - Chat Completions API <br> - [Responses API](../how-to/responses.md) (**Feature coming soon!**) <br>- Structured outputs<br> - Text, image processing <br> - Functions/Tools/Parallel tool calling <br> [Full summary of capabilities](../how-to/reasoning.md) | Input: 200,000 <br> Output: 100,000 | May 31, 2024 |
103+
| `o3` (2025-04-16) | - **NEW** reasoning model, offering [enhanced reasoning abilities](../how-to/reasoning.md). <br> <br> - Chat Completions API <br> - [Responses API](../how-to/responses.md) (**Feature coming soon!**) <br> - Structured outputs<br> - Text, image processing <br> - Functions/Tools/Parallel tool calling <br> [Full summary of capabilities](../how-to/reasoning.md) | Input: 200,000 <br> Output: 100,000 | May 31, 2024 |
104+
| `o3-mini` (2025-01-31) | - [Enhanced reasoning abilities](../how-to/reasoning.md). <br> - Structured outputs<br> - Text-only processing <br> - Functions/Tools | Input: 200,000 <br> Output: 100,000 | Oct 2023 |
105+
| `o1` (2024-12-17) | - [Enhanced reasoning abilities](../how-to/reasoning.md). <br> - Structured outputs<br> - Text, image processing <br> - Functions/Tools | Input: 200,000 <br> Output: 100,000 | Oct 2023 |
104106
|`o1-preview` (2024-09-12) | Older preview version | Input: 128,000 <br> Output: 32,768 | Oct 2023 |
105107
| `o1-mini` (2024-09-12) | A faster and more cost-efficient option in the o1 series, ideal for coding tasks requiring speed and lower resource consumption. <br><br> Global standard deployment available by default. <br> <br> Standard (regional) deployments are currently only available for select customers who received access as part of the `o1-preview` limited access release. | Input: 128,000 <br> Output: 65,536 | Oct 2023 |
106108

@@ -112,6 +114,8 @@ To learn more about the advanced `o-series` models see, [getting started with re
112114

113115
| Model | Region |
114116
|---|---|
117+
|`o4-mini`| East US2 (Global Standard) <br> Sweden Central (Global Standard) |
118+
| `o3` | East US2 (Global Standard) <br> Sweden Central (Global Standard) |
115119
|`o3-mini` | See the [models table](#model-summary-table-and-region-availability). |
116120
|`o1` | See the [models table](#model-summary-table-and-region-availability). |
117121
| `o1-preview` | See the [models table](#model-summary-table-and-region-availability). This model is only available for customers who were granted access as part of the original limited access |

articles/ai-services/openai/how-to/function-calling.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ ms.author: mbullwin #delegenz
77
ms.service: azure-ai-openai
88
ms.custom: devx-track-python
99
ms.topic: how-to
10-
ms.date: 04/14/2025
10+
ms.date: 04/16/2025
1111
manager: nitinme
1212
---
1313

@@ -43,6 +43,8 @@ At a high level you can break down working with functions into three steps:
4343
* `gpt-4.1` (`2025-04-14`)
4444
* `gpt-4.1-nano` (`2025-04-14`)
4545
* `gpt-4.1-mini` (`2025-04-14`)
46+
* `o4-mini` (`2025-04-16`)
47+
* `o3` (`2025-04-16`)
4648

4749
Support for parallel function was first added in API version [`2023-12-01-preview`](https://github.com/Azure/azure-rest-api-specs/blob/main/specification/cognitiveservices/data-plane/AzureOpenAI/inference/preview/2023-12-01-preview/inference.json)
4850

articles/ai-services/openai/how-to/reasoning.md

Lines changed: 21 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ description: Learn how to use Azure OpenAI's advanced o3-mini, o1, & o1-mini rea
55
manager: nitinme
66
ms.service: azure-ai-openai
77
ms.topic: include
8-
ms.date: 03/07/2025
8+
ms.date: 04/16/2025
99
author: mrbullwinkle
1010
ms.author: mbullwin
1111
---
@@ -28,37 +28,42 @@ Azure OpenAI `o-series` models are designed to tackle reasoning and problem-solv
2828

2929
| Model | Region | Limited access |
3030
|---|---|---|
31+
| `o4-mini` | East US2 (Global Standard) <br><br> Sweden Central (Global Standard) | No access request needed to use the core capabilities of this model.<br><br> Request access: [o4-mini reasoning summary feature](https://aka.ms/oai/o3access) |
32+
| `o3` | East US2 (Global Standard) <br><br> Sweden Central (Global Standard) | Request access: [o3 limited access model application](https://aka.ms/oai/o3access) |
3133
| `o3-mini` | [Model availability](../concepts/models.md#global-standard-model-availability). | Access is no longer restricted for this model. |
3234
|`o1` | [Model availability](../concepts/models.md#global-standard-model-availability). | Access is no longer restricted for this model. |
3335
| `o1-preview` | [Model availability](../concepts/models.md#global-standard-model-availability). |This model is only available for customers who were granted access as part of the original limited access release. We're currently not expanding access to `o1-preview`. |
3436
| `o1-mini` | [Model availability](../concepts/models.md#global-standard-model-availability). | No access request needed for Global Standard deployments.<br><br>Standard (regional) deployments are currently only available to select customers who were previously granted access as part of the `o1-preview` release.|
3537

3638
## API & feature support
3739

38-
| **Feature** | **o3-mini**, **2025-01-31** |**o1**, **2024-12-17** | **o1-preview**, **2024-09-12** | **o1-mini**, **2024-09-12** |
39-
|:-------------------|:--------------------------:|:--------------------------:|:-------------------------------:|:---:|
40-
| **API Version** | `2024-12-01-preview` or later <br> `2025-03-01-preview` (Recommended) | `2024-12-01-preview` or later <br> `2025-03-01-preview` (Recommended) | `2024-09-01-preview` or later <br> `2025-03-01-preview` (Recommended) | `2024-09-01-preview` or later <br> `2025-03-01-preview` (Recommended) |
41-
| **[Developer Messages](#developer-messages)** ||| - | - |
42-
| **[Structured Outputs](./structured-outputs.md)** ||| - | - |
43-
| **[Context Window](../concepts/models.md#o-series-models)** | Input: 200,000 <br> Output: 100,000 | Input: 200,000 <br> Output: 100,000 | Input: 128,000 <br> Output: 32,768 | Input: 128,000 <br> Output: 65,536 |
44-
| **[Reasoning effort](#reasoning-effort)** ||| - | - |
45-
| **[Vision Support](./gpt-with-vision.md)** | - || - | - |
46-
| Functions/Tools ||| - | - |
47-
| `max_completion_tokens`<sup>*</sup> |||||
48-
| System Messages<sup>**</sup> ||| - | - |
49-
| Streaming || - | - | - |
40+
| **Feature** | **o4-mini**, **2025-04-16** | **o3**, **2025-04-16** | **o3-mini**, **2025-01-31** |**o1**, **2024-12-17** | **o1-preview**, **2024-09-12** | **o1-mini**, **2024-09-12** |
41+
|:-------------------|:--------------------------:|:-----:|:-------:|:--------------------------:|:-------------------------------:|:---:|
42+
| **API Version** | `2025-03-01-preview` | `2025-03-01-preview` | `2024-12-01-preview` or later <br> `2025-03-01-preview` (Recommended) | `2024-12-01-preview` or later <br> `2025-03-01-preview` (Recommended) | `2024-09-01-preview` or later <br> `2025-03-01-preview` (Recommended) | `2024-09-01-preview` or later <br> `2025-03-01-preview` (Recommended) |
43+
| **[Developer Messages](#developer-messages)** ||||| - | - |
44+
| **[Structured Outputs](./structured-outputs.md)** ||||| - | - |
45+
| **[Context Window](../concepts/models.md#o-series-models)** | Input: 200,000 <br> Output: 100,000 | Input: 200,000 <br> Output: 100,000 | Input: 200,000 <br> Output: 100,000 | Input: 200,000 <br> Output: 100,000 | Input: 128,000 <br> Output: 32,768 | Input: 128,000 <br> Output: 65,536 |
46+
| **[Reasoning effort](#reasoning-effort)** ||||| - | - |
47+
| **[Vision Support](./gpt-with-vision.md)** ||| - || - | - |
48+
| Chat Completions API |||||||
49+
| Responses API | ✅ (**Feature coming soon!**) | ✅ (**Feature coming soon!**) | - | - | - | - |
50+
| Functions/Tools ||||| - | - |
51+
| Parallel Tool Calls ||| - | - | - | - |
52+
| `max_completion_tokens`<sup>*</sup> |||||||
53+
| System Messages<sup>**</sup> ||||| - | - |
54+
| Reasoning summary <sup>***</sup> | ✅ (**Feature coming soon!**) | ✅ (**Feature coming soon!**) | - | - | - | - |
55+
| Streaming |||| - | - | - |
5056

5157
<sup>*</sup> Reasoning models will only work with the `max_completion_tokens` parameter. <br><br>
5258

53-
<sup>**</sup>The latest o<sup>&#42;</sup> series model support system messages to make migration easier. When you use a system message with `o3-mini` and `o1` it will be treated as a developer message. You should not use both a developer message and a system message in the same API request.
54-
59+
<sup>**</sup>The latest o<sup>&#42;</sup> series model support system messages to make migration easier. When you use a system message with `o4-mini`, `o3`, `o3-mini`, and `o1` it will be treated as a developer message. You should not use both a developer message and a system message in the same API request.
5560

61+
<sup>***</sup> Access to the chain-of-thought reasoning summary is limited access only for `o4-mini`.
5662

5763
### Not Supported
5864

5965
The following are currently unsupported with reasoning models:
6066

61-
- Parallel tool calling
6267
- `temperature`, `top_p`, `presence_penalty`, `frequency_penalty`, `logprobs`, `top_logprobs`, `logit_bias`, `max_tokens`
6368

6469
## Usage

articles/ai-services/openai/how-to/structured-outputs.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ services: cognitive-services
66
manager: nitinme
77
ms.service: azure-ai-openai
88
ms.topic: how-to
9-
ms.date: 04/14/2025
9+
ms.date: 04/16/2025
1010
author: mrbullwinkle
1111
ms.author: mbullwin
1212
recommendations: false
@@ -34,6 +34,8 @@ Structured outputs make a model follow a [JSON Schema](https://json-schema.org/o
3434
- `gpt-4.1` version `2025-04-14`
3535
- `gpt-4.1-nano` version `2025-04-14`
3636
- `gpt-4.1-mini` version: `2025-04-14`
37+
- `o4-mini` version: `2025-04-16`
38+
- `o3` version: `2025-04-16`
3739

3840
## API support
3941

articles/ai-services/openai/overview.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -7,20 +7,20 @@ author: mrbullwinkle
77
ms.author: mbullwin
88
ms.service: azure-ai-openai
99
ms.topic: overview
10-
ms.date: 04/14/2025
10+
ms.date: 04/16/2025
1111
ms.custom: build-2023, build-2023-dataai
1212
recommendations: false
1313
---
1414

1515
# What is Azure OpenAI Service?
1616

17-
Azure OpenAI Service provides REST API access to OpenAI's powerful language models including gpt-4.1, o3-mini, o1, o1-mini, GPT-4o, GPT-4o mini, GPT-4 Turbo with Vision, GPT-4, GPT-3.5-Turbo, and Embeddings model series. These models can be easily adapted to your specific task including but not limited to content generation, summarization, image understanding, semantic search, and natural language to code translation. Users can access the service through REST APIs, [Python/C#/JS/Java/Go SDKs](/azure/ai-services/openai/supported-languages).
17+
Azure OpenAI Service provides REST API access to OpenAI's powerful language models including o4-mini, o3, gpt-4.1, o3-mini, o1, o1-mini, GPT-4o, GPT-4o mini, GPT-4 Turbo with Vision, GPT-4, GPT-3.5-Turbo, and Embeddings model series. These models can be easily adapted to your specific task including but not limited to content generation, summarization, image understanding, semantic search, and natural language to code translation. Users can access the service through REST APIs, [Python/C#/JS/Java/Go SDKs](/azure/ai-services/openai/supported-languages).
1818

1919
### Features overview
2020

2121
| Feature | Azure OpenAI |
2222
| --- | --- |
23-
| Models available | [gpt-4.1](./concepts/models.md#gpt-41-series) <br> [**computer-use-preview**](./concepts/models.md#computer-use-preview)<br>[**o3-mini & o1**](./how-to/reasoning.md) <br>[**o1-mini**](./how-to/reasoning.md)<br>**GPT-4o & GPT-4o mini**<br> **GPT-4 series (including GPT-4 Turbo with Vision)** <br>**GPT-3.5-Turbo series**<br> Embeddings series <br> Learn more in our [Models](./concepts/models.md) page.|
23+
| Models available | [o4-mini & o3](./how-to/reasoning.md) <br>[gpt-4.1](./concepts/models.md#gpt-41-series) <br> [**computer-use-preview**](./concepts/models.md#computer-use-preview)<br>[**o3-mini & o1**](./how-to/reasoning.md) <br>[**o1-mini**](./how-to/reasoning.md)<br>**GPT-4o & GPT-4o mini**<br> **GPT-4 series (including GPT-4 Turbo with Vision)** <br>**GPT-3.5-Turbo series**<br> Embeddings series <br> Learn more in our [Models](./concepts/models.md) page.|
2424
| Fine-tuning | `GPT-4o-mini` (preview) <br> `GPT-4` (preview) <br>`GPT-3.5-Turbo` (0613).|
2525
| Price | [Available here](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) <br> For details on vision-enabled chat models, see the [special pricing information](../openai/concepts/gpt-with-vision.md#special-pricing-information).|
2626
| Virtual network support & private link support | Yes. |

articles/ai-services/openai/quotas-limits.md

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -93,6 +93,8 @@ The following sections provide you with a quick guide to the default quotas and
9393
>
9494
> - **Older chat models:** 1 unit of capacity = 6 RPM and 1,000 TPM.
9595
> - **o1 & o1-preview:** 1 unit of capacity = 1 RPM and 6,000 TPM.
96+
> - **o3** 1 unit of capacity = 1 RPM per 1,000 TPM
97+
> - **o4-mini** 1 unit of capacity = 1 RPM per 1,000 TPM
9698
> - **o3-mini:** 1 unit of capacity = 1 RPM per 10,000 TPM.
9799
> - **o1-mini:** 1 unit of capacity = 1 RPM per 10,000 TPM.
98100
>
@@ -104,9 +106,13 @@ The following sections provide you with a quick guide to the default quotas and
104106

105107
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
106108
|---|---|:---:|:---:|
109+
| `o4-mini` | Enterprise agreement | 10 M | 10 K |
110+
| `o3` | Enterprise agreement | 10 M | 10 K |
107111
| `o3-mini` | Enterprise agreement | 50 M | 5 K |
108112
| `o1` & `o1-preview` | Enterprise agreement | 30 M | 5 K |
109113
| `o1-mini`| Enterprise agreement | 50 M | 5 K |
114+
| `o4-mini` | Default | 1 M | 1 K |
115+
| `o3` | Default | 1 M | 1 K |
110116
| `o3-mini` | Default | 5 M | 500 |
111117
| `o1` & `o1-preview` | Default | 3 M | 500 |
112118
| `o1-mini`| Default | 5 M | 500 |

articles/ai-services/openai/whats-new.md

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@ ms.custom:
1111
- references_regions
1212
- ignite-2024
1313
ms.topic: whats-new
14-
ms.date: 04/14/2025
14+
ms.date: 04/16/2025
1515
recommendations: false
1616
---
1717

@@ -21,6 +21,10 @@ This article provides a summary of the latest releases and major documentation u
2121

2222
## April 2025
2323

24+
### o4-mini and o3 models released
25+
26+
`o4-mini` and `o3` models are now available. These are the latest reasoning models from Azure OpenAI offering significantly enhanced reasoning, quality, and performance. For more information, see the [getting started with reasoning models page](./how-to/reasoning.md).
27+
2428
### GPT-4.1 released
2529

2630
GPT 4.1 and GPT 4.1-nano are now available. These are the latest models from Azure OpenAI. GPT 4.1 has a 1 million token context limit. For more information, see the [models page](./concepts/models.md#gpt-41-series).

0 commit comments

Comments
 (0)