Skip to content

Commit dad2f57

Browse files
authored
Merge pull request #4183 from MicrosoftDocs/main
OOB Publish -- 4/16 - 3:15 PM PST
2 parents 780c79e + 20af0cd commit dad2f57

File tree

10 files changed

+77
-54
lines changed

10 files changed

+77
-54
lines changed

articles/ai-services/openai/concepts/models.md

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about the different model capabilities that are available with Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 04/15/2025
7+
ms.date: 04/16/2025
88
ms.custom: references_regions, build-2023, build-2023-dataai, refefences_regions
99
manager: nitinme
1010
author: mrbullwinkle #ChrisHMSFT
@@ -43,7 +43,7 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
4343

4444
| Model ID | Description | Context Window | Max Output Tokens | Training Data (up to) |
4545
| --- | :--- |:--- |:---|:---: |
46-
| `gpt-4.1` (2025-04-14) <br> <br> **Latest model from Azure OpenAI** | - Text & image input <br> - Text output <br> - Chat completions API <br>- Responses API <br> - Streaming <br> - Function calling <br> Structured outputs (chat completions) | 1,047,576 | 32,768 | May 31, 2024 |
46+
| `gpt-4.1` (2025-04-14) | - Text & image input <br> - Text output <br> - Chat completions API <br>- Responses API <br> - Streaming <br> - Function calling <br> Structured outputs (chat completions) | 1,047,576 | 32,768 | May 31, 2024 |
4747
| `gpt-4.1-nano` (2025-04-14) <br><br> **Fastest 4.1 model** | - Text & image input <br> - Text output <br> - Chat completions API <br>- Responses API <br> - Streaming <br> - Function calling <br> Structured outputs (chat completions) | 1,047,576 | 32,768 | May 31, 2024 |
4848
| `gpt-4.1-mini` (2025-04-14) | - Text & image input <br> - Text output <br> - Chat completions API <br>- Responses API <br> - Streaming <br> - Function calling <br> Structured outputs (chat completions) | 1,047,576 | 32,768 | May 31, 2024 |
4949

@@ -99,8 +99,10 @@ The Azure OpenAI o<sup>&#42;</sup> series models are specifically designed to ta
9999

100100
| Model ID | Description | Max Request (tokens) | Training Data (up to) |
101101
| --- | :--- |:--- |:---: |
102-
| `o3-mini` (2025-01-31) | The latest reasoning model, offering [enhanced reasoning abilities](../how-to/reasoning.md). <br> - Structured outputs<br> - Text-only processing <br> - Functions/Tools | Input: 200,000 <br> Output: 100,000 | Oct 2023 |
103-
| `o1` (2024-12-17) | The most capable model in the o1 series, offering [enhanced reasoning abilities](../how-to/reasoning.md). <br> - Structured outputs<br> - Text, image processing <br> - Functions/Tools | Input: 200,000 <br> Output: 100,000 | Oct 2023 |
102+
| `o4-mini` (2025-04-16) | - **NEW** reasoning model, offering [enhanced reasoning abilities](../how-to/reasoning.md). <br><br> - Chat Completions API <br> - [Responses API](../how-to/responses.md) (**Feature coming soon!**) <br>- Structured outputs<br> - Text, image processing <br> - Functions/Tools/Parallel tool calling <br> [Full summary of capabilities](../how-to/reasoning.md) | Input: 200,000 <br> Output: 100,000 | May 31, 2024 |
103+
| `o3` (2025-04-16) | - **NEW** reasoning model, offering [enhanced reasoning abilities](../how-to/reasoning.md). <br> <br> - Chat Completions API <br> - [Responses API](../how-to/responses.md) (**Feature coming soon!**) <br> - Structured outputs<br> - Text, image processing <br> - Functions/Tools/Parallel tool calling <br> [Full summary of capabilities](../how-to/reasoning.md) | Input: 200,000 <br> Output: 100,000 | May 31, 2024 |
104+
| `o3-mini` (2025-01-31) | - [Enhanced reasoning abilities](../how-to/reasoning.md). <br> - Structured outputs<br> - Text-only processing <br> - Functions/Tools | Input: 200,000 <br> Output: 100,000 | Oct 2023 |
105+
| `o1` (2024-12-17) | - [Enhanced reasoning abilities](../how-to/reasoning.md). <br> - Structured outputs<br> - Text, image processing <br> - Functions/Tools | Input: 200,000 <br> Output: 100,000 | Oct 2023 |
104106
|`o1-preview` (2024-09-12) | Older preview version | Input: 128,000 <br> Output: 32,768 | Oct 2023 |
105107
| `o1-mini` (2024-09-12) | A faster and more cost-efficient option in the o1 series, ideal for coding tasks requiring speed and lower resource consumption. <br><br> Global standard deployment available by default. <br> <br> Standard (regional) deployments are currently only available for select customers who received access as part of the `o1-preview` limited access release. | Input: 128,000 <br> Output: 65,536 | Oct 2023 |
106108

@@ -112,6 +114,8 @@ To learn more about the advanced `o-series` models see, [getting started with re
112114

113115
| Model | Region |
114116
|---|---|
117+
|`o4-mini`| East US2 (Global Standard) <br> Sweden Central (Global Standard) |
118+
| `o3` | East US2 (Global Standard) <br> Sweden Central (Global Standard) |
115119
|`o3-mini` | See the [models table](#model-summary-table-and-region-availability). |
116120
|`o1` | See the [models table](#model-summary-table-and-region-availability). |
117121
| `o1-preview` | See the [models table](#model-summary-table-and-region-availability). This model is only available for customers who were granted access as part of the original limited access |

articles/ai-services/openai/how-to/function-calling.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ ms.author: mbullwin #delegenz
77
ms.service: azure-ai-openai
88
ms.custom: devx-track-python
99
ms.topic: how-to
10-
ms.date: 04/14/2025
10+
ms.date: 04/16/2025
1111
manager: nitinme
1212
---
1313

@@ -43,6 +43,8 @@ At a high level you can break down working with functions into three steps:
4343
* `gpt-4.1` (`2025-04-14`)
4444
* `gpt-4.1-nano` (`2025-04-14`)
4545
* `gpt-4.1-mini` (`2025-04-14`)
46+
* `o4-mini` (`2025-04-16`)
47+
* `o3` (`2025-04-16`)
4648

4749
Support for parallel function was first added in API version [`2023-12-01-preview`](https://github.com/Azure/azure-rest-api-specs/blob/main/specification/cognitiveservices/data-plane/AzureOpenAI/inference/preview/2023-12-01-preview/inference.json)
4850

articles/ai-services/openai/how-to/reasoning.md

Lines changed: 21 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ description: Learn how to use Azure OpenAI's advanced o3-mini, o1, & o1-mini rea
55
manager: nitinme
66
ms.service: azure-ai-openai
77
ms.topic: include
8-
ms.date: 03/07/2025
8+
ms.date: 04/16/2025
99
author: mrbullwinkle
1010
ms.author: mbullwin
1111
---
@@ -28,37 +28,42 @@ Azure OpenAI `o-series` models are designed to tackle reasoning and problem-solv
2828

2929
| Model | Region | Limited access |
3030
|---|---|---|
31+
| `o4-mini` | East US2 (Global Standard) <br><br> Sweden Central (Global Standard) | No access request needed to use the core capabilities of this model.<br><br> Request access: [o4-mini reasoning summary feature](https://aka.ms/oai/o3access) |
32+
| `o3` | East US2 (Global Standard) <br><br> Sweden Central (Global Standard) | Request access: [o3 limited access model application](https://aka.ms/oai/o3access) |
3133
| `o3-mini` | [Model availability](../concepts/models.md#global-standard-model-availability). | Access is no longer restricted for this model. |
3234
|`o1` | [Model availability](../concepts/models.md#global-standard-model-availability). | Access is no longer restricted for this model. |
3335
| `o1-preview` | [Model availability](../concepts/models.md#global-standard-model-availability). |This model is only available for customers who were granted access as part of the original limited access release. We're currently not expanding access to `o1-preview`. |
3436
| `o1-mini` | [Model availability](../concepts/models.md#global-standard-model-availability). | No access request needed for Global Standard deployments.<br><br>Standard (regional) deployments are currently only available to select customers who were previously granted access as part of the `o1-preview` release.|
3537

3638
## API & feature support
3739

38-
| **Feature** | **o3-mini**, **2025-01-31** |**o1**, **2024-12-17** | **o1-preview**, **2024-09-12** | **o1-mini**, **2024-09-12** |
39-
|:-------------------|:--------------------------:|:--------------------------:|:-------------------------------:|:---:|
40-
| **API Version** | `2024-12-01-preview` or later <br> `2025-03-01-preview` (Recommended) | `2024-12-01-preview` or later <br> `2025-03-01-preview` (Recommended) | `2024-09-01-preview` or later <br> `2025-03-01-preview` (Recommended) | `2024-09-01-preview` or later <br> `2025-03-01-preview` (Recommended) |
41-
| **[Developer Messages](#developer-messages)** ||| - | - |
42-
| **[Structured Outputs](./structured-outputs.md)** ||| - | - |
43-
| **[Context Window](../concepts/models.md#o-series-models)** | Input: 200,000 <br> Output: 100,000 | Input: 200,000 <br> Output: 100,000 | Input: 128,000 <br> Output: 32,768 | Input: 128,000 <br> Output: 65,536 |
44-
| **[Reasoning effort](#reasoning-effort)** ||| - | - |
45-
| **[Vision Support](./gpt-with-vision.md)** | - || - | - |
46-
| Functions/Tools ||| - | - |
47-
| `max_completion_tokens`<sup>*</sup> |||||
48-
| System Messages<sup>**</sup> ||| - | - |
49-
| Streaming || - | - | - |
40+
| **Feature** | **o4-mini**, **2025-04-16** | **o3**, **2025-04-16** | **o3-mini**, **2025-01-31** |**o1**, **2024-12-17** | **o1-preview**, **2024-09-12** | **o1-mini**, **2024-09-12** |
41+
|:-------------------|:--------------------------:|:-----:|:-------:|:--------------------------:|:-------------------------------:|:---:|
42+
| **API Version** | `2025-03-01-preview` | `2025-03-01-preview` | `2024-12-01-preview` or later <br> `2025-03-01-preview` (Recommended) | `2024-12-01-preview` or later <br> `2025-03-01-preview` (Recommended) | `2024-09-01-preview` or later <br> `2025-03-01-preview` (Recommended) | `2024-09-01-preview` or later <br> `2025-03-01-preview` (Recommended) |
43+
| **[Developer Messages](#developer-messages)** ||||| - | - |
44+
| **[Structured Outputs](./structured-outputs.md)** ||||| - | - |
45+
| **[Context Window](../concepts/models.md#o-series-models)** | Input: 200,000 <br> Output: 100,000 | Input: 200,000 <br> Output: 100,000 | Input: 200,000 <br> Output: 100,000 | Input: 200,000 <br> Output: 100,000 | Input: 128,000 <br> Output: 32,768 | Input: 128,000 <br> Output: 65,536 |
46+
| **[Reasoning effort](#reasoning-effort)** ||||| - | - |
47+
| **[Vision Support](./gpt-with-vision.md)** ||| - || - | - |
48+
| Chat Completions API |||||||
49+
| Responses API | ✅ (**Feature coming soon!**) | ✅ (**Feature coming soon!**) | - | - | - | - |
50+
| Functions/Tools ||||| - | - |
51+
| Parallel Tool Calls ||| - | - | - | - |
52+
| `max_completion_tokens`<sup>*</sup> |||||||
53+
| System Messages<sup>**</sup> ||||| - | - |
54+
| Reasoning summary <sup>***</sup> | ✅ (**Feature coming soon!**) | ✅ (**Feature coming soon!**) | - | - | - | - |
55+
| Streaming |||| - | - | - |
5056

5157
<sup>*</sup> Reasoning models will only work with the `max_completion_tokens` parameter. <br><br>
5258

53-
<sup>**</sup>The latest o<sup>&#42;</sup> series model support system messages to make migration easier. When you use a system message with `o3-mini` and `o1` it will be treated as a developer message. You should not use both a developer message and a system message in the same API request.
54-
59+
<sup>**</sup>The latest o<sup>&#42;</sup> series model support system messages to make migration easier. When you use a system message with `o4-mini`, `o3`, `o3-mini`, and `o1` it will be treated as a developer message. You should not use both a developer message and a system message in the same API request.
5560

61+
<sup>***</sup> Access to the chain-of-thought reasoning summary is limited access only for `o4-mini`.
5662

5763
### Not Supported
5864

5965
The following are currently unsupported with reasoning models:
6066

61-
- Parallel tool calling
6267
- `temperature`, `top_p`, `presence_penalty`, `frequency_penalty`, `logprobs`, `top_logprobs`, `logit_bias`, `max_tokens`
6368

6469
## Usage

articles/ai-services/openai/how-to/structured-outputs.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ services: cognitive-services
66
manager: nitinme
77
ms.service: azure-ai-openai
88
ms.topic: how-to
9-
ms.date: 04/14/2025
9+
ms.date: 04/16/2025
1010
author: mrbullwinkle
1111
ms.author: mbullwin
1212
recommendations: false
@@ -34,6 +34,8 @@ Structured outputs make a model follow a [JSON Schema](https://json-schema.org/o
3434
- `gpt-4.1` version `2025-04-14`
3535
- `gpt-4.1-nano` version `2025-04-14`
3636
- `gpt-4.1-mini` version: `2025-04-14`
37+
- `o4-mini` version: `2025-04-16`
38+
- `o3` version: `2025-04-16`
3739

3840
## API support
3941

0 commit comments

Comments
 (0)