Skip to content

Commit 3db78c4

Browse files
authored
Merge pull request #7224 from MicrosoftDocs/release-2025-openai-sep-15
[Release Branch --> Main tracking branch] - release-2025-openai-sep-15
2 parents 74368b4 + a39c14b commit 3db78c4

File tree

3 files changed

+41
-35
lines changed

3 files changed

+41
-35
lines changed

articles/ai-foundry/openai/how-to/reasoning.md

Lines changed: 24 additions & 23 deletions
Original file line numberDiff line numberDiff line change
@@ -28,13 +28,14 @@ Azure OpenAI reasoning models are designed to tackle reasoning and problem-solvi
2828

2929
| Model | Region | Limited access |
3030
|---|---|---|
31-
| `gpt-5` | [Model availability](../concepts/models.md#global-standard-model-availability) | Request access: [gpt-5 limited access model application](https://aka.ms/oai/gpt5access). If you already have `o3 access` no request is required |
31+
| `gpt-5-codex` | East US2 & Sweden Central (Global Standard) | Request access: [Limited access model application](https://aka.ms/oai/gpt5access) |
32+
| `gpt-5` | [Model availability](../concepts/models.md#global-standard-model-availability) | Request access: [Limited access model application](https://aka.ms/oai/gpt5access). If you already have `o3 access` no request is required |
3233
| `gpt-5-mini` | [Model availability](../concepts/models.md#global-standard-model-availability) | No access request needed. |
3334
| `gpt-5-nano` | [Model availability](../concepts/models.md#global-standard-model-availability) | No access request needed. |
34-
| `o3-pro` | East US2 & Sweden Central (Global Standard) | Request access: [o3 limited access model application](https://aka.ms/oai/o3access). If you already have `o3 access` no request is required. |
35+
| `o3-pro` | East US2 & Sweden Central (Global Standard) | Request access: [Limited access model application](https://aka.ms/oai/o3access). If you already have `o3 access` no request is required. |
3536
| `codex-mini` | East US2 & Sweden Central (Global Standard) | No access request needed. |
3637
| `o4-mini` | [Model availability](../concepts/models.md#global-standard-model-availability) | No access request needed to use the core capabilities of this model.<br><br> Request access: [o4-mini reasoning summary feature](https://aka.ms/oai/o3access) |
37-
| `o3` | [Model availability](../concepts/models.md#global-standard-model-availability) | Request access: [o3 limited access model application](https://aka.ms/oai/o3access) |
38+
| `o3` | [Model availability](../concepts/models.md#global-standard-model-availability) | Request access: [Limited access model application](https://aka.ms/oai/o3access) |
3839
| `o3-mini` | [Model availability](../concepts/models.md#global-standard-model-availability). | Access is no longer restricted for this model. |
3940
|`o1` | [Model availability](../concepts/models.md#global-standard-model-availability). | Access is no longer restricted for this model. |
4041
| `o1-mini` | [Model availability](../concepts/models.md#global-standard-model-availability). | No access request needed for Global Standard deployments.<br><br>Standard (regional) deployments are currently only available to select customers who were previously granted access as part of the `o1-preview` release.|
@@ -43,40 +44,40 @@ Azure OpenAI reasoning models are designed to tackle reasoning and problem-solvi
4344

4445
# [GPT-5 Reasoning Models](#tab/gpt-5)
4546

46-
47-
| **Feature** | **gpt-5**, **2025-08-07** | **gpt-5-mini**, **2025-08-07** | **gpt-5-nano**, **2025-08-07** |
48-
|:-------------------|:--------------------------:|:------:|:--------:|
49-
| **API Version** | [v1](../api-version-lifecycle.md#api-evolution) | [v1](../api-version-lifecycle.md#api-evolution) | [v1](../api-version-lifecycle.md#api-evolution) |
50-
| **[Developer Messages](#developer-messages)** ||||
51-
| **[Structured Outputs](./structured-outputs.md)** ||||
52-
| **[Context Window](../concepts/models.md#o-series-models)** | 400,000 <br><br>Input: 272,000 <br> Output: 128,000 | 400,000 <br><br> Input: 272,000 <br> Output: 128,000 | 400,000 <br><br> Input: 272,000 <br> Output: 128,000 |
53-
| **[Reasoning effort](#reasoning-effort)** ||||
54-
| **[Image input](./gpt-with-vision.md)** ||||
55-
| Chat Completions API ||||
56-
| Responses API ||||
57-
| Functions/Tools ||||
58-
| Parallel Tool Calls<sup>1</sup> ||||
59-
| `max_completion_tokens` <sup>2</sup> ||||
60-
| System Messages <sup>3</sup> ||||
61-
| [Reasoning summary](#reasoning-summary) ||||
62-
| Streaming ||||
47+
| **Feature** | **gpt-5-codex**, **2025-09-011** | **gpt-5**, **2025-08-07** | **gpt-5-mini**, **2025-08-07** | **gpt-5-nano**, **2025-08-07** |
48+
|:-------------------|:--------------------------:|:--------------------------:|:------:|:--------:|
49+
| **API Version** | [v1](../api-version-lifecycle.md#api-evolution) | [v1](../api-version-lifecycle.md#api-evolution) | [v1](../api-version-lifecycle.md#api-evolution) | [v1](../api-version-lifecycle.md#api-evolution) |
50+
| **[Developer Messages](#developer-messages)** |||||
51+
| **[Structured Outputs](./structured-outputs.md)** |||||
52+
| **[Context Window](../concepts/models.md#o-series-models)** | 400,000 <br><br>Input: 272,000 <br> Output: 128,000 | 400,000 <br><br>Input: 272,000 <br> Output: 128,000 | 400,000 <br><br> Input: 272,000 <br> Output: 128,000 | 400,000 <br><br> Input: 272,000 <br> Output: 128,000 |
53+
| **[Reasoning effort](#reasoning-effort)** |||||
54+
| **[Image input](./gpt-with-vision.md)** |||||
55+
| Chat Completions API | - ||||
56+
| Responses API |||||
57+
| Functions/Tools |||||
58+
| Parallel Tool Calls<sup>1</sup> |||||
59+
| `max_completion_tokens` <sup>2</sup> | - ||||
60+
| System Messages <sup>3</sup> |||||
61+
| [Reasoning summary](#reasoning-summary) |||||
62+
| Streaming |||||
6363

6464
<sup>1</sup> Parallel tool calls are not supported when `reasoning_effort` is set to `minimal`<br><br>
6565
<sup>2</sup> Reasoning models will only work with the `max_completion_tokens` parameter when using the Chat Completions API. Use `max_output_tokens` with the Responses API. <br><br>
6666
<sup>3</sup> The latest reasoning models support system messages to make migration easier. You should not use both a developer message and a system message in the same API request.<br><br>
6767

68-
6968
### NEW GPT-5 reasoning features
7069

7170
| Feature | Description |
7271
|----|----|
73-
|`reasoning_effort` | `minimal` is now supported with GPT-5 series reasoning models <br><br> **Options**: `minimal`, `low`, `medium`, `high`|
74-
|`verbosity` | A new parameter giving you more granular control over how concise the model's output will be.<br><br>**Options:** `low`, `medium`, `high`. |
72+
|`reasoning_effort` | `minimal` is now supported with GPT-5 series reasoning models<sup>*</sup> <br><br> **Options**: `minimal`, `low`, `medium`, `high`|
73+
|`verbosity` | A new parameter providing more granular control over how concise the model's output will be.<br><br>**Options:** `low`, `medium`, `high`. |
7574
| `preamble` | GPT-5 series reasoning models have the ability to spend extra time *"thinking"* before executing a function/tool call.<br><br> When this planning occurs the model can provide insight into the planning steps in the model response via a new object called the `preamble` object.<br><br> Generation of preambles in the model response is not guaranteed though you can encourage the model by using the `instructions` parameter and passing content like "You MUST plan extensively before each function call. ALWAYS output your plan to the user before calling any function"|
7675
| **allowed tools** | You can specify multiple tools under `tool_choice` instead of just one. |
7776
| **custom tool type** | Enables raw text (non-json) outputs |
7877
| [`lark_tool`](#python-lark) | Allows you to use some of the capabilities of [Python lark](https://github.com/lark-parser/lark) for more flexible constraining of model responses |
7978

79+
<sup>*</sup> `gpt-5-codex` does not support `reasoning_effort` minimal.
80+
8081
For more information, we also recommend reading OpenAI's [GPT-5 prompting cookbook guide](https://cookbook.openai.com/examples/gpt-5/gpt-5_prompting_guide) and their [GPT-5 feature guide](https://platform.openai.com/docs/guides/latest-model).
8182

8283
# [O-Series Reasoning Models](#tab/o-series)

articles/ai-foundry/openai/includes/models-azure-direct-openai.md

Lines changed: 6 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -36,19 +36,21 @@ ms.topic: include
3636
| `gpt-5-mini` (2025-08-07) | See the [models table](#model-summary-table-and-region-availability).|
3737
| `gpt-5-nano` (2025-08-07) | See the [models table](#model-summary-table-and-region-availability).|
3838
| `gpt-5-chat` (2025-08-07) | See the [models table](#model-summary-table-and-region-availability).|
39+
| `gpt-5-codex` (2025-09-11) | East US2 (Global Standard) and Sweden Central (Global Standard) |
3940

40-
- **[Registration is required for access to the gpt-5 model](https://aka.ms/oai/gpt5access).**
41+
- **[Registration is required for access to the gpt-5 & gpt-5-codex models](https://aka.ms/oai/gpt5access).**
4142

4243
- `gpt-5-mini`, `gpt-5-nano`, and `gpt-5-chat` do not require registration.
4344

4445
Access will be granted based on Microsoft's eligibility criteria. Customers who previously applied and received access to `o3`, don't need to reapply as their approved subscriptions will automatically be granted access upon model release.
4546

4647
| Model ID | Description | Context Window | Max Output Tokens | Training Data (up to) |
4748
| --- | :--- |:--- |:---|:---: |
48-
| `gpt-5` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md). | 400,000<br><br>Input: 272,000<br>Output: 128,000 | 128,000 | October 24, 2024 |
49-
| `gpt-5-mini` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md). | 400,000<br><br>Input: 272,000<br>Output: 128,000 | 128,000 | June 24, 2024 |
50-
| `gpt-5-nano` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md). | 400,000<br><br>Input: 272,000<br>Output: 128,000 | 128,000 | May 31, 2024 |
49+
| `gpt-5` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> - [Full summary of capabilities](../how-to/reasoning.md). | 400,000<br><br>Input: 272,000<br>Output: 128,000 | 128,000 | October 24, 2024 |
50+
| `gpt-5-mini` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> - [Full summary of capabilities](../how-to/reasoning.md). | 400,000<br><br>Input: 272,000<br>Output: 128,000 | 128,000 | June 24, 2024 |
51+
| `gpt-5-nano` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> - [Full summary of capabilities](../how-to/reasoning.md). | 400,000<br><br>Input: 272,000<br>Output: 128,000 | 128,000 | May 31, 2024 |
5152
| `gpt-5-chat` (2025-08-07)<br>**Preview** | - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - **Input**: Text/Image <br> - **Output**: Text only | 128,000 | 16,384 | October 24, 2024 |
53+
| `gpt-5-codex` (2025-09-11) | - [Responses API](../how-to/responses.md) only. <br> - **Input**: Text/Image <br> - **Output**: Text only <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> - [Full summary of capabilities](../how-to/reasoning.md) | 400,000<br><br>Input: 272,000<br>Output: 128,000 | 128,000 | - |
5254

5355
## gpt-oss
5456

articles/ai-foundry/openai/quotas-limits.md

Lines changed: 11 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -75,17 +75,20 @@ The following section provides you with a quick guide to the default quotas and
7575

7676
| Model | Global Default<br>Tokens per minute (TPM) | Global Enterprise and MCA-E <br>Tokens per minute (TPM) | Data Zone Default <br>Tokens per minute (TPM) | Data Zone Enterprise and MCA-E <br>Tokens per minute (TPM) |
7777
|-------------|----------------|-------------------|-------------------|----------------------|
78-
| gpt-5 | 1 M | 10 M | 300 K | 3 M |
79-
| gpt-5-mini | 1 M | 10 M | 300 K | 3 M |
80-
| gpt-5-nano | 5 M | 150 M | 2 M | 50 M |
81-
| gpt-5-chat | 1 M | 5 M | N/A | N/A |
78+
| `gpt-5` | 1 M | 10 M | 300 K | 3 M |
79+
| `gpt-5-mini` | 1 M | 10 M | 300 K | 3 M |
80+
| `gpt-5-nano` | 5 M | 150 M | 2 M | 50 M |
81+
| `gpt-5-chat` | 1 M | 5 M | N/A | N/A |
82+
| `gpt-5-codex` | 1 M | 10 M | N/A | N/A |
8283

8384
| Model | Global Default<br>Requests per minute (RPM) | Global Enterprise and MCA-E <br>Requests per minute (RPM) | Data Zone Default <br>Requests per minute (RPM) | Data Zone Enterprise and MCA-E <br>Requests per minute (RPM) |
8485
|-------------|----------------------------------------------|------------------------------------------------------------|--------------------------------------------------|--------------------------------------------------------------|
85-
| gpt-5 | 10 K | 100 K | 3 K | 30 K |
86-
| gpt-5-mini | 1 K | 10 K | 300 | 3 K |
87-
| gpt-5-nano | 5 K | 150 K | 2 K | 50 K |
88-
| gpt-5-chat | 1 K | 5 K | N/A | N/A |
86+
| `gpt-5` | 10 K | 100 K | 3 K | 30 K |
87+
| `gpt-5-mini` | 1 K | 10 K | 300 | 3 K |
88+
| `gpt-5-nano` | 5 K | 150 K | 2 K | 50 K |
89+
| `gpt-5-chat` | 1 K | 5 K | N/A | N/A |
90+
| `gpt-5-codex` | 1 K | 10 K | N/A | N/A |
91+
8992

9093

9194
[!INCLUDE [Quota](./includes/global-batch-limits.md)]

0 commit comments

Comments
 (0)