Commit 524a8ae

Merge pull request #6442 from mrbullwinkle/mrb_08_06_2025_vnext_007
[Azure OpenAI] [Release Branch] 007
2 parents 0bd2db1 + 07c7f07 commit 524a8ae

7 files changed, +46 -8 lines changed

articles/ai-foundry/openai/concepts/model-router.md

Lines changed: 3 additions & 1 deletion
@@ -28,12 +28,14 @@ If you select **Auto-update** at the deployment step (see [Manage models](/azure
 
 ## Underlying models
 
+> [!NOTE]
+> The latest version of model router is currently limited access only. You can request access using the `gpt-5 access` form: [gpt-5 limited access model application](https://aka.ms/oai/gpt5access). If you already have `o3 access` no request is required.
+
 |Model router version|Underlying models| Model version
 |:---:|:---|:----:|
 | `2025-08-07` | `gpt-4.1` </br>`gpt-4.1-mini` </br>`gpt-4.1-nano` </br>`o4-mini` </br> `gpt-5` <br> `gpt-5-mini` <br> `gpt-5-nano` <br> `gpt-5-chat` | `2025-04-14` <br> `2025-04-14` <br> `2025-04-14` <br> `2025-04-16` <br> `2025-08-07` <br> `2025-08-07` <br> `2025-08-07` <br> `2025-08-07` |
 |`2025-05-19`| `gpt-4.1` </br>`gpt-4.1-mini` </br>`gpt-4.1-nano` </br>`o4-mini` | `2025-04-14` <br> `2025-04-14` <br> `2025-04-14` <br> `2025-04-16` |
 
-
 ## Limitations
 
 ### Resource limitations
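
Reviewer illustration (not part of the commit): the model router version table in this hunk can be read as a mapping from router version to underlying models. The sketch below transcribes the table rows into a dictionary and computes which underlying models the `2025-08-07` router version adds over `2025-05-19`. The names `MODEL_ROUTER_VERSIONS` and `models_added` are hypothetical and do not correspond to any Azure SDK API.

```python
# Hypothetical mapping transcribed from the model router table in this diff:
# router version -> {underlying model name: underlying model version}.
MODEL_ROUTER_VERSIONS = {
    "2025-08-07": {
        "gpt-4.1": "2025-04-14",
        "gpt-4.1-mini": "2025-04-14",
        "gpt-4.1-nano": "2025-04-14",
        "o4-mini": "2025-04-16",
        "gpt-5": "2025-08-07",
        "gpt-5-mini": "2025-08-07",
        "gpt-5-nano": "2025-08-07",
        "gpt-5-chat": "2025-08-07",
    },
    "2025-05-19": {
        "gpt-4.1": "2025-04-14",
        "gpt-4.1-mini": "2025-04-14",
        "gpt-4.1-nano": "2025-04-14",
        "o4-mini": "2025-04-16",
    },
}


def models_added(newer: str, older: str) -> list[str]:
    """Return underlying models listed for `newer` but not for `older`, sorted by name."""
    return sorted(set(MODEL_ROUTER_VERSIONS[newer]) - set(MODEL_ROUTER_VERSIONS[older]))


if __name__ == "__main__":
    print(models_added("2025-08-07", "2025-05-19"))
```
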

articles/ai-foundry/openai/concepts/models.md

Lines changed: 3 additions & 3 deletions
@@ -42,9 +42,9 @@ Azure OpenAI is powered by a diverse set of models with different capabilities a
 
 | Model | Region |
 |---|---|
-| `gpt-5` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
-| `gpt-5-mini` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
-| `gpt-5-nano` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
+| `gpt-5` (2025-08-07) | East US 2 (Global Standard & Data Zones), Sweden Central (Global Standard & Data Zones)|
+| `gpt-5-mini` (2025-08-07) | East US 2 (Global Standard & Data Zones), Sweden Central (Global Standard & Data Zones)|
+| `gpt-5-nano` (2025-08-07) | East US 2 (Global Standard & Data Zones), Sweden Central (Global Standard & Data Zones)|
 | `gpt-5-chat` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
 
 - **[Registration is required for access to the gpt-5 model](https://aka.ms/oai/gpt5access).**

articles/ai-foundry/openai/how-to/function-calling.md

Lines changed: 3 additions & 0 deletions
@@ -41,6 +41,9 @@ At a high level you can break down working with functions into three steps:
 * `gpt-4o-mini` (`2024-07-18`)
 * `gpt-4.1` (`2025-04-14`)
 * `gpt-4.1-mini` (`2025-04-14`)
+* `gpt-5` (`2025-08-07`)
+* `gpt-5-mini` (`2025-08-07`)
+* `gpt-5-nano` (`2025-08-07`)
 
 Support for parallel function was first added in API version [`2023-12-01-preview`](https://github.com/Azure/azure-rest-api-specs/blob/main/specification/cognitiveservices/data-plane/AzureOpenAI/inference/preview/2023-12-01-preview/inference.json)
 
articles/ai-foundry/openai/how-to/reasoning.md

Lines changed: 3 additions & 3 deletions
@@ -28,9 +28,9 @@ Azure OpenAI reasoning models are designed to tackle reasoning and problem-solvi
 
 | Model | Region | Limited access |
 |---|---|---|
-| `gpt-5` | East US2 & Sweden Central (Global Standard) | Request access: [gpt-5 limited access model application](https://aka.ms/oai/gpt5access). If you already have `o3 access` no request is required |
-| `gpt-5-mini` | East US2 & Sweden Central (Global Standard) | No access request needed to use the core capabilities of this model |
-| `gpt-5-nano` | East US2 & Sweden Central (Global Standard) | No access request needed to use the core capabilities of this model |
+| `gpt-5` | East US2 & Sweden Central (Global Standard & Data Zones) | Request access: [gpt-5 limited access model application](https://aka.ms/oai/gpt5access). If you already have `o3 access` no request is required |
+| `gpt-5-mini` | East US2 & Sweden Central (Global Standard & Data Zones) | No access request needed to use the core capabilities of this model |
+| `gpt-5-nano` | East US2 & Sweden Central (Global Standard & Data Zones) | No access request needed to use the core capabilities of this model |
 | `o3-pro` | East US2 & Sweden Central (Global Standard) | Request access: [o3 limited access model application](https://aka.ms/oai/o3access). If you already have `o3 access` no request is required. |
 | `codex-mini` | East US2 & Sweden Central (Global Standard) | No access request needed. |
 | `o4-mini` | [Model availability](../concepts/models.md#global-standard-model-availability) | No access request needed to use the core capabilities of this model.<br><br> Request access: [o4-mini reasoning summary feature](https://aka.ms/oai/o3access) |

articles/ai-foundry/openai/how-to/structured-outputs.md

Lines changed: 3 additions & 0 deletions
@@ -25,6 +25,9 @@ Structured outputs make a model follow a [JSON Schema](https://json-schema.org/o
 
 ## Supported models
 
+- `gpt-5` version `2025-08-07`
+- `gpt-5-mini` version `2025-08-07`
+- `gpt-5-nano` version `2025-08-07`
 - `codex-mini` version `2025-05-16`
 - `o3-pro` version `2025-06-10`
 - `o3-mini` version `2025-01-31`

articles/ai-foundry/openai/quotas-limits.md

Lines changed: 11 additions & 1 deletion
@@ -4,7 +4,7 @@ description: This article features detailed descriptions and best practices on t
 author: mrbullwinkle
 ms.author: mbullwin
 manager: nitinme
-ms.date: 07/11/2025
+ms.date: 08/07/2025
 ms.service: azure-ai-openai
 ms.topic: conceptual
 ms.custom:
@@ -71,6 +71,16 @@ The following section provides you with a quick guide to the default quotas and
 > [!NOTE]
 > Quota limits are subject to change.
 
+## GPT-5 Series
+
+| Model | Global Default<br>Tokens per minute (TPM) | Global Enterprise and MCA-E <br>Tokens per minute (TPM) | Data Zone Default <br>Tokens per minute (TPM) | Data Zone Enterprise and MCA-E <br>Tokens per minute (TPM) |
+|-------------|----------------|-------------------|-------------------|----------------------|
+| gpt-5 | 1 M | 10 M | 300 K | 3 M |
+| gpt-5-mini | 1 M | 10 M | 300 K | 3 M |
+| gpt-5-nano | 5 M | 150 M | 2 M | 50 M |
+| gpt-5-chat | 1 M | 5 M | N/A | N/A |
+
+
 [!INCLUDE [Quota](./includes/global-batch-limits.md)]
 
 ## gpt-oss
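
Reviewer illustration (not part of the commit): the GPT-5 Series quota table added in the hunk above uses shorthand such as `1 M` and `300 K` for tokens per minute. The sketch below shows one way to turn those cells into integers for a client-side lookup. The tier keys, `parse_tpm`, and `tpm_quota` are hypothetical names chosen for this example; the values are transcribed from the table and, as the note in the diff says, are subject to change.

```python
from typing import Optional

# Multipliers for the shorthand used in the quota table ("300 K", "1 M").
_UNITS = {"K": 1_000, "M": 1_000_000}

# Hypothetical tier keys, in the same order as the table's four TPM columns.
TIERS = ("global_default", "global_enterprise", "data_zone_default", "data_zone_enterprise")

# Rows transcribed from the GPT-5 Series table in this diff.
_ROWS = {
    "gpt-5": ("1 M", "10 M", "300 K", "3 M"),
    "gpt-5-mini": ("1 M", "10 M", "300 K", "3 M"),
    "gpt-5-nano": ("5 M", "150 M", "2 M", "50 M"),
    "gpt-5-chat": ("1 M", "5 M", "N/A", "N/A"),
}


def parse_tpm(cell: str) -> Optional[int]:
    """Convert a table cell such as '300 K' to an integer TPM; 'N/A' becomes None."""
    cell = cell.strip()
    if cell == "N/A":
        return None
    number, unit = cell.split()
    return int(number) * _UNITS[unit]


def tpm_quota(model: str, tier: str) -> Optional[int]:
    """Look up the default TPM quota for a model and tier, or None if not offered."""
    return parse_tpm(_ROWS[model][TIERS.index(tier)])


if __name__ == "__main__":
    for model in _ROWS:
        print(model, {tier: tpm_quota(model, tier) for tier in TIERS})
```
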

articles/ai-foundry/openai/whats-new.md

Lines changed: 20 additions & 0 deletions
@@ -18,6 +18,26 @@ ms.custom:
 
 This article provides a summary of the latest releases and major documentation updates for Azure OpenAI.
 
+## August 2025
+
+### GPT-5 models available
+
+- `gpt-5`, `gpt-5-mini`, `gpt-5-nano` To learn more, see the [getting started with reasoning models page](./how-to/reasoning.md).
+- `gpt-5-chat` is now available. To learn more, see the [models page](./concepts/models.md)
+
+- **[Registration is required for access to the gpt-5 model](https://aka.ms/oai/gpt5access).**
+
+- `gpt-5-mini`, `gpt-5-nano`, and `gpt-5-chat` do not require registration.
+
+### New version of model-router
+
+- Model router nows supports GPT-5 series models.
+
+-The latest version of model router is currently limited access only. You can request access using the `gpt-5 access` form: [gpt-5 limited access model application](https://aka.ms/oai/gpt5access). If you already have `o3 access` no request is required.
+
+- Model router for Azure AI Foundry is a deployable AI chat model that automatically selects the best underlying chat model to respond to a given prompt. For more information on how model router works and its advantages and limitations, see the [Model router concepts guide](./concepts/model-router.md). To use model router with the Completions API, follow the [How-to guide](./concepts/model-router.md).
+
+
 ## July 2025
 
 ### GPT-image-1 update (preview)
