Skip to content

Commit db553f3

Browse files
authored
Merge pull request #6428 from MicrosoftDocs/release-2025-openai-aug-vnext
[Azure OpenAI] [Release branch to main tracking branch]
2 parents a4f64e1 + 527b92f commit db553f3

File tree

9 files changed

+226
-54
lines changed

9 files changed

+226
-54
lines changed

articles/ai-foundry/openai/concepts/model-router.md

Lines changed: 7 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ description: Learn about the model router feature in Azure OpenAI in Azure AI Fo
55
author: PatrickFarley
66
ms.author: pafarley
77
manager: nitinme
8-
ms.date: 05/08/2025
8+
ms.date: 08/06/2025
99
ms.service: azure-ai-openai
1010
ms.topic: conceptual
1111
ms.custom:
@@ -28,10 +28,13 @@ If you select **Auto-update** at the deployment step (see [Manage models](/azure
2828

2929
## Underlying models
3030

31-
|Model router version|Underlying models (version)|
32-
|---|---|
33-
|`2025-05-19`|GPT-4.1 (`2025-04-14`)</br>GPT-4.1-mini (`2025-04-14`)</br>GPT-4.1-nano (`2025-04-14`) </br>o4-mini (`2025-04-16`) |
31+
> [!NOTE]
32+
> The latest version of model router is currently limited access only. You can request access using the `gpt-5 access` form: [gpt-5 limited access model application](https://aka.ms/oai/gpt5access). If you already have `o3 access` no request is required.
3433
34+
|Model router version|Underlying models| Model version
35+
|:---:|:---|:----:|
36+
| `2025-08-07` | `gpt-4.1` </br>`gpt-4.1-mini` </br>`gpt-4.1-nano` </br>`o4-mini` </br> `gpt-5` <br> `gpt-5-mini` <br> `gpt-5-nano` <br> `gpt-5-chat` | `2025-04-14` <br> `2025-04-14` <br> `2025-04-14` <br> `2025-04-16` <br> `2025-08-07` <br> `2025-08-07` <br> `2025-08-07` <br> `2025-08-07` |
37+
|`2025-05-19`| `gpt-4.1` </br>`gpt-4.1-mini` </br>`gpt-4.1-nano` </br>`o4-mini` | `2025-04-14` <br> `2025-04-14` <br> `2025-04-14` <br> `2025-04-16` |
3538

3639
## Limitations
3740

articles/ai-foundry/openai/concepts/models.md

Lines changed: 30 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ description: Learn about the model capabilities that are available with Azure Op
55
author: mrbullwinkle #ChrisHMSFT
66
ms.author: mbullwin #chrhoder#
77
manager: nitinme
8-
ms.date: 07/24/2025
8+
ms.date: 08/06/2025
99
ms.service: azure-ai-openai
1010
ms.topic: conceptual
1111
ms.custom:
@@ -22,6 +22,7 @@ Azure OpenAI is powered by a diverse set of models with different capabilities a
2222

2323
| Models | Description |
2424
|--|--|
25+
| [`GPT-5 series`](#gpt-5) | **NEW** `gpt-5`, `gpt-5-mini`, `gpt-5-nano`, `gpt-5-chat` |
2526
| [gpt-oss](#gpt-oss) | **NEW** open-weight reasoning models |
2627
| [`codex-mini`](#o-series-models) | Fine-tuned version of o4-mini. |
2728
| [`GPT-4.1 series`](#gpt-41-series) | The latest model release from Azure OpenAI. |
@@ -35,6 +36,30 @@ Azure OpenAI is powered by a diverse set of models with different capabilities a
3536
| [`Image generation`](#image-generation-models) | A series of models that can generate original images from natural language. |
3637
| [`Audio`](#audio-models) | A series of models for speech to text, translation, and text to speech. GPT-4o audio models support either low latency *speech in, speech out* conversational interactions or audio generation. |
3738

39+
## GPT-5
40+
41+
### Region availability
42+
43+
| Model | Region |
44+
|---|---|
45+
| `gpt-5` (2025-08-07) | East US 2 (Global Standard & Data Zones), Sweden Central (Global Standard & Data Zones)|
46+
| `gpt-5-mini` (2025-08-07) | East US 2 (Global Standard & Data Zones), Sweden Central (Global Standard & Data Zones)|
47+
| `gpt-5-nano` (2025-08-07) | East US 2 (Global Standard & Data Zones), Sweden Central (Global Standard & Data Zones)|
48+
| `gpt-5-chat` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
49+
50+
- **[Registration is required for access to the gpt-5 model](https://aka.ms/oai/gpt5access).**
51+
52+
- `gpt-5-mini`, `gpt-5-nano`, and `gpt-5-chat` do not require registration.
53+
54+
Access will be granted based on Microsoft's eligibility criteria. Customers who previously applied and received access to `o3`, don't need to reapply as their approved subscriptions will automatically be granted access upon model release.
55+
56+
| Model ID | Description | Context Window | Max Output Tokens | Training Data (up to) |
57+
| --- | :--- |:--- |:---|:---: |
58+
| `gpt-5` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md). | 272,000 | 128,000 | October 24, 2024 |
59+
| `gpt-5-mini` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md). | 272,000 | 128,000 | June 24, 2024 |
60+
| `gpt-5-nano` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md). | 272,000 | 128,000 | May 31, 2024 |
61+
| `gpt-5-chat` (2025-08-07)<br>**Preview** | - Chat Completions API. <br> - [Responses API](../how-to/responses.md). - Text only | 128,000 | 16,384 | October 24, 2024 |
62+
3863
## gpt-oss
3964

4065
### Region availability
@@ -108,13 +133,15 @@ A model that intelligently selects from a set of underlying chat models to respo
108133

109134
| Model | Region |
110135
|---|---|
111-
| `model-router` (2025-05-19) | East US 2 (Global Standard), Sweden Central (Global Standard)|
136+
| `model-router` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard) |
137+
| `model-router` (2025-05-19) | East US 2 (Global Standard), Sweden Central (Global Standard) |
112138

113139
### Capabilities
114140

115141
| Model ID | Description | Context window | Max output tokens | Training data (up to) |
116142
| --- | :--- |:--- |:---|:---: |
117-
| `model-router` (2025-05-19) | A model that intelligently selects from a set of underlying chat models to respond to a given prompt. | 200,000 | 32,768 (GPT-4.1 series)</br> 100,000 (o4-mini) | May 31, 2024 |
143+
| `model-router` (2025-08-07) | A model that intelligently selects from a set of underlying models to respond to a given prompt. | 200,000 | 32,768 (`GPT-4.1 series`)</br> 100,000 (`o4-mini`)</br> 128,000 (`gpt-5 reasoning models`) </br> 16,384 (`gpt-5-chat`) | - |
144+
| `model-router` (2025-05-19) | A model that intelligently selects from a set of underlying chat models to respond to a given prompt. | 200,000 | 32,768 (`GPT-4.1 series`)</br> 100,000 (`o4-mini`) | May 31, 2024 |
118145

119146
Larger context windows are compatible with *some* of the underlying models. That means an API call with a larger context succeeds only if the prompt happens to be routed to the right model. Otherwise, the call fails.
120147

articles/ai-foundry/openai/how-to/function-calling.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -41,6 +41,9 @@ At a high level you can break down working with functions into three steps:
4141
* `gpt-4o-mini` (`2024-07-18`)
4242
* `gpt-4.1` (`2025-04-14`)
4343
* `gpt-4.1-mini` (`2025-04-14`)
44+
* `gpt-5` (`2025-08-07`)
45+
* `gpt-5-mini` (`2025-08-07`)
46+
* `gpt-5-nano` (`2025-08-07`)
4447

4548
Support for parallel function was first added in API version [`2023-12-01-preview`](https://github.com/Azure/azure-rest-api-specs/blob/main/specification/cognitiveservices/data-plane/AzureOpenAI/inference/preview/2023-12-01-preview/inference.json)
4649

0 commit comments

Comments
 (0)