Skip to content

Commit b7d9ff9

Browse files
Merge pull request #6427 from mrbullwinkle/mrb_08_06_2025_vnext_001
[Azure OpenAI] [Release Branch] Update 001
2 parents b851d0a + 413d84b commit b7d9ff9

File tree

1 file changed

+28
-3
lines changed

1 file changed

+28
-3
lines changed

articles/ai-foundry/openai/concepts/models.md

Lines changed: 28 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ description: Learn about the model capabilities that are available with Azure Op
55
author: mrbullwinkle #ChrisHMSFT
66
ms.author: mbullwin #chrhoder#
77
manager: nitinme
8-
ms.date: 07/24/2025
8+
ms.date: 08/06/2025
99
ms.service: azure-ai-openai
1010
ms.topic: conceptual
1111
ms.custom:
@@ -22,6 +22,7 @@ Azure OpenAI is powered by a diverse set of models with different capabilities a
2222

2323
| Models | Description |
2424
|--|--|
25+
| [`GPT-5 series`](#gpt-5) | **NEW** `gpt-5`, `gpt-5-mini`, `gpt-5-nano`, `gpt-5-chat` |
2526
| [gpt-oss](#gpt-oss) | **NEW** open-weight reasoning models |
2627
| [`codex-mini`](#o-series-models) | Fine-tuned version of o4-mini. |
2728
| [`GPT-4.1 series`](#gpt-41-series) | The latest model release from Azure OpenAI. |
@@ -35,6 +36,28 @@ Azure OpenAI is powered by a diverse set of models with different capabilities a
3536
| [`Image generation`](#image-generation-models) | A series of models that can generate original images from natural language. |
3637
| [`Audio`](#audio-models) | A series of models for speech to text, translation, and text to speech. GPT-4o audio models support either low latency *speech in, speech out* conversational interactions or audio generation. |
3738

39+
## GPT-5
40+
41+
### Region availability
42+
43+
| Model | Region |
44+
|---|---|
45+
| `gpt-5` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
46+
| `gpt-5-mini` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
47+
| `gpt-5-nano` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
48+
| `gpt-5-chat` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
49+
50+
**[Registration is required for access to GPT-5 series models](https://aka.ms/oai/gpt5access).**
51+
52+
Access will be granted based on Microsoft's eligibility criteria. Customers who previously applied and received access to `o3`, don't need to reapply as their approved subscriptions will automatically be granted access upon model release.
53+
54+
| Model ID | Description | Context Window | Max Output Tokens | Training Data (up to) |
55+
| --- | :--- |:--- |:---|:---: |
56+
| `gpt-5` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md). | 272,000 | 128,000 | - |
57+
| `gpt-5-mini` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md). | 272,000 | 128,000 | - |
58+
| `gpt-5-nano` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md). | 272,000 | 128,000 | - |
59+
| `gpt-5-chat` (2025-08-07) | - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. | 128,000 | 16,000 | - |
60+
3861
## gpt-oss
3962

4063
### Region availability
@@ -108,13 +131,15 @@ A model that intelligently selects from a set of underlying chat models to respo
108131

109132
| Model | Region |
110133
|---|---|
111-
| `model-router` (2025-05-19) | East US 2 (Global Standard), Sweden Central (Global Standard)|
134+
| `model-router` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard) |
135+
| `model-router` (2025-05-19) | East US 2 (Global Standard), Sweden Central (Global Standard) |
112136

113137
### Capabilities
114138

115139
| Model ID | Description | Context window | Max output tokens | Training data (up to) |
116140
| --- | :--- |:--- |:---|:---: |
117-
| `model-router` (2025-05-19) | A model that intelligently selects from a set of underlying chat models to respond to a given prompt. | 200,000 | 32,768 (GPT-4.1 series)</br> 100,000 (o4-mini) | May 31, 2024 |
141+
| `model-router` (2025-08-07) | A model that intelligently selects from a set of underlying models to respond to a given prompt. | 200,000 | 32,768 (`GPT-4.1 series`)</br> 100,000 (`o4-mini`)</br> 128,000 (`gpt-5 reasoning models`) <br> 16,000 (`gpt-5-chat`) | - |
142+
| `model-router` (2025-05-19) | A model that intelligently selects from a set of underlying chat models to respond to a given prompt. | 200,000 | 32,768 (`GPT-4.1 series`)</br> 100,000 (`o4-mini`) | May 31, 2024 |
118143

119144
Larger context windows are compatible with *some* of the underlying models. That means an API call with a larger context succeeds only if the prompt happens to be routed to the right model. Otherwise, the call fails.
120145

0 commit comments

Comments
 (0)