Merge pull request #6427 from mrbullwinkle/mrb_08_06_2025_vnext_001

prmerger-automator[bot] · web-flow · commit b7d9ff916784 · 2025-08-07T03:26:25.000Z
[Azure OpenAI] [Release Branch] Update 001
diff --git a/articles/ai-foundry/openai/concepts/models.md b/articles/ai-foundry/openai/concepts/models.md
@@ -5,7 +5,7 @@ description: Learn about the model capabilities that are available with Azure Op
 author: mrbullwinkle #ChrisHMSFT
 ms.author: mbullwin #chrhoder#
 manager: nitinme
-ms.date: 07/24/2025
+ms.date: 08/06/2025
 ms.service: azure-ai-openai
 ms.topic: conceptual
 ms.custom:
@@ -22,6 +22,7 @@ Azure OpenAI is powered by a diverse set of models with different capabilities a
 
 | Models | Description |
 |--|--|
+| [`GPT-5 series`](#gpt-5) | **NEW** `gpt-5`, `gpt-5-mini`, `gpt-5-nano`, `gpt-5-chat` |
 | [gpt-oss](#gpt-oss) | **NEW** open-weight reasoning models |
 | [`codex-mini`](#o-series-models) | Fine-tuned version of o4-mini. |  
 | [`GPT-4.1 series`](#gpt-41-series) | The latest model release from Azure OpenAI. |
@@ -35,6 +36,28 @@ Azure OpenAI is powered by a diverse set of models with different capabilities a
 | [`Image generation`](#image-generation-models) | A series of models that can generate original images from natural language. |
 | [`Audio`](#audio-models) | A series of models for speech to text, translation, and text to speech. GPT-4o audio models support either low latency *speech in, speech out* conversational interactions or audio generation. |
 
+## GPT-5
+
+### Region availability
+
+| Model | Region |
+|---|---|
+| `gpt-5` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
+| `gpt-5-mini` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
+| `gpt-5-nano` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
+| `gpt-5-chat` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
+
+ **[Registration is required for access to GPT-5 series models](https://aka.ms/oai/gpt5access).**
+
+ Access will be granted based on Microsoft's eligibility criteria. Customers who previously applied for and received access to `o3` don't need to reapply; their approved subscriptions will automatically be granted access upon model release.
+
+|  Model ID  | Description | Context Window | Max Output Tokens | Training Data (up to)  |
+|  --- |  :--- |:--- |:---|:---: |
+| `gpt-5` (2025-08-07) |  - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md).  | 272,000  | 128,000 | - |
+| `gpt-5-mini` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md).     | 272,000  | 128,000  | - |
+| `gpt-5-nano` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md).     | 272,000 | 128,000 | - |
+| `gpt-5-chat` (2025-08-07) | - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. | 128,000 | 16,000 | - |
+
 ## gpt-oss
 
 ### Region availability
@@ -108,13 +131,15 @@ A model that intelligently selects from a set of underlying chat models to respo
 
 | Model | Region |
 |---|---|
-| `model-router` (2025-05-19) | East US 2 (Global Standard), Sweden Central (Global Standard)|
+| `model-router` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard) |
+| `model-router` (2025-05-19) | East US 2 (Global Standard), Sweden Central (Global Standard) |
 
 ### Capabilities
 
 |  Model ID  | Description | Context window | Max output tokens | Training data (up to)  |
 |  --- |  :--- |:--- |:---|:---: |
-| `model-router` (2025-05-19) | A model that intelligently selects from a set of underlying chat models to respond to a given prompt. | 200,000 | 32,768 (GPT-4.1 series)</br> 100,000 (o4-mini) | May 31, 2024 |
+| `model-router` (2025-08-07) | A model that intelligently selects from a set of underlying models to respond to a given prompt. | 200,000 | 32,768 (`GPT-4.1 series`)<br> 100,000 (`o4-mini`)<br> 128,000 (`gpt-5 reasoning models`)<br> 16,000 (`gpt-5-chat`) | - |
+| `model-router` (2025-05-19) | A model that intelligently selects from a set of underlying chat models to respond to a given prompt. | 200,000 | 32,768 (`GPT-4.1 series`)<br> 100,000 (`o4-mini`) | May 31, 2024 |
 
 Larger context windows are compatible with *some* of the underlying models. That means an API call with a larger context succeeds only if the prompt happens to be routed to the right model. Otherwise, the call fails.
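The routing caveat above can be illustrated with the context-window figures from the GPT-5 capabilities table in this diff. The following is a minimal sketch, not the router's actual logic: the helper names `models_that_fit` and `call_succeeds` are hypothetical, the listed context window is assumed to bound the prompt size, and the GPT-4.1 and o4-mini limits are left out because this diff does not state them.

```python
# Context-window limits copied from the GPT-5 capabilities table in this diff.
# Assumption: the listed figure is the largest prompt (in tokens) a model accepts.
CONTEXT_WINDOW = {
    "gpt-5": 272_000,
    "gpt-5-mini": 272_000,
    "gpt-5-nano": 272_000,
    "gpt-5-chat": 128_000,
}


def models_that_fit(prompt_tokens: int) -> list[str]:
    """Return the models whose listed context window can hold the prompt."""
    return [
        model
        for model, limit in CONTEXT_WINDOW.items()
        if prompt_tokens <= limit
    ]


def call_succeeds(routed_model: str, prompt_tokens: int) -> bool:
    """Hypothetical check: does the prompt fit the model it was routed to?"""
    return prompt_tokens <= CONTEXT_WINDOW[routed_model]


if __name__ == "__main__":
    prompt = 150_000
    print(models_that_fit(prompt))
    print(call_succeeds("gpt-5", prompt))
    print(call_succeeds("gpt-5-chat", prompt))
```

Under these assumptions, a 150,000-token prompt fits the three reasoning models but not `gpt-5-chat`, so the same request would succeed or fail depending on where it is routed.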