MicrosoftDocs
diff --git a/‎articles/ai-foundry/openai/concepts/model-router.md‎
Lines changed: 7 additions & 4 deletions b/‎articles/ai-foundry/openai/concepts/model-router.md‎
Lines changed: 7 additions & 4 deletions
diff --git a/‎articles/ai-foundry/openai/concepts/models.md‎
Lines changed: 30 additions & 3 deletions b/‎articles/ai-foundry/openai/concepts/models.md‎
Lines changed: 30 additions & 3 deletions
diff --git a/‎articles/ai-foundry/openai/how-to/function-calling.md‎
Lines changed: 3 additions & 0 deletions b/‎articles/ai-foundry/openai/how-to/function-calling.md‎
Lines changed: 3 additions & 0 deletions
@@ -5,7 +5,7 @@ description: Learn about the model router feature in Azure OpenAI in Azure AI Fo
 author: PatrickFarley
 ms.author: pafarley
 manager: nitinme
-ms.date: 05/08/2025
+ms.date: 08/06/2025
 ms.service: azure-ai-openai
 ms.topic: conceptual
 ms.custom:
@@ -28,10 +28,13 @@ If you select **Auto-update** at the deployment step (see [Manage models](/azure
 
 ## Underlying models
 
-|Model router version|Underlying models (version)|
-|---|---|
-|`2025-05-19`|GPT-4.1 (`2025-04-14`)</br>GPT-4.1-mini (`2025-04-14`)</br>GPT-4.1-nano (`2025-04-14`) </br>o4-mini (`2025-04-16`) |
+> [!NOTE]
+> The latest version of model router is currently limited access only. You can request access using the `gpt-5 access` form: [gpt-5 limited access model application](https://aka.ms/oai/gpt5access). If you already have `o3 access` no request is required.
 
+|Model router version|Underlying models| Model version
+|:---:|:---|:----:|
+| `2025-08-07` | `gpt-4.1` </br>`gpt-4.1-mini` </br>`gpt-4.1-nano` </br>`o4-mini` </br> `gpt-5`  <br> `gpt-5-mini`  <br> `gpt-5-nano` <br> `gpt-5-chat`   | `2025-04-14` <br> `2025-04-14` <br> `2025-04-14` <br> `2025-04-16` <br> `2025-08-07` <br> `2025-08-07` <br> `2025-08-07` <br> `2025-08-07` |
+|`2025-05-19`| `gpt-4.1` </br>`gpt-4.1-mini` </br>`gpt-4.1-nano` </br>`o4-mini`  |  `2025-04-14` <br> `2025-04-14` <br> `2025-04-14` <br> `2025-04-16` |
 
 ## Limitations
 
 
@@ -5,7 +5,7 @@ description: Learn about the model capabilities that are available with Azure Op
 author: mrbullwinkle #ChrisHMSFT
 ms.author: mbullwin #chrhoder#
 manager: nitinme
-ms.date: 07/24/2025
+ms.date: 08/06/2025
 ms.service: azure-ai-openai
 ms.topic: conceptual
 ms.custom:
@@ -22,6 +22,7 @@ Azure OpenAI is powered by a diverse set of models with different capabilities a
 
 | Models | Description |
 |--|--|
+| [`GPT-5 series`](#gpt-5) | **NEW** `gpt-5`, `gpt-5-mini`, `gpt-5-nano`, `gpt-5-chat` |
 | [gpt-oss](#gpt-oss) | **NEW** open-weight reasoning models |
 | [`codex-mini`](#o-series-models) | Fine-tuned version of o4-mini. |  
 | [`GPT-4.1 series`](#gpt-41-series) | The latest model release from Azure OpenAI. |
@@ -35,6 +36,30 @@ Azure OpenAI is powered by a diverse set of models with different capabilities a
 | [`Image generation`](#image-generation-models) | A series of models that can generate original images from natural language. |
 | [`Audio`](#audio-models) | A series of models for speech to text, translation, and text to speech. GPT-4o audio models support either low latency *speech in, speech out* conversational interactions or audio generation. |
 
+## GPT-5
+
+### Region availability
+
+| Model | Region |
+|---|---|
+| `gpt-5` (2025-08-07) | East US 2 (Global Standard & Data Zones), Sweden Central (Global Standard & Data Zones)|
+| `gpt-5-mini` (2025-08-07) | East US 2 (Global Standard & Data Zones), Sweden Central (Global Standard & Data Zones)|
+| `gpt-5-nano` (2025-08-07) | East US 2 (Global Standard & Data Zones), Sweden Central (Global Standard & Data Zones)|
+| `gpt-5-chat` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
+
+- **[Registration is required for access to the gpt-5 model](https://aka.ms/oai/gpt5access).**
+
+- `gpt-5-mini`, `gpt-5-nano`, and `gpt-5-chat` do not require registration.
+
+ Access will be granted based on Microsoft's eligibility criteria. Customers who previously applied and received access to `o3`, don't need to reapply as their approved subscriptions will automatically be granted access upon model release.
+
+|  Model ID  | Description | Context Window | Max Output Tokens | Training Data (up to)  |
+|  --- |  :--- |:--- |:---|:---: |
+| `gpt-5` (2025-08-07) |  - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md).  | 272,000  | 128,000 | October 24, 2024 |
+| `gpt-5-mini` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md).     | 272,000  | 128,000  | June 24, 2024 |
+| `gpt-5-nano` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md).     | 272,000 | 128,000 | May 31, 2024 |
+| `gpt-5-chat` (2025-08-07)<br>**Preview** | - Chat Completions API. <br> - [Responses API](../how-to/responses.md). - Text only  | 128,000 | 16,384 | October 24, 2024 |
+
 ## gpt-oss
 
 ### Region availability
@@ -108,13 +133,15 @@ A model that intelligently selects from a set of underlying chat models to respo
 
 | Model | Region |
 |---|---|
-| `model-router` (2025-05-19) | East US 2 (Global Standard), Sweden Central (Global Standard)|
+| `model-router` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard) |
+| `model-router` (2025-05-19) | East US 2 (Global Standard), Sweden Central (Global Standard) |
 
 ### Capabilities
 
 |  Model ID  | Description | Context window | Max output tokens | Training data (up to)  |
 |  --- |  :--- |:--- |:---|:---: |
-| `model-router` (2025-05-19) | A model that intelligently selects from a set of underlying chat models to respond to a given prompt. | 200,000 | 32,768 (GPT-4.1 series)</br> 100,000 (o4-mini) | May 31, 2024 |
+| `model-router` (2025-08-07) | A model that intelligently selects from a set of underlying  models to respond to a given prompt. | 200,000 | 32,768 (`GPT-4.1 series`)</br> 100,000 (`o4-mini`)</br> 128,000 (`gpt-5 reasoning models`) </br> 16,384 (`gpt-5-chat`) | - |
+| `model-router` (2025-05-19) | A model that intelligently selects from a set of underlying chat models to respond to a given prompt. | 200,000 | 32,768 (`GPT-4.1 series`)</br> 100,000 (`o4-mini`) | May 31, 2024 |
 
 Larger context windows are compatible with *some* of the underlying models. That means an API call with a larger context succeeds only if the prompt happens to be routed to the right model. Otherwise, the call fails.
 
 
@@ -41,6 +41,9 @@ At a high level you can break down working with functions into three steps:
 * `gpt-4o-mini` (`2024-07-18`)
 * `gpt-4.1` (`2025-04-14`)
 * `gpt-4.1-mini` (`2025-04-14`)
+* `gpt-5` (`2025-08-07`)
+* `gpt-5-mini` (`2025-08-07`)
+* `gpt-5-nano` (`2025-08-07`)
 
 Support for parallel function was first added in API version [`2023-12-01-preview`](https://github.com/Azure/azure-rest-api-specs/blob/main/specification/cognitiveservices/data-plane/AzureOpenAI/inference/preview/2023-12-01-preview/inference.json)