Commit a48b13a

Merge pull request #6430 from mrbullwinkle/mrb_08_06_2025_vnext_002
[Azure OpenAI] [Release Branch] Model Router updates
2 parents b7d9ff9 + 0803e4e commit a48b13a

2 files changed: +6 -5 lines changed


articles/ai-foundry/openai/concepts/model-router.md

Lines changed: 5 additions & 4 deletions
@@ -5,7 +5,7 @@ description: Learn about the model router feature in Azure OpenAI in Azure AI Fo
 author: PatrickFarley
 ms.author: pafarley
 manager: nitinme
-ms.date: 05/08/2025
+ms.date: 08/06/2025
 ms.service: azure-ai-openai
 ms.topic: conceptual
 ms.custom:
@@ -28,9 +28,10 @@ If you select **Auto-update** at the deployment step (see [Manage models](/azure

 ## Underlying models

-|Model router version|Underlying models (version)|
-|---|---|
-|`2025-05-19`|GPT-4.1 (`2025-04-14`)</br>GPT-4.1-mini (`2025-04-14`)</br>GPT-4.1-nano (`2025-04-14`) </br>o4-mini (`2025-04-16`) |
+|Model router version|Underlying models| Model version
+|:---:|:---|:----:|
+| `2025-08-07` | `gpt-4.1` </br>`gpt-4.1-mini` </br>`gpt-4.1-nano` </br>`o4-mini` </br> `gpt-5` <br> `gpt-5-mini` <br> `gpt-5-nano` | `2025-04-14` <br> `2025-04-14` <br> `2025-04-14` <br> `2025-04-16` <br> `2025-08-07` <br> `2025-08-07` <br> `2025-08-07` |
+|`2025-05-19`| `gpt-4.1` </br>`gpt-4.1-mini` </br>`gpt-4.1-nano` </br>`o4-mini` | `2025-04-14` <br> `2025-04-14` <br> `2025-04-14` <br> `2025-04-16` |


 ## Limitations

articles/ai-foundry/openai/concepts/models.md

Lines changed: 1 addition & 1 deletion
@@ -138,7 +138,7 @@ A model that intelligently selects from a set of underlying chat models to respo

 | Model ID | Description | Context window | Max output tokens | Training data (up to) |
 | --- | :--- |:--- |:---|:---: |
-| `model-router` (2025-08-07) | A model that intelligently selects from a set of underlying models to respond to a given prompt. | 200,000 | 32,768 (`GPT-4.1 series`)</br> 100,000 (`o4-mini`)</br> 128,000 (`gpt-5 reasoning models`) <br> 16,000 (`gpt-5-chat`) | - |
+| `model-router` (2025-08-07) | A model that intelligently selects from a set of underlying models to respond to a given prompt. | 200,000 | 32,768 (`GPT-4.1 series`)</br> 100,000 (`o4-mini`)</br> 128,000 (`gpt-5 reasoning models`) | - |
 | `model-router` (2025-05-19) | A model that intelligently selects from a set of underlying chat models to respond to a given prompt. | 200,000 | 32,768 (`GPT-4.1 series`)</br> 100,000 (`o4-mini`) | May 31, 2024 |

 Larger context windows are compatible with *some* of the underlying models. That means an API call with a larger context succeeds only if the prompt happens to be routed to the right model. Otherwise, the call fails.
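
Note on the changed `model-router` rows: the "Max output tokens" cell lists a different limit per underlying model family, so the effective limit depends on where a prompt is routed. A minimal sketch of how a reader might reason about those rows is given here. The limits are copied from the post-change table in this diff; the dictionary layout and both function names are hypothetical and are not part of any Azure SDK or API.

```python
# Max output tokens per model family, copied from the post-change
# `model-router` rows in the diff. The structure itself is illustrative only.
MAX_OUTPUT_TOKENS = {
    "2025-08-07": {
        "GPT-4.1 series": 32_768,
        "o4-mini": 100_000,
        "gpt-5 reasoning models": 128_000,
    },
    "2025-05-19": {
        "GPT-4.1 series": 32_768,
        "o4-mini": 100_000,
    },
}


def conservative_output_cap(router_version: str) -> int:
    """Return the largest output size covered by every listed family's limit."""
    return min(MAX_OUTPUT_TOKENS[router_version].values())


def families_that_fit(router_version: str, requested_tokens: int) -> list[str]:
    """Return the families whose documented limit covers the requested size."""
    limits = MAX_OUTPUT_TOKENS[router_version]
    return [name for name, limit in limits.items() if requested_tokens <= limit]


if __name__ == "__main__":
    print(conservative_output_cap("2025-08-07"))
    print(families_that_fit("2025-08-07", 110_000))
```

Taking the minimum across families is the cautious choice under the assumption that the caller cannot control which underlying model the router picks.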
