Commit 5768a41

update limit info
1 parent 8543eca commit 5768a41

3 files changed (+6 -4 lines changed)

articles/ai-services/openai/concepts/model-router.md

Lines changed: 1 addition & 1 deletion
@@ -29,7 +29,7 @@ If you select **Auto-update** at the deployment step (see [Manage models](/azure
 
 |Model router version|Underlying models (version)|
 |---|---|
-|`2025-04-15`|GPT-4o (`2025-05-13`)</br>GPT-4o-mini (`2024-07-18`)</br>o3-mini (`2025-01-31`) |
+|`2025-04-15`|GPT-4.1 (`2025-04-14`)</br>GPT-4.1-mini (`2025-04-14`)</br>GPT-4.1-nano (`2025-04-14`) </br>o4-mini (`2025-04-16`) |
 
 
 ## Limitations

articles/ai-services/openai/concepts/models.md

Lines changed: 2 additions & 2 deletions
@@ -56,13 +56,13 @@ A model that intelligently selects from a set of underlying chat models to respo
 
 | Model | Region |
 |---|---|
-| `model-router` (2025-04-15) | East US2 (Global Standard), Sweden Central (Global Standard)|
+| `model-router` (2025-04-15) | East US 2 (Global Standard), Sweden Central (Global Standard)|
 
 ### Capabilities
 
 | Model ID | Description | Context Window | Max Output Tokens | Training Data (up to) |
 | --- | :--- |:--- |:---|:---: |
-| `model-router` (2025-04-15) | A model that intelligently selects from a set of underlying chat models to respond to a given prompt. <br> -TBD | 128,000 | 4096 | N/A |
+| `model-router` (2025-04-15) | A model that intelligently selects from a set of underlying chat models to respond to a given prompt. | 128,000 | 32768 (GPT 4.1 series)</br> 100 K (o4-mini) | May 31, 2024 |
 
 ## computer-use-preview

articles/ai-services/openai/quotas-limits.md

Lines changed: 3 additions & 1 deletion
@@ -94,7 +94,9 @@ The following sections provide you with a quick guide to the default quotas and
 
 | Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
 |---|---|:---:|:---:|
-| `model-router` (2025-04-15) | Default | 128 K | TBD |
+| `model-router` (2025-04-15) | Enterprise Tier | 10 M | 10 K |
+| `model-router` (2025-04-15) | Default | 1 M | 1 K |
+
 
 ## computer-use-preview global standard rate limits
 
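For readers checking a workload against the updated `model-router` limits, the arithmetic can be sketched as below. This is a minimal illustration, not part of the commit: the `LIMITS` table and the `fits_quota` helper are hypothetical names, the values are copied from the new rows in `quotas-limits.md`, and it assumes "K" means 1,000 and "M" means 1,000,000. How the service actually counts tokens toward TPM is not described in this diff, so the sketch simply multiplies requests by an average token count.

```python
# Limits copied from the updated quotas table for model-router (2025-04-15).
# Assumption: "K" = 1,000 and "M" = 1,000,000.
LIMITS = {
    "Default": {"tpm": 1_000_000, "rpm": 1_000},
    "Enterprise Tier": {"tpm": 10_000_000, "rpm": 10_000},
}


def fits_quota(tier, requests_per_minute, avg_tokens_per_request):
    """Return True if the projected load stays within both the RPM and TPM limits.

    Hypothetical helper: token usage is approximated as
    requests_per_minute * avg_tokens_per_request.
    """
    limit = LIMITS[tier]
    tokens_per_minute = requests_per_minute * avg_tokens_per_request
    return (
        requests_per_minute <= limit["rpm"]
        and tokens_per_minute <= limit["tpm"]
    )


if __name__ == "__main__":
    # 500 requests/min at ~3,000 tokens each is 1.5 M tokens/min.
    print(fits_quota("Default", 500, 3000))
    print(fits_quota("Enterprise Tier", 500, 3000))
```

Under these assumptions, a load of 500 requests per minute at roughly 3,000 tokens each exceeds the Default TPM limit but fits within the Enterprise Tier.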