
Commit 9e37e70

Merge pull request #4599 from aahill/may-model-update
updating models
2 parents 876b5c6 + 1cf7ff8 commit 9e37e70


4 files changed, +81 -79 lines changed


articles/ai-services/openai/how-to/provisioned-throughput-onboarding.md

Lines changed: 8 additions & 8 deletions
@@ -81,15 +81,15 @@ The amount of throughput (measured in tokens per minute or TPM) a deployment get
 
 For example, for `gpt-4.1:2025-04-14`, 1 output token counts as 4 input tokens towards your utilization limit which matches the [pricing](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/). Older models use a different ratio and for a deeper understanding on how different ratios of input and output tokens impact the throughput your workload needs, see the [Azure OpenAI capacity calculator](https://ai.azure.com/resource/calculator).
 
+|Topic| **gpt-4.1** | **gpt-4.1-mini** | **o3-mini** | **o1** | **gpt-4o** | **gpt-4o-mini** |
+| --- | --- | --- | --- | --- | --- | --- |
+|Global & data zone provisioned minimum deployment|15|15|15|15|15|15|
+|Global & data zone provisioned scale increment|5|5|5|5|5|5|
+|Regional provisioned minimum deployment|50|25|25|25|50|25|
+|Regional provisioned scale increment|50|25|25|50|50|25|
+|Input TPM per PTU|3,000|14,900|2,500|230|2,500|37,000|
+|Latency Target Value|44 Tokens Per Second|50 Tokens Per Second| 66 Tokens Per Second |25 Tokens Per Second|25 Tokens Per Second|33 Tokens Per Second|
 
-|Topic| **gpt-4o** | **gpt-4o-mini** | **o1**| gpt-4.1 |
-| --- | --- | --- | --- | --- |
-|Global & data zone provisioned minimum deployment|15|15|15|15 |
-|Global & data zone provisioned scale increment|5|5|5| 5 |
-|Regional provisioned minimum deployment|50|25|50| 50 |
-|Regional provisioned scale increment|50|25|50| 50 |
-|Input TPM per PTU |2,500|37,000|230| 3000 |
-|Latency Target Value |25 Tokens Per Second|33 Tokens Per Second|25 Tokens Per Second| 44 Tokens Per Second |
 
 For a full list, see the [Azure OpenAI Service in Azure AI Foundry portal calculator](https://ai.azure.com/resource/calculator).
 
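The arithmetic implied by the changed passage (1 output token counting as 4 input tokens for `gpt-4.1`, and 3,000 input TPM per PTU) can be sketched roughly as follows. This is only an illustrative estimate, not the official calculator: the function names are hypothetical, and the rounding rule (deployment sizes are the minimum plus whole scale increments) is an assumption drawn from the "minimum deployment" and "scale increment" rows.

```python
import math

# Values copied from the gpt-4.1 column of the added table.
INPUT_TPM_PER_PTU = 3_000
OUTPUT_TO_INPUT_RATIO = 4  # 1 output token counts as 4 input tokens


def input_equivalent_tpm(input_tpm, output_tpm, ratio=OUTPUT_TO_INPUT_RATIO):
    """Convert a mixed workload into input-equivalent tokens per minute."""
    return input_tpm + ratio * output_tpm


def estimate_ptus(input_tpm, output_tpm, minimum, increment,
                  tpm_per_ptu=INPUT_TPM_PER_PTU):
    """Hypothetical helper: estimate a PTU deployment size.

    Assumes valid sizes are `minimum + k * increment` for k >= 0,
    which is an interpretation of the table rows, not a documented rule.
    """
    raw = math.ceil(input_equivalent_tpm(input_tpm, output_tpm) / tpm_per_ptu)
    if raw <= minimum:
        return minimum
    steps = math.ceil((raw - minimum) / increment)
    return minimum + steps * increment


# Example workload: 60,000 input TPM and 10,000 output TPM.
global_ptus = estimate_ptus(60_000, 10_000, minimum=15, increment=5)
regional_ptus = estimate_ptus(60_000, 10_000, minimum=50, increment=50)
print(global_ptus, regional_ptus)
```

For real sizing, the linked capacity calculator remains the authoritative source, since it may account for factors this sketch ignores.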

articles/ai-services/openai/includes/model-matrix/datazone-provisioned-managed.md

Lines changed: 16 additions & 15 deletions
@@ -9,18 +9,19 @@ ms.custom: references_regions
 ms.date: 05/05/2025
 ---
 
-| **Region** | **gpt-4.1**, **2025-04-14** | **o3-mini**, **2025-01-31** | **o1**, **2024-12-17** | **gpt-4o**, **2024-05-13** | **gpt-4o**, **2024-08-06** | **gpt-4o**, **2024-11-20** | **gpt-4o-mini**, **2024-07-18** |
-|:-------------------|:---------------------------:|:---------------------------:|:----------------------:|:--------------------------:|:--------------------------:|:--------------------------:|:-------------------------------:|
-| eastus ||||||||
-| eastus2 ||||||||
-| francecentral ||||||||
-| germanywestcentral ||||||||
-| italynorth || - | - | - | - | - | - |
-| northcentralus ||||||||
-| polandcentral ||||||||
-| southcentralus ||||||||
-| spaincentral ||||||||
-| swedencentral ||||||||
-| westeurope ||||||||
-| westus ||||||||
-| westus3 ||||||||
+
+| **Region** | **gpt-4.1**, **2025-04-14** | **gpt-4.1-mini**, **2025-04-14** | **o3-mini**, **2025-01-31** | **o1**, **2024-12-17** | **gpt-4o**, **2024-05-13** | **gpt-4o**, **2024-08-06** | **gpt-4o**, **2024-11-20** | **gpt-4o-mini**, **2024-07-18** |
+|:-------------------|:---------------------------:|:--------------------------------:|:---------------------------:|:----------------------:|:--------------------------:|:--------------------------:|:--------------------------:|:-------------------------------:|
+| eastus |||||||||
+| eastus2 |||||||||
+| francecentral |||||||||
+| germanywestcentral |||||||||
+| italynorth || - | - | - | - | - | - | - |
+| northcentralus |||||||||
+| polandcentral |||||||||
+| southcentralus |||||||||
+| spaincentral |||||||||
+| swedencentral |||||||||
+| westeurope |||||||||
+| westus |||||||||
+| westus3 |||||||||
