Skip to content

Commit f7ec1e8

Browse files
committed
updating models
1 parent f8ed851 commit f7ec1e8

File tree

2 files changed

+33
-34
lines changed

2 files changed

+33
-34
lines changed

articles/ai-services/openai/how-to/provisioned-throughput-onboarding.md

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -82,14 +82,14 @@ The amount of throughput (measured in tokens per minute or TPM) a deployment get
8282
For example, for `gpt-4.1:2025-04-14`, 1 output token counts as 4 input tokens towards your utilization limit which matches the [pricing](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/). Older models use a different ratio and for a deeper understanding on how different ratios of input and output tokens impact the throughput your workload needs, see the [Azure OpenAI capacity calculator](https://ai.azure.com/resource/calculator).
8383

8484

85-
|Topic| **gpt-4o** | **gpt-4o-mini** | **o1**| **o3-mini** | **gpt-4.1** |
86-
| --- | --- | --- | --- | --- | --- |
87-
|Global & data zone provisioned minimum deployment|15|15|15| 15 | 15 |
88-
|Global & data zone provisioned scale increment|5|5|5 | 5 | 5 |
89-
|Regional provisioned minimum deployment|50|25|50| 25 | 50 |
90-
|Regional provisioned scale increment|50|25|50| 25 | 50 |
91-
|Input TPM per PTU |2,500|37,000|230| | 3000 |
92-
|Latency Target Value |25 Tokens Per Second|33 Tokens Per Second|25 Tokens Per Second| | 44 Tokens Per Second |
85+
|Topic| **gpt-4o** | **gpt-4o-mini** | **o1**| **o3-mini** | **gpt-4.1** | **gpt-4.1-mini** |
86+
| --- | --- | --- | --- | --- | --- | --- |
87+
|Global & data zone provisioned minimum deployment|15|15|15| 15 | 15 | 15 |
88+
|Global & data zone provisioned scale increment|5|5|5 | 5 | 5 | 5 |
89+
|Regional provisioned minimum deployment|50|25|25| 25 | 50 | 25 |
90+
|Regional provisioned scale increment|50|25|50| 25 | 50 | 25 |
91+
|Input TPM per PTU |2,500|37,000|230| | 3000 | 14,900 |
92+
|Latency Target Value |25 Tokens Per Second|33 Tokens Per Second|25 Tokens Per Second| | 44 Tokens Per Second | 50 Tokens Per Second |
9393

9494
For a full list, see the [Azure OpenAI Service in Azure AI Foundry portal calculator](https://ai.azure.com/resource/calculator).
9595

articles/ai-services/openai/includes/model-matrix/provisioned-global.md

Lines changed: 25 additions & 26 deletions
Original file line numberDiff line numberDiff line change
@@ -11,33 +11,32 @@ ms.date: 05/05/2025
1111

1212

1313

14+
|
1415
| **Region** | **gpt-4.1**, **2025-04-14** | **gpt-4.1-mini**, **2025-04-14** | **o3-mini**, **2025-01-31** | **o1**, **2024-12-17** | **gpt-4o**, **2024-05-13** | **gpt-4o**, **2024-08-06** | **gpt-4o**, **2024-11-20** | **gpt-4o-mini**, **2024-07-18** |
1516
|:-------------------|:---------------------------:|:--------------------------------:|:---------------------------:|:----------------------:|:--------------------------:|:--------------------------:|:--------------------------:|:-------------------------------:|
16-
| australiaeast |||||||||
17-
| brazilsouth |||||||||
18-
| canadacentral |||||||||
19-
| canadaeast |||||||||
20-
| eastus |||||||||
21-
| eastus2 |||||||||
22-
| francecentral |||||||||
23-
| germanywestcentral |||||||||
24-
| global | - | - | - ||||||
25-
| italynorth |||||||||
26-
| japaneast |||||||||
27-
| koreacentral |||||||||
28-
| northcentralus |||||||||
29-
| norwayeast |||||||||
30-
| polandcentral |||||||||
31-
| southafricanorth |||||||||
17+
| australiaeast || - |||||||
18+
| brazilsouth || - |||||||
19+
| canadaeast || - |||||||
20+
| eastus || - |||||||
21+
| eastus2 || - |||||||
22+
| francecentral || - |||||||
23+
| germanywestcentral || - |||||||
24+
| italynorth || - |||||||
25+
| japaneast || - |||||||
26+
| koreacentral || - |||||||
27+
| northcentralus || - |||||||
28+
| norwayeast || - |||||||
29+
| polandcentral || - |||||||
30+
| southafricanorth || - |||||||
3231
| southcentralus |||||||||
33-
| southeastasia || |||||||
34-
| southindia || |||||||
35-
| spaincentral || |||||||
36-
| swedencentral || |||||||
37-
| switzerlandnorth || |||||||
32+
| southeastasia || - |||||||
33+
| southindia || - |||||||
34+
| spaincentral || - |||||||
35+
| swedencentral || - |||||||
36+
| switzerlandnorth || - |||||||
3837
| switzerlandwest || - |||||||
39-
| uaenorth || |||||||
40-
| uksouth || |||||||
41-
| westeurope || |||||||
42-
| westus || |||||||
43-
| westus3 || |||||||
38+
| uaenorth || - |||||||
39+
| uksouth || - |||||||
40+
| westeurope || - |||||||
41+
| westus || - |||||||
42+
| westus3 || - |||||||

0 commit comments

Comments
 (0)