Skip to content

Commit bf0a78e

Browse files
committed
new models
1 parent 7c29e73 commit bf0a78e

File tree

3 files changed

+14
-14
lines changed

3 files changed

+14
-14
lines changed

articles/ai-services/openai/how-to/provisioned-throughput-onboarding.md

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -77,14 +77,14 @@ The amount of throughput (measured in tokens per minute or TPM) a deployment get
7777

7878
For example, for `gpt-4.1:2025-04-14`, 1 output token counts as 4 input tokens towards your utilization limit which matches the [pricing](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/). Older models use a different ratio and for a deeper understanding on how different ratios of input and output tokens impact the throughput your workload needs, see the [Azure OpenAI capacity calculator](https://ai.azure.com/resource/calculator).
7979

80-
|Topic| **gpt-4.1** | **gpt-4.1-mini** | **o3-mini** | **o1** | **gpt-4o** | **gpt-4o-mini** |
81-
| --- | --- | --- | --- | --- | --- | --- |
82-
|Global & data zone provisioned minimum deployment|15|15|15|15|15|15|
83-
|Global & data zone provisioned scale increment|5|5|5|5|5|5|
84-
|Regional provisioned minimum deployment|50|25|25|25|50|25|
85-
|Regional provisioned scale increment|50|25|25|50|50|25|
86-
|Input TPM per PTU|3,000|14,900|2,500|230|2,500|37,000|
87-
|Latency Target Value|44 Tokens Per Second|50 Tokens Per Second| 66 Tokens Per Second |25 Tokens Per Second|25 Tokens Per Second|33 Tokens Per Second|
80+
|Topic| **gpt-4.1** | **gpt-4.1-mini** | **gpt-4.1-nano** | **o3** | **o3-mini** | **o1** | **gpt-4o** | **gpt-4o-mini** |
81+
| --- | --- | --- | --- | --- | --- | --- | --- |
82+
|Global & data zone provisioned minimum deployment|15|15| 15 |15|15|15|15|
83+
|Global & data zone provisioned scale increment|5|5| 5 |5|5|5|5|
84+
|Regional provisioned minimum deployment|50|25|50 | 25|25|50|25|
85+
|Regional provisioned scale increment|50|25| 50 | 25|50|50|25|
86+
|Input TPM per PTU|3,000|14,900| 600 | 2,500|230|2,500|37,000|
87+
|Latency Target Value|44 Tokens Per Second|50 Tokens Per Second| 40 Tokens Per Second | 66 Tokens Per Second |25 Tokens Per Second|25 Tokens Per Second|33 Tokens Per Second|
8888

8989

9090
For a full list, see the [Azure OpenAI in Azure AI Foundry Models in Azure AI Foundry portal calculator](https://ai.azure.com/resource/calculator).

articles/ai-services/openai/includes/model-matrix/provisioned-global.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ ms.date: 05/05/2025
1313
| **Region** | **o3**, **2025-04-16** | **o4-mini**, **2025-04-16** | **gpt-4.1**, **2025-04-14** | **gpt-4.1-nano**, **2025-04-14** | **gpt-4.1-mini**, **2025-04-14** | **o3-mini**, **2025-01-31** | **o1**, **2024-12-17** | **gpt-4o**, **2024-05-13** | **gpt-4o**, **2024-08-06** | **gpt-4o**, **2024-11-20** | **gpt-4o-mini**, **2024-07-18** |
1414
|:-------------------|:----------------------:|:---------------------------:|:---------------------------:|:--------------------------------:|:--------------------------------:|:---------------------------:|:----------------------:|:--------------------------:|:--------------------------:|:--------------------------:|:-------------------------------:|
1515
| australiaeast ||||||||||||
16-
| brazilsouth |||| - ||||||||
16+
| brazilsouth |||| ||||||||
1717
| canadaeast ||||||||||||
1818
| eastus ||||||||||||
1919
| eastus2 ||||||||||||
@@ -25,14 +25,14 @@ ms.date: 05/05/2025
2525
| northcentralus |||| - ||||||||
2626
| norwayeast ||||||||||||
2727
| polandcentral ||||||||||||
28-
| southafricanorth |||| - ||||||||
28+
| southafricanorth |||| ||||||||
2929
| southcentralus ||||||||||||
30-
| southeastasia |||| - ||||||||
31-
| southindia |||| - ||||||||
30+
| southeastasia |||| ||||||||
31+
| southindia |||| ||||||||
3232
| spaincentral ||||||||||||
3333
| swedencentral ||||||||||||
3434
| switzerlandnorth ||||||||||||
35-
| switzerlandwest |||| - ||||||||
35+
| switzerlandwest |||| ||||||||
3636
| uaenorth ||||||||||||
3737
| uksouth ||||||||||||
3838
| westeurope ||||||||||||

articles/ai-services/openai/includes/model-matrix/provisioned-models.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ ms.date: 05/07/2025
2121
| francecentral | - | - | - | - | - | - | - ||| - ||||| - || - ||
2222
| germanywestcentral | - | - | - | - | - | - | - |||| - ||||||| - |
2323
| japaneast | - | - | - | - | - ||||||| - |||| - | - ||
24-
| koreacentral | - | - | - | - | - || - |||||| - | - |||| - |
24+
| koreacentral | - | - | | - | - || - |||||| - | - |||| - |
2525
| northcentralus | - ||| - | - | - | - ||||||||||||
2626
| norwayeast | - | - | - | - | - | - | - ||| - ||| - || - || - | - |
2727
| polandcentral | - | - | - | - | - | - | - || - || - ||||||||

0 commit comments

Comments
 (0)