Skip to content

Commit 577f29a

Browse files
committed
updating models
1 parent eb8525a commit 577f29a

File tree

1 file changed

+8
-8
lines changed

1 file changed

+8
-8
lines changed

articles/ai-services/openai/how-to/provisioned-throughput-onboarding.md

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -82,14 +82,14 @@ The amount of throughput (measured in tokens per minute or TPM) a deployment get
8282
For example, for `gpt-4.1:2025-04-14`, 1 output token counts as 4 input tokens towards your utilization limit which matches the [pricing](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/). Older models use a different ratio and for a deeper understanding on how different ratios of input and output tokens impact the throughput your workload needs, see the [Azure OpenAI capacity calculator](https://ai.azure.com/resource/calculator).
8383

8484

85-
|Topic| **gpt-4o** | **gpt-4o-mini** | **o1**| gpt-4.1 |
86-
| --- | --- | --- | --- | --- |
87-
|Global & data zone provisioned minimum deployment|15|15|15|15 |
88-
|Global & data zone provisioned scale increment|5|5|5| 5 |
89-
|Regional provisioned minimum deployment|50|25|50| 50 |
90-
|Regional provisioned scale increment|50|25|50| 50 |
91-
|Input TPM per PTU |2,500|37,000|230| 3000 |
92-
|Latency Target Value |25 Tokens Per Second|33 Tokens Per Second|25 Tokens Per Second| 44 Tokens Per Second |
85+
|Topic| **gpt-4o** | **gpt-4o-mini** | **o1**| **o3-mini** | **gpt-4.1** |
86+
| --- | --- | --- | --- | --- | --- |
87+
|Global & data zone provisioned minimum deployment|15|15|15| 15 | 15 |
88+
|Global & data zone provisioned scale increment|5|5|5 | 5 | 5 |
89+
|Regional provisioned minimum deployment|50|25|50| 15 | 50 |
90+
|Regional provisioned scale increment|50|25|50| ?? | 50 |
91+
|Input TPM per PTU |2,500|37,000|230| 3000 | |
92+
|Latency Target Value |25 Tokens Per Second|33 Tokens Per Second|25 Tokens Per Second| 44 Tokens Per Second | |
9393

9494
For a full list, see the [Azure OpenAI Service in Azure AI Foundry portal calculator](https://ai.azure.com/resource/calculator).
9595

0 commit comments

Comments
 (0)