Skip to content

Commit 82e2db2

Browse files
committed
Learn Editor: Update provisioned-throughput.md
1 parent 40099a8 commit 82e2db2

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

articles/ai-services/openai/concepts/provisioned-throughput.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -44,7 +44,7 @@ The amount of throughput (tokens per minute or TPM) a deployment gets per PTU is
4444

4545
To help with simplifying the sizing effort, the following table outlines the TPM per PTU for the specified models. To understand the impact of output tokens on the TPM per PTU limit, use the 3 input token to 1 output token ratio. For a detailed understanding of how different ratios of input and output tokens impact your TPM per PTU, see the [Azure OpenAI capacity calculator](https://oai.azure.com/portal/calculator). The table also shows Service Level Agreement (SLA) Latency Target Values per model. For more information about the SLA for Azure OpenAI Service, see the [Service Level Agreements (SLA) for Online Services page](https://www.microsoft.com/licensing/docs/view/Service-Level-Agreements-SLA-for-Online-Services?lang=1)
4646

47-
|Topic| **gpt-4o**, **2024-05-13** & **gpt-4o**, **2024-08-06** | **gpt-4o-mini**, **2024-07-18** |
47+
|Topic| **gpt-4o** | **gpt-4o-mini** |
4848
| --- | --- | --- |
4949
|Global & data zone provisioned minimum deployment|15|15|
5050
|Global & data zone provisioned scale increment|5|5|

0 commit comments

Comments
 (0)