Commit b0127ab

Fixed phrasing on the throughput
1 parent 4a6d02d commit b0127ab

1 file changed

+3
-5
lines changed

articles/ai-services/openai/concepts/provisioned-throughput.md

Lines changed: 3 additions & 5 deletions
@@ -38,14 +38,12 @@ An Azure OpenAI Deployment is a unit of management for a specific OpenAI Model.
 
 
 ## How much throughput per PTU you get for each model
-The amount of throughput (tokens per minute or TPM) a deployment gets per PTU is a function of the input and output tokens being generated.
-
-Generating output tokens requires more processing and the more tokens generated, the lower the overall TPM per PTU. Provisioned deployments dynamically balance the two, so users do not have to set specific input and output limits. This approach means the service is resilient to fluctuations in the workload shape.
+The amount of throughput (tokens per minute or TPM) a deployment gets per PTU is a function of the input and output tokens in the minute. Generating output tokens requires more processing than input tokens and so the more output tokens generated the lower your overall TPM. The service dynamically balances the input & output costs, so users do not have to set specific input and output limits. This approach means your deployment is resilient to fluctuations in the workload shape.
 
 To help with simplifying the sizing effort, the following table outlines the TPM per PTU for the `gpt-4o` and `gpt-4o-mini` models
 
-| | **gpt-4o**, **2024-05-13** & **gpt-4o**, **2024-08-06** | **gpt-4o-mini**, **2024-07-18** |
-| --| --| --|
+| Variable | **gpt-4o**, **2024-05-13** & **gpt-4o**, **2024-08-06** | **gpt-4o-mini**, **2024-07-18** |
+| --- | --- | --- |
 | Deployable Increments | 50 | 25|
 | Input TPM per PTU | 2,500 | 37,000 |
 | Output TPM per PTU | 833 | 12,333 |
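
Reviewer note: the table's figures imply that an output token costs roughly three times as much capacity as an input token (2,500 / 833 ≈ 3 and 37,000 / 12,333 ≈ 3). The sketch below shows one way a reader might turn the table into a rough sizing estimate. It assumes that input and output load add linearly and that deployments round up to a whole number of deployable increments. Neither assumption is stated in the changed text, and `estimate_ptus` is a hypothetical helper, not part of any Azure SDK.

```python
import math

# Values copied from the table in this diff:
# model -> (deployable increment, input TPM per PTU, output TPM per PTU)
MODELS = {
    "gpt-4o": (50, 2_500, 833),
    "gpt-4o-mini": (25, 37_000, 12_333),
}


def estimate_ptus(model: str, input_tpm: float, output_tpm: float) -> int:
    """Rough PTU estimate for a workload (hypothetical helper).

    Assumption: the PTU cost of input and output tokens adds linearly,
    and the result is rounded up to a multiple of the deployable increment.
    """
    increment, in_per_ptu, out_per_ptu = MODELS[model]
    raw_ptus = input_tpm / in_per_ptu + output_tpm / out_per_ptu
    return math.ceil(raw_ptus / increment) * increment


if __name__ == "__main__":
    # 100,000 input TPM and 20,000 output TPM on gpt-4o:
    # 40 PTUs for input plus about 24 for output, rounded up to 100.
    print(estimate_ptus("gpt-4o", 100_000, 20_000))
```

Treat the result as a starting point for discussion only; actual sizing should follow the service's own capacity tooling.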
