Skip to content

Commit 287a9d7

Browse files
committed
Learn Editor: Update latency.md
1 parent 6400c9b commit 287a9d7

File tree

1 file changed

+1
-2
lines changed

1 file changed

+1
-2
lines changed

articles/ai-services/openai/how-to/latency.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -71,8 +71,7 @@ Here are a few examples for the GPT-4o mini model:
7171
|5,000 |50 |1,000|5,000,000|50,000|5,050,000|140|
7272
|1,000 |300 | 500 |500,000|150,000|650,000|30|
7373

74-
The number of PTUs scales roughly linearly with call rate (might be sublinear) when the workload distribution remains constant.
75-
74+
The number of PTUs scales roughly linearly with call rate when the workload distribution remains constant.
7675

7776
### Latency: The per-call response times
7877

0 commit comments

Comments
 (0)