Commit 1cb8d66

Merge pull request #606 from mrbullwinkle/mrb_09_30_2024_o1_preview_quota
[Azure OpenAI] Release branch quota
2 parents 0bcedc5 + 6e98c46 commit 1cb8d66

1 file changed: +20 -0 lines changed

articles/ai-services/openai/quotas-limits.md

Lines changed: 20 additions & 0 deletions
@@ -60,6 +60,26 @@ The following sections provide you with a quick guide to the default quotas and
 
 [!INCLUDE [Quota](./includes/global-batch-limits.md)]
 
+## o1-preview & o1-mini rate limits
+
+### o1-preview & o1-mini global standard
+
+| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
+|---|---|:---:|:---:|
+| `o1-preview` | Enterprise agreement | 15 M | 2.5 K |
+| `o1-mini`| Enterprise agreement | 50 M | 5 K |
+| `o1-preview` | Default | 1.5 M | 250 |
+| `o1-mini`| Default | 20 M | 2 K |
+
+### o1-preview & o1-mini standard
+
+| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
+|---|---|:---:|:---:|
+| `o1-preview` | Enterprise agreement | 300 K | 50 |
+| `o1-mini`| Enterprise agreement | 1 M | 100 |
+| `o1-preview` | Default | 300 K | 50 |
+| `o1-mini`| Default | 1 M | 100 |
+
 ## gpt-4o & GPT-4 Turbo rate limits
 
 `gpt-4o` and `gpt-4o-mini`, and `gpt-4` (`turbo-2024-04-09`) have rate limit tiers with higher limits for certain customer types.
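
The two o1 tables added in this diff can be read as a small lookup of (TPM, RPM) pairs. The sketch below is one possible way to encode them and check a planned workload against both limits. It assumes "K" means thousand and "M" means million. The dictionary keys, function names, and the simple "requests times average tokens" estimate are hypothetical choices for illustration; the diff does not say how the service itself counts tokens against the quota.

```python
# Limits transcribed from the two o1 tables in the diff above.
# Assumption: "K" = 1,000 and "M" = 1,000,000.
# Key layout (deployment type, tier, model) is a hypothetical convention.
O1_LIMITS = {
    ("global_standard", "enterprise", "o1-preview"): (15_000_000, 2_500),
    ("global_standard", "enterprise", "o1-mini"): (50_000_000, 5_000),
    ("global_standard", "default", "o1-preview"): (1_500_000, 250),
    ("global_standard", "default", "o1-mini"): (20_000_000, 2_000),
    ("standard", "enterprise", "o1-preview"): (300_000, 50),
    ("standard", "enterprise", "o1-mini"): (1_000_000, 100),
    ("standard", "default", "o1-preview"): (300_000, 50),
    ("standard", "default", "o1-mini"): (1_000_000, 100),
}


def tokens_per_request(deployment_type, tier, model):
    """Average tokens per request implied by dividing TPM by RPM."""
    tpm, rpm = O1_LIMITS[(deployment_type, tier, model)]
    return tpm / rpm


def fits_quota(deployment_type, tier, model, requests_per_minute, avg_tokens):
    """Return True if a steady workload stays within both TPM and RPM.

    Token usage is estimated as requests_per_minute * avg_tokens, which is
    a simplification and not necessarily how the service meters usage.
    """
    tpm, rpm = O1_LIMITS[(deployment_type, tier, model)]
    within_rpm = requests_per_minute <= rpm
    within_tpm = requests_per_minute * avg_tokens <= tpm
    return within_rpm and within_tpm


if __name__ == "__main__":
    for key in O1_LIMITS:
        print(key, tokens_per_request(*key))
```

Dividing TPM by RPM for each row gives 6,000 tokens per request for every `o1-preview` entry and 10,000 for every `o1-mini` entry, so a workload whose average request is larger than that will hit the TPM limit before the RPM limit.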
