You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-foundry/openai/quotas-limits.md
+14-3Lines changed: 14 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,7 +4,7 @@ description: This article features detailed descriptions and best practices on t
4
4
author: mrbullwinkle
5
5
ms.author: mbullwin
6
6
manager: nitinme
7
-
ms.date: 08/07/2025
7
+
ms.date: 08/19/2025
8
8
ms.service: azure-ai-openai
9
9
ms.topic: conceptual
10
10
ms.custom:
@@ -80,6 +80,13 @@ The following section provides you with a quick guide to the default quotas and
80
80
| gpt-5-nano | 5 M | 150 M | 2 M | 50 M |
81
81
| gpt-5-chat | 1 M | 5 M | N/A | N/A |
82
82
83
+
| Model | Global Default<br>Requests per minute (RPM) | Global Enterprise and MCA-E <br>Requests per minute (RPM) | Data Zone Default <br>Requests per minute (RPM) | Data Zone Enterprise and MCA-E <br>Requests per minute (RPM) |
@@ -206,7 +213,7 @@ The following section provides you with a quick guide to the default quotas and
206
213
| Model|Tier| Quota limit in tokens per minute | Requests per minute |
207
214
|---|---|:---:|:---:|
208
215
|`gpt-4o`|Enterprise and MCA-E | 30M | 180K |
209
-
|`gpt-4o-mini`| Enterprise and MCA-E |50M|300K|
216
+
|`gpt-4o-mini`| Enterprise and MCA-E |150M|1.5M|
210
217
|`gpt-4o`|Default | 450K | 2.7K |
211
218
|`gpt-4o-mini`| Default | 2M | 12K |
212
219
@@ -276,11 +283,15 @@ The usage limit determines the level of usage above which customers might see la
276
283
277
284
If your Azure subscription is linked to certain [offer types](https://azure.microsoft.com/support/legal/offer-details/), your maximum quota values are lower than the values indicated in the previous tables.
278
285
286
+
- GPT-5 reasoning model quota is 20K TPM and 200 RPM for all offer types that do not have access to MCA-E or default quota. GPT-5-chat is 50K and 50 RPM.
287
+
288
+
- Some offer types are restricted to only Global Standard deployments in the East US2 and Sweden Central regions.
289
+
279
290
|Tier| Quota limit in tokens per minute |
280
291
|---|:---|
281
292
|`Azure for Students`| 1K (all models) <br>Exception o-series, GPT-4.1, and GPT 4.5 Preview: 0|
0 commit comments