Skip to content

Commit ef9ca35

Browse files
Merge pull request #6665 from mrbullwinkle/mrb_08_19_2025_quota_stuff
[Azure OpenAI] Quota updates
2 parents 7f6fc76 + 0646e41 commit ef9ca35

File tree

1 file changed

+14
-3
lines changed

1 file changed

+14
-3
lines changed

articles/ai-foundry/openai/quotas-limits.md

Lines changed: 14 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ description: This article features detailed descriptions and best practices on t
44
author: mrbullwinkle
55
ms.author: mbullwin
66
manager: nitinme
7-
ms.date: 08/07/2025
7+
ms.date: 08/19/2025
88
ms.service: azure-ai-openai
99
ms.topic: conceptual
1010
ms.custom:
@@ -80,6 +80,13 @@ The following section provides you with a quick guide to the default quotas and
8080
| gpt-5-nano | 5 M | 150 M | 2 M | 50 M |
8181
| gpt-5-chat | 1 M | 5 M | N/A | N/A |
8282

83+
| Model | Global Default<br>Requests per minute (RPM) | Global Enterprise and MCA-E <br>Requests per minute (RPM) | Data Zone Default <br>Requests per minute (RPM) | Data Zone Enterprise and MCA-E <br>Requests per minute (RPM) |
84+
|-------------|----------------------------------------------|------------------------------------------------------------|--------------------------------------------------|--------------------------------------------------------------|
85+
| gpt-5 | 10 K | 100 K | 3 K | 30 K |
86+
| gpt-5-mini | 1 K | 10 K | 300 | 3 K |
87+
| gpt-5-nano | 5 K | 150 K | 2 K | 50 K |
88+
| gpt-5-chat | 1 K | 5 K | N/A | N/A |
89+
8390

8491
[!INCLUDE [Quota](./includes/global-batch-limits.md)]
8592

@@ -206,7 +213,7 @@ The following section provides you with a quick guide to the default quotas and
206213
| Model|Tier| Quota limit in tokens per minute | Requests per minute |
207214
|---|---|:---:|:---:|
208215
|`gpt-4o`|Enterprise and MCA-E | 30M | 180K |
209-
|`gpt-4o-mini` | Enterprise and MCA-E | 50M | 300K |
216+
|`gpt-4o-mini` | Enterprise and MCA-E | 150M | 1.5M |
210217
|`gpt-4o` |Default | 450K | 2.7K |
211218
|`gpt-4o-mini` | Default | 2M | 12K |
212219

@@ -276,11 +283,15 @@ The usage limit determines the level of usage above which customers might see la
276283

277284
If your Azure subscription is linked to certain [offer types](https://azure.microsoft.com/support/legal/offer-details/), your maximum quota values are lower than the values indicated in the previous tables.
278285

286+
- GPT-5 reasoning model quota is 20K TPM and 200 RPM for all offer types that do not have access to MCA-E or default quota. GPT-5-chat is 50K and 50 RPM.
287+
288+
- Some offer types are restricted to only Global Standard deployments in the East US2 and Sweden Central regions.
289+
279290
|Tier| Quota limit in tokens per minute |
280291
|---|:---|
281292
|`Azure for Students` | 1K (all models) <br>Exception o-series, GPT-4.1, and GPT 4.5 Preview: 0|
282293
| `MSDN` | GPT-4o-mini: 200K <br> GPT 3.5 Turbo Series: 200K <br> GPT-4 series: 50K <br>computer-use-preview: 8K <br> gpt-4o-realtime-preview: 1K <br> o-series: 0 <br> GPT 4.5 Preview: 0 <br> GPT-4.1: 50K <br> GPT-4.1-nano: 200K |
283-
|`Standard` | GPT-4o-mini: 200K <br> GPT 3.5 Turbo Series: 200K <br> GPT-4 series: 50K <br>computer-use-preview: 30K <br> o-series: 0 <br> GPT 4.5 Preview: 0 <br> GPT-4.1: 50K <br> GPT-4.1-nano: 200K |
294+
|`Standard`& `Pay-as-you-go` | GPT-4o-mini: 200K <br> GPT 3.5 Turbo Series: 200K <br> GPT-4 series: 50K <br>computer-use-preview: 30K <br> o-series: 0 <br> GPT 4.5 Preview: 0 <br> GPT-4.1: 50K <br> GPT-4.1-nano: 200K |
284295
| `Azure_MS-AZR-0111P` <br> `Azure_MS-AZR-0035P` <br> `Azure_MS-AZR-0025P` <br> `Azure_MS-AZR-0052P` <br>| GPT-4o-mini: 200K <br> GPT 3.5 Turbo Series: 200K <br> GPT-4 series: 50K |
285296
| `CSP Integration Sandbox` <sup>*</sup> | All models: 0 |
286297
| `Lightweight trial`<br>`Free trials`<br>`Azure Pass` | All models: 0 |

0 commit comments

Comments
 (0)