Skip to content

Commit 0595d2c

Browse files
committed
update
1 parent 29b8bd2 commit 0595d2c

File tree

2 files changed

+33
-33
lines changed

2 files changed

+33
-33
lines changed

articles/ai-foundry/openai/includes/global-batch-limits.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@ The table shows the batch quota limit. Quota values for global batch are represe
2222

2323
### Global batch
2424

25-
|Model|Enterprise agreement|Default| Monthly credit card based subscriptions | MSDN subscriptions | Azure for Students, Free Trials |
25+
|Model|Enterprise & MCA-E|Default| Monthly credit card based subscriptions | MSDN subscriptions | Azure for Students, Free Trials |
2626
|---|---|---|---|---|---|
2727
| `gpt-4.1`| 5 B | 200 M | 50 M | 90 K | N/A |
2828
| `gpt-4.1 mini` | 15B | 1B | 50M | 90k | N/A |
@@ -39,7 +39,7 @@ B = billion | M = million | K = thousand
3939

4040
### Data zone batch
4141

42-
|Model|Enterprise agreement|Default| Monthly credit card based subscriptions | MSDN subscriptions | Azure for Students, Free Trials |
42+
|Model|Enterprise & MCA-E|Default| Monthly credit card based subscriptions | MSDN subscriptions | Azure for Students, Free Trials |
4343
|---|---|---|---|---|---|
4444
| `gpt-4.1` | 500 M | 30 M | 30 M | 90 K | N/A|
4545
| `gpt-4.1-mini` | 1.5 B | 100 M | 50 M | 90 K | N/A |

articles/ai-foundry/openai/quotas-limits.md

Lines changed: 31 additions & 31 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ description: Quick reference, detailed description, and best practices on the qu
44
author: mrbullwinkle
55
ms.author: mbullwin
66
manager: nitinme
7-
ms.date: 07/02/2025
7+
ms.date: 07/11/2025
88
ms.service: azure-ai-openai
99
ms.topic: conceptual
1010
ms.custom:
@@ -15,7 +15,7 @@ ms.custom:
1515

1616
# Azure OpenAI in Azure AI Foundry Models quotas and limits
1717

18-
This article contains a quick reference and a detailed description of the quotas and limits for Azure OpenAI.
18+
This article contains a quick reference and a detailed description of the quotas and limits for Azure OpenAI. Quota is not restricted at the tenant level. At its highest level, quota is scoped per individual Azure subscription. Tokens per minute (TPM) and Requests per minute (RPM) quota limits for each model and deployment type are set per region. For example, if `gpt-4.1` global standard has 5 million TPM and 5,000 RPM, each region where the [model/deployment type is available](./concepts/models.md) can use up that amount of quota for an individual subscription. Quota is not shared cross region.
1919

2020
## Quotas and limits reference
2121

@@ -70,29 +70,29 @@ The following sections provide you with a quick guide to the default quotas and
7070

7171
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
7272
|---|---|:---:|:---:|
73-
| `gpt-4.5` | Enterprise Tier | 200 K | 200 |
73+
| `gpt-4.5` | Enterprise & MCA-E | 200 K | 200 |
7474
| `gpt-4.5` | Default | 150 K | 150 |
7575

7676
### GPT-4.1 series global standard
7777

7878
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
7979
|---|---|:---:|:---:|
80-
| `gpt-4.1` (2025-04-14) | Enterprise Tier | 5 M | 5 K |
80+
| `gpt-4.1` (2025-04-14) | Enterprise & MCA-E | 5 M | 5 K |
8181
| `gpt-4.1` (2025-04-14) | Default | 1 M | 1 K |
82-
| `gpt-4.1-nano` (2025-04-14) | Enterprise Tier | 150 M | 150 K |
82+
| `gpt-4.1-nano` (2025-04-14) | Enterprise & MCA-E | 150 M | 150 K |
8383
| `gpt-4.1-nano` (2025-04-14) | Default | 5 M | 5 K |
84-
| `gpt-4.1-mini` (2025-04-14) | Enterprise Tier | 150 M | 150 K |
84+
| `gpt-4.1-mini` (2025-04-14) | Enterprise & MCA-E | 150 M | 150 K |
8585
| `gpt-4.1-mini` (2025-04-14) | Default | 5 M | 5 K |
8686

8787
### GPT-4.1 series data zone standard
8888

8989
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
9090
|---|---|:---:|:---:|
91-
| `gpt-4.1` (2025-04-14) | Enterprise Tier | 2 M | 2 K |
91+
| `gpt-4.1` (2025-04-14) | Enterprise & MCA-E | 2 M | 2 K |
9292
| `gpt-4.1` (2025-04-14) | Default | 300 K | 300 |
93-
| `gpt-4.1-nano` (2025-04-14) | Enterprise Tier | 50 M | 50 K |
93+
| `gpt-4.1-nano` (2025-04-14) | Enterprise & MCA-E | 50 M | 50 K |
9494
| `gpt-4.1-nano` (2025-04-14) | Default | 2 M | 2 K |
95-
| `gpt-4.1-mini` (2025-04-14) | Enterprise Tier | 50 M | 50 K |
95+
| `gpt-4.1-mini` (2025-04-14) | Enterprise & MCA-E | 50 M | 50 K |
9696
| `gpt-4.1-mini` (2025-04-14) | Default | 2 M | 2 K |
9797

9898
### GPT-4 Turbo
@@ -101,21 +101,21 @@ The following sections provide you with a quick guide to the default quotas and
101101

102102
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
103103
|---|---|:---:|:---:|
104-
|`gpt-4` (turbo-2024-04-09) | Enterprise agreement | 2 M | 12 K |
104+
|`gpt-4` (turbo-2024-04-09) | Enterprise & MCA-E | 2 M | 12 K |
105105
|`gpt-4` (turbo-2024-04-09) | Default | 450 K | 2.7 K |
106106

107107
## model-router rate limits
108108

109109
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
110110
|---|---|:---:|:---:|
111-
| `model-router` (2025-05-19) | Enterprise Tier | 10 M | 10 K |
111+
| `model-router` (2025-05-19) | Enterprise & MCA-E | 10 M | 10 K |
112112
| `model-router` (2025-05-19) | Default | 1 M | 1 K |
113113

114114
## computer-use-preview global standard rate limits
115115

116116
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
117117
|---|---|:---:|:---:|
118-
| `computer-use-preview`| Enterprise Tier | 30 M | 300 K |
118+
| `computer-use-preview`| Enterprise & MCA-E | 30 M | 300 K |
119119
| `computer-use-preview`| Default | 450 K | 4.5 K |
120120

121121
## o-series rate limits
@@ -139,13 +139,13 @@ The following sections provide you with a quick guide to the default quotas and
139139

140140
| Model |Tier | Quota Limit in tokens per minute (TPM) | Requests per minute |
141141
|--------------------|------------------------|:--------------------------------------:|:---: |
142-
| `codex-mini` | Enterprise agreement | 10 M | 10 K |
143-
| `o3-pro` | Enterprise agreement | 16 M | 1.6 K |
144-
| `o4-mini` | Enterprise agreement | 10 M | 10 K |
145-
| `o3` | Enterprise agreement | 10 M | 10 K |
146-
| `o3-mini` | Enterprise agreement | 50 M | 5 K |
147-
| `o1` & `o1-preview`| Enterprise agreement | 30 M | 5 K |
148-
| `o1-mini` | Enterprise agreement | 50 M | 5 K |
142+
| `codex-mini` | Enterprise & MCA-E | 10 M | 10 K |
143+
| `o3-pro` | Enterprise & MCA-E | 16 M | 1.6 K |
144+
| `o4-mini` | Enterprise & MCA-E | 10 M | 10 K |
145+
| `o3` | Enterprise & MCA-E | 10 M | 10 K |
146+
| `o3-mini` | Enterprise & MCA-E | 50 M | 5 K |
147+
| `o1` & `o1-preview`| Enterprise & MCA-E | 30 M | 5 K |
148+
| `o1-mini` | Enterprise & MCA-E | 50 M | 5 K |
149149
| `codex-mini` | Default | 1 M | 1 K |
150150
| `o3-pro` | Default | 1.6 M | 160 |
151151
| `o4-mini` | Default | 1 M | 1 K |
@@ -158,17 +158,17 @@ The following sections provide you with a quick guide to the default quotas and
158158

159159
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
160160
|---|---|:---:|:---:|
161-
| `o3-mini` | Enterprise agreement | 20 M | 2 K |
161+
| `o3-mini` | Enterprise & MCA-E | 20 M | 2 K |
162162
| `o3-mini` | Default | 2 M | 200 |
163-
| `o1` | Enterprise agreement | 6 M | 1 K |
163+
| `o1` | Enterprise & MCA-E | 6 M | 1 K |
164164
| `o1` | Default | 600 K | 100 |
165165

166166
### o1-preview & o1-mini standard
167167

168168
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
169169
|---|---|:---:|:---:|
170-
| `o1-preview` | Enterprise agreement | 600 K | 100 |
171-
| `o1-mini`| Enterprise agreement | 1 M | 100 |
170+
| `o1-preview` | Enterprise & MCA-E | 600 K | 100 |
171+
| `o1-mini`| Enterprise & MCA-E | 1 M | 100 |
172172
| `o1-preview` | Default | 300 K | 50 |
173173
| `o1-mini`| Default | 500 K | 50 |
174174

@@ -180,8 +180,8 @@ The following sections provide you with a quick guide to the default quotas and
180180

181181
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
182182
|---|---|:---:|:---:|
183-
|`gpt-4o`|Enterprise agreement | 30 M | 180 K |
184-
|`gpt-4o-mini` | Enterprise agreement | 50 M | 300 K |
183+
|`gpt-4o`|Enterprise & MCA-E | 30 M | 180 K |
184+
|`gpt-4o-mini` | Enterprise & MCA-E | 50 M | 300 K |
185185
|`gpt-4o` |Default | 450 K | 2.7 K |
186186
|`gpt-4o-mini` | Default | 2 M | 12 K |
187187

@@ -191,8 +191,8 @@ M = million | K = thousand
191191

192192
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
193193
|---|---|:---:|:---:|
194-
|`gpt-4o`|Enterprise agreement | 10 M | 60 K |
195-
|`gpt-4o-mini` | Enterprise agreement | 20 M | 120 K |
194+
|`gpt-4o`|Enterprise & MCA-E | 10 M | 60 K |
195+
|`gpt-4o-mini` | Enterprise & MCA-E | 20 M | 120 K |
196196
|`gpt-4o` |Default | 300 K | 1.8 K |
197197
|`gpt-4o-mini` | Default | 1 M | 6 K |
198198

@@ -203,8 +203,8 @@ M = million | K = thousand
203203

204204
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
205205
|---|---|:---:|:---:|
206-
|`gpt-4o`|Enterprise agreement | 1 M | 6 K |
207-
|`gpt-4o-mini` | Enterprise agreement | 2 M | 12 K |
206+
|`gpt-4o`|Enterprise & MCA-E | 1 M | 6 K |
207+
|`gpt-4o-mini` | Enterprise & MCA-E | 2 M | 12 K |
208208
|`gpt-4o`|Default | 150 K | 900 |
209209
|`gpt-4o-mini` | Default | 450 K | 2.7 K |
210210

@@ -229,7 +229,7 @@ M = million | K = thousand
229229

230230
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
231231
|---|---|:---:|:---:|
232-
|`gpt-image-1`|Enterprise agreement | N/A | 20 |
232+
|`gpt-image-1`|Enterprise & MCA-E | N/A | 20 |
233233
|`gpt-image-1` |Default | N/A | 6 |
234234

235235

@@ -317,7 +317,7 @@ az rest --method GET --uri "https://management.azure.com/subscriptions/{sub-id}?
317317

318318
| Quota allocation/Offer type | Subscription quota ID |
319319
|:---|:----|
320-
| Enterprise | `EnterpriseAgreement_2014-09-01` |
320+
| Enterprise & MCA-E | `EnterpriseAgreement_2014-09-01` |
321321
| Pay-as-you-go | `PayAsYouGo_2014-09-01`|
322322
| MSDN | `MSDN_2014-09-01` |
323323
| CSP Integration Sandbox | `CSPDEVTEST_2018-05-01` |

0 commit comments

Comments
 (0)