articles/ai-foundry/openai/quotas-limits.md
29 additions & 17 deletions
@@ -4,7 +4,7 @@ description: Quick reference, detailed description, and best practices on the qu
 author: mrbullwinkle
 ms.author: mbullwin
 manager: nitinme
-ms.date: 05/29/2025
+ms.date: 07/02/2025
 ms.service: azure-ai-openai
 ms.topic: conceptual
 ms.custom:
@@ -72,16 +72,27 @@ The following sections provide you with a quick guide to the default quotas and
 |`gpt-4.5`| Enterprise Tier | 200 K | 200 |
 |`gpt-4.5`| Default | 150 K | 150 |
 
-### GPT-4.1 series
+### GPT-4.1 series global standard
 
 | Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
 |---|---|:---:|:---:|
 |`gpt-4.1` (2025-04-14) | Enterprise Tier | 5 M | 5 K |
 |`gpt-4.1` (2025-04-14) | Default | 1 M | 1 K |
-|`gpt-4.1-nano` (2025-04-14) | Enterprise Tier | 5 M | 5 K |
-|`gpt-4.1-nano` (2025-04-14) | Default | 1 M | 1 K |
-|`gpt-4.1-mini` (2025-04-14) | Enterprise Tier | 5 M | 5 K |
-|`gpt-4.1-mini` (2025-04-14) | Default | 1 M | 1 K |
+|`gpt-4.1-nano` (2025-04-14) | Enterprise Tier | 150 M | 150 K |
+|`gpt-4.1-nano` (2025-04-14) | Default | 5 M | 5 K |
+|`gpt-4.1-mini` (2025-04-14) | Enterprise Tier | 150 M | 150 K |
+|`gpt-4.1-mini` (2025-04-14) | Default | 5 M | 5 K |
+
+### GPT-4.1 series data zone standard
+
+| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
+|---|---|:---:|:---:|
+|`gpt-4.1` (2025-04-14) | Enterprise Tier | 2 M | 2 K |
+|`gpt-4.1` (2025-04-14) | Default | 300 K | 300 |
+|`gpt-4.1-nano` (2025-04-14) | Enterprise Tier | 50 M | 50 K |
+|`gpt-4.1-nano` (2025-04-14) | Default | 2 M | 2 K |
+|`gpt-4.1-mini` (2025-04-14) | Enterprise Tier | 50 M | 50 K |
+|`gpt-4.1-mini` (2025-04-14) | Default | 2 M | 2 K |
 
 ### GPT-4 Turbo
 
@@ -230,19 +241,20 @@ The Usage Limit determines the level of usage above which customers might see la
 > [!NOTE]
 > Usage tiers only apply to standard, data zone standard, and global standard deployment types. Usage tiers don't apply to global batch and provisioned throughput deployments.
 
-### GPT-4o global standard, data zone standard, & standard
+### Global standard, data zone standard, & standard
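
For reviewers who want to sanity-check the new GPT-4.1 numbers, the post-change tables can be transcribed into a small lookup. This is only an illustrative sketch: `QUOTAS` and `fits_quota` are hypothetical names introduced here, not part of any Azure SDK, and the values are copied by hand from the two GPT-4.1 tables in this diff (K = 1,000, M = 1,000,000).

```python
# Hypothetical lookup built from the GPT-4.1 tables as they read AFTER this change.
# Key: (deployment type, model, tier) -> (tokens per minute, requests per minute)
QUOTAS = {
    ("global standard", "gpt-4.1", "Enterprise Tier"): (5_000_000, 5_000),
    ("global standard", "gpt-4.1", "Default"): (1_000_000, 1_000),
    ("global standard", "gpt-4.1-nano", "Enterprise Tier"): (150_000_000, 150_000),
    ("global standard", "gpt-4.1-nano", "Default"): (5_000_000, 5_000),
    ("global standard", "gpt-4.1-mini", "Enterprise Tier"): (150_000_000, 150_000),
    ("global standard", "gpt-4.1-mini", "Default"): (5_000_000, 5_000),
    ("data zone standard", "gpt-4.1", "Enterprise Tier"): (2_000_000, 2_000),
    ("data zone standard", "gpt-4.1", "Default"): (300_000, 300),
    ("data zone standard", "gpt-4.1-nano", "Enterprise Tier"): (50_000_000, 50_000),
    ("data zone standard", "gpt-4.1-nano", "Default"): (2_000_000, 2_000),
    ("data zone standard", "gpt-4.1-mini", "Enterprise Tier"): (50_000_000, 50_000),
    ("data zone standard", "gpt-4.1-mini", "Default"): (2_000_000, 2_000),
}


def fits_quota(deployment_type: str, model: str, tier: str, tpm: int, rpm: int) -> bool:
    """Return True if the planned load stays within both the TPM and RPM limits."""
    limit_tpm, limit_rpm = QUOTAS[(deployment_type, model, tier)]
    return tpm <= limit_tpm and rpm <= limit_rpm


# Consistency check: every row in these tables lists 1 RPM per 1,000 TPM.
assert all(tpm == rpm * 1_000 for tpm, rpm in QUOTAS.values())
```

For example, `fits_quota("data zone standard", "gpt-4.1", "Default", 250_000, 250)` checks a planned load against the 300 K TPM / 300 RPM row. The lookup raises `KeyError` for any combination that is not in the two tables above, which seems preferable to silently guessing a limit.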