Skip to content

Commit a7a0825

Browse files
committed
update
1 parent 0595d2c commit a7a0825

File tree

1 file changed

+11
-1
lines changed

1 file changed

+11
-1
lines changed

articles/ai-foundry/openai/quotas-limits.md

Lines changed: 11 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,17 @@ ms.custom:
1515

1616
# Azure OpenAI in Azure AI Foundry Models quotas and limits
1717

18-
This article contains a quick reference and a detailed description of the quotas and limits for Azure OpenAI. Quota is not restricted at the tenant level. At its highest level, quota is scoped per individual Azure subscription. Tokens per minute (TPM) and Requests per minute (RPM) quota limits for each model and deployment type are set per region. For example, if `gpt-4.1` global standard has 5 million TPM and 5,000 RPM, each region where the [model/deployment type is available](./concepts/models.md) can use up that amount of quota for an individual subscription. Quota is not shared cross region.
18+
This article contains a quick reference and a detailed description of the quotas and limits for Azure OpenAI.
19+
20+
**Scope of quota**:
21+
22+
- Quotas and limits are not enforced at the tenant level.
23+
- Instead, the highest level of quota restrictions are scoped at the Azure subscription level.
24+
25+
**Regional quota allocation:**
26+
27+
- Tokens per minute (TPM) and requests per minute (RPM) limits are defined **per region, per subscription, and per model/deployment type**.
28+
- For example, if the `gpt-4.1` global standard model is listed with a quota of **5 million TPM and 5,000 RPM**, then **each region** where that [model/deployment type is available](./concepts/models.md) has its own dedicated pool of quota of that amount for **each of your Azure subscriptions**. So within a single Azure subscription, it is possible to use a larger quantity of total TPM/RPM quota for a given model/deployment type, as long as you have resources/model deployments spread across multiple regions.
1929

2030
## Quotas and limits reference
2131

0 commit comments

Comments
 (0)