articles/ai-foundry/openai/quotas-limits.md
Lines changed: 6 additions & 6 deletions
@@ -29,7 +29,7 @@ For example, if the `gpt-4.1` Global Standard model is listed with a quota of *5

 ## Quotas and limits reference

-The following sections provide you with a quick guide to the default quotas and limits that apply to Azure OpenAI:
+The following section provides you with a quick guide to the default quotas and limits that apply to Azure OpenAI:

 | Limit name | Limit value |
 |--|--|
@@ -234,7 +234,7 @@ The rate limits for each `gpt-4o` audio model deployment are 100,000 tokens per

 ## Usage tiers

-Global Standard deployments use the global infrastructure of Azure. They dynamically route customer traffic to the data center with the best availability for the customer’s inference requests. Similarly, Data Zone Standard deployments allow you to use the global infrastructure of Azure to dynamically route traffic to the data center within the Microsoft-defined data zone with the best availability for each request. This practice enables more consistent latency for customers with low to medium levels of traffic. Customers with high sustained levels of usage might see greater variability in response latency.
+Global Standard deployments use the global infrastructure of Azure. They dynamically route customer traffic to the data center with the best availability for the customer's inference requests. Similarly, Data Zone Standard deployments allow you to use the global infrastructure of Azure to dynamically route traffic to the data center within the Microsoft-defined data zone with the best availability for each request. This practice enables more consistent latency for customers with low to medium levels of traffic. Customers with high sustained levels of usage might see greater variability in response latency.

 The usage limit determines the level of usage above which customers might see larger variability in response latency. A customer's usage is defined per model. It's the total number of tokens consumed across all deployments in all subscriptions in all regions for a given tenant.
@@ -269,9 +269,9 @@ If your Azure subscription is linked to certain [offer types](https://azure.micr
 |`CSP Integration Sandbox` <sup>*</sup> | All models: 0 |
 |`Lightweight trial`<br>`Free trials`<br>`Azure Pass`| All models: 0 |

-<sup>*</sup>This limit applies to only a small number of legacy CSP sandbox subscriptions. Use the following query to determine what `quotaId` is associated with your subscription.
+<sup>*</sup>This limit applies to only a small number of legacy CSP sandbox subscriptions. Use the following query to determine what `quotaId` value is associated with your subscription.

-To determine the offer type associated with your subscription, you can check your `quotaId`. If your `quotaId` isn't listed in this table, your subscription qualifies for default quota.
+To determine the offer type associated with your subscription, you can check your `quotaId` value. If your `quotaId` value isn't listed in this table, your subscription qualifies for the default quota.

 # [REST](#tab/REST)
@@ -339,9 +339,9 @@ To minimize issues related to rate limits, it's a good idea to use the following

 ## Request quota increases

-Quota increase requests can be submitted via the [quota increase request form](https://aka.ms/oai/stuquotarequest). Due to high demand, quota increase requests are being accepted and filled in the order they're received. Priority is given to customers who generate traffic that consumes the existing quota allocation. Your request might be denied if this condition isn't met.
+Quota increase requests can be submitted via the [quota increase request form](https://aka.ms/oai/stuquotarequest). Due to high demand, quota increase requests are accepted and filled in the order they're received. Priority is given to customers who generate traffic that consumes the existing quota allocation. Your request might be denied if this condition isn't met.

-For other rate limits, [submit a service request](../../ai-services/cognitive-services-support-options.md?context=/azure/ai-foundry/openai/context/context).
+You can [submit a service request](../../ai-services/cognitive-services-support-options.md?context=/azure/ai-foundry/openai/context/context) for other rate limits.