You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cognitive-services/openai/quotas-limits.md
+9-1Lines changed: 9 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -23,7 +23,7 @@ The following sections provide you with a quick guide to the quotas and limits t
23
23
| Limit Name | Limit Value |
24
24
|--|--|
25
25
| OpenAI resources per region | 2 |
26
-
| Requests per minute per model*| Davinci-models (002 and later): 120 <br> ChatGPT model (preview): 300 <br> GPT-4 models (preview): 12 <br> All other models: 300 |
26
+
| Requests per minute per model*| Davinci-models (002 and later): 120 <br> ChatGPT model (preview): 300 <br> GPT-4 models (preview): 18 <br> All other models: 300 |
27
27
| Tokens per minute per model*| Davinci-models (002 and later): 40,000 <br> ChatGPT model: 120,000 <br> All other models: 120,000 |
28
28
| Max fine-tuned model deployments*| 2 |
29
29
| Ability to deploy same model to multiple deployments | Not allowed |
@@ -37,6 +37,7 @@ The following sections provide you with a quick guide to the quotas and limits t
37
37
38
38
*The limits are subject to change. We anticipate that you will need higher limits as you move toward production and your solution scales. When you know your solution requirements, please reach out to us by applying for a quota increase here: <https://aka.ms/oai/quotaincrease>
39
39
40
+
40
41
For information on max tokens for different models, consult the [models article](./concepts/models.md#model-summary-table-and-region-availability)
41
42
42
43
### General best practices to mitigate throttling during autoscaling
@@ -54,6 +55,13 @@ The next sections describe specific cases of adjusting quotas.
54
55
55
56
If you need to increase the limit, you can apply for a quota increase here: <https://aka.ms/oai/quotaincrease>
56
57
58
+
### How to request an increase to the number of resources per region
59
+
60
+
If you need to increase the number of resources, you can apply for a resource increase here: <https://aka.ms/oai/resourceincrease>
61
+
62
+
> [!NOTE]
63
+
> Ensure that you thoroughly assess your current resource utilization, approaching its full capacity. Be aware that we will not grant additional resources if efficient usage of existing resources is not observed.
64
+
57
65
## Next steps
58
66
59
67
Learn more about the [underlying models that power Azure OpenAI](./concepts/models.md).
0 commit comments