You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
| Requests per minute |Flux-Pro 1.1<br />Flux.1-Kontext Pro<br /> | 2 capacity units (6 requests per minute) |
44
-
| Tokens per minute | Rest of models | 400,000 |
45
-
| Requests per minute | Rest of models | 1,000 |
46
-
| Concurrent requests | Rest of models | 300 |
47
-
48
-
For Azure OpenAI quota increase request, use [request a quota increase](https://customervoice.microsoft.com/Pages/ResponsePage.aspx?id=v4j5cvGGr0GRqy180BHbR4xPXO648sJKt4GoXAed-0pUMFE1Rk9CU084RjA0TUlVSUlMWEQzVkJDNCQlQCN0PWcu) to submit your request. For other models, You can [request increases to the default limits](#request-increases-to-the-default-limits). Due to high demand, limit increase requests can be submitted and are evaluated per request.
33
+
The following table lists limits for Foundry Models for the following rates:
34
+
35
+
- Tokens per minute
36
+
- Requests per minute
37
+
- Concurrent request
38
+
39
+
| Models | Tokens per minute | Requests per minute | Concurrent requests |
| Azure OpenAI models | Varies per model and SKU. See [limits for Azure OpenAI](../openai/quotas-limits.md). | Varies per model and SKU. See [limits for Azure OpenAI](../openai/quotas-limits.md). | not applicable |
| Flux-Pro 1.1<br />Flux.1-Kontext Pro | not applicable | 2 capacity units (6 requests per minute) | not applicable |
45
+
| Rest of models | 400,000 | 1,000 | 300 |
46
+
47
+
To increase your quota:
48
+
49
+
- For Azure OpenAI, use [Azure AI Foundry Service: Request for Quota Increase](https://customervoice.microsoft.com/Pages/ResponsePage.aspx?id=v4j5cvGGr0GRqy180BHbR4xPXO648sJKt4GoXAed-0pUMFE1Rk9CU084RjA0TUlVSUlMWEQzVkJDNCQlQCN0PWcu) to submit your request.
50
+
- For other models, see [request increases to the default limits](#request-increases-to-the-default-limits).
51
+
52
+
Due to high demand, we evaluate limit increase requests per request.
0 commit comments