Skip to content

Commit 8c0f5a0

Browse files
authored
feat(genapi): add maximum concurrent requests
1 parent b5dc6d9 commit 8c0f5a0

File tree

1 file changed

+4
-0
lines changed

1 file changed

+4
-0
lines changed

pages/organizations-and-projects/additional-content/organization-quotas.mdx

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -168,6 +168,7 @@ Managed Inference Deployments are limited to a maximum number of nodes, dependin
168168
Generative APIs are rate limited based on:
169169
- Tokens per minute (total input and output tokens)
170170
- Requests per minute
171+
- Concurrent requests (total active HTTP session at the same time)
171172

172173
<Message type="important">
173174
[Contact our support team](https://console.scaleway.com/support/create) if you want to increase your quotas above these limits.
@@ -194,6 +195,9 @@ Generative APIs are rate limited based on:
194195
| qwen2.5-32b-instruct | 300 | 300 |
195196
| bge-multilingual-gemma2 | 300 | 300 |
196197

198+
| Concurrent requests | [Payment method validated](/billing/how-to/add-payment-method/#how-to-add-a-credit-card) | Payment method and [identity validated](/account/how-to/verify-identity/) |
199+
|-------------|:----------------------------------------------------------------------------------------------------------:|:-------------------------------------------------------------:|
200+
| All models | 25 | 25 |
197201

198202
## Apple silicon
199203

0 commit comments

Comments
 (0)