Skip to content

Commit 9908ae7

Browse files
authored
fix(inference): update genapi quotas (#5422)
1 parent b3eabc3 commit 9908ae7

File tree

1 file changed

+12
-3
lines changed

1 file changed

+12
-3
lines changed

pages/organizations-and-projects/additional-content/organization-quotas.mdx

Lines changed: 12 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -208,11 +208,16 @@ Generative APIs are rate limited based on:
208208
| llama-3.1-8b-instruct | 200k | 200k |
209209
| deepseek-r1-distill-llama-70b | 200k | 200k |
210210
| mistral-small-3.1-24b-instruct-2503 | 200k | 400k |
211+
| mistral-small-3.2-24b-instruct-2506 | 200k | 400k |
211212
| mistral-nemo-instruct-2407 | 200k | 200k |
212213
| pixtral-12b-2409 | 200k | 200k |
213-
| qwen2.5-32b-instruct | 200k | 200k |
214+
| qwen3-235b-a22b-instruct-2507 | 40k | 40k |
215+
| qwen2.5-coder-32b-instruct | 200k | 400k |
216+
| qwen3-coder-30b-a3b-instruct | 200k | 400k |
217+
| gpt-oss-120b | 200k | 400k |
214218
| bge-multilingual-gemma2 | 200k | 200k |
215219

220+
216221
| Requests per minute | [Payment method validated](/billing/how-to/add-payment-method/#how-to-add-a-credit-card) | Payment method and [identity validated](/account/how-to/verify-identity/) |
217222
|-------------|:----------------------------------------------------------------------------------------------------------:|:-------------------------------------------------------------:|
218223
| gemma-3-27b-it | 300 | 600 |
@@ -221,14 +226,18 @@ Generative APIs are rate limited based on:
221226
| llama-3.1-8b-instruct | 300 | 300 |
222227
| deepseek-r1-distill-llama-70b | 300 | 300 |
223228
| mistral-small-3.1-24b-instruct-2503 | 300 | 600 |
229+
| mistral-small-3.2-24b-instruct-2506 | 300 | 600 |
224230
| mistral-nemo-instruct-2407 | 300 | 300 |
225231
| pixtral-12b-2409 | 300 | 300 |
226-
| qwen2.5-32b-instruct | 300 | 300 |
232+
| qwen3-235b-a22b-instruct-2507 | 300 | 300 |
233+
| qwen2.5-coder-32b-instruct | 300 | 600 |
234+
| qwen3-coder-30b-a3b-instruct | 300 | 600 |
235+
| gpt-oss-120b | 300 | 600 |
227236
| bge-multilingual-gemma2 | 300 | 300 |
228237

229238
| Concurrent requests | [Payment method validated](/billing/how-to/add-payment-method/#how-to-add-a-credit-card) | Payment method and [identity validated](/account/how-to/verify-identity/) |
230239
|-------------|:----------------------------------------------------------------------------------------------------------:|:-------------------------------------------------------------:|
231-
| All models | 25 | 25 |
240+
| All models | 50 | 50 |
232241

233242
## Apple silicon
234243

0 commit comments

Comments
 (0)