Skip to content

Commit a6038e0

Browse files
fpagnybene2k1nerda-codes
authored andcommitted
Update rate-limits.mdx (scaleway#4229)
* Update rate-limits.mdx Update embeddings related quotas. Add detail about Free Tier quotas limitations. * Apply suggestions from code review Co-authored-by: Néda <[email protected]> --------- Co-authored-by: Benedikt Rollik <[email protected]> Co-authored-by: Néda <[email protected]>
1 parent bce4893 commit a6038e0

File tree

1 file changed

+9
-5
lines changed

1 file changed

+9
-5
lines changed

ai-data/generative-apis/reference-content/rate-limits.mdx

Lines changed: 9 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -17,9 +17,13 @@ Any model served through Scaleway Generative APIs gets limited by:
1717
- Tokens per minute
1818
- Queries per minute
1919

20+
<Message type="tip">
21+
These limits only apply if you created a Scaleway Account and registered a valid payment method. Otherwise, stricter limits apply to ensure usage stays within Free Tier only.
22+
</Message>
23+
2024
### Chat models
2125

22-
| Model string | Requests per minute | Tokens per minute |
26+
| Model string | Requests per minute | Total tokens per minute |
2327
|-----------------|-----------------|-----------------|
2428
| `llama-3.1-8b-instruct` | 300 | 100K |
2529
| `llama-3.1-70b-instruct` | 300 | 100K |
@@ -29,10 +33,10 @@ Any model served through Scaleway Generative APIs gets limited by:
2933

3034
### Embedding models
3135

32-
| Model string | Requests per minute | Tokens per minute |
36+
| Model string | Requests per minute | Input tokens per minute |
3337
|-----------------|-----------------|-----------------|
34-
| `sentence-t5-xxl` | 600 | 1M |
35-
| `bge-multilingual-gemma2` | 600 | 1M |
38+
| `sentence-t5-xxl` | 100 | 200K |
39+
| `bge-multilingual-gemma2` | 100 | 200K |
3640

3741
## Why do we set rate limits?
3842

@@ -41,4 +45,4 @@ These limits safeguard against abuse or misuse of Scaleway Generative APIs, help
4145
## How can I increase the rate limits?
4246

4347
We actively monitor usage and will improve rates based on feedback.
44-
If you need to increase your rate limits, contact us via the support team, providing details on the model used and specific use case.
48+
If you need to increase your rate limits, contact us via the support team, providing details on the model used and specific use case.

0 commit comments

Comments
 (0)