Commit 0751ca0

fix(genapi): update troubleshooting code for concurrent requests
1 parent e0a02ec commit 0751ca0

pages/generative-apis/troubleshooting/fixing-common-issues.mdx

Lines changed: 14 additions & 4 deletions
@@ -31,14 +31,24 @@ Below are common issues that you may encounter when using Generative APIs, their
 
 ### Cause
 - You performed too many API requests over a given minute
-- You consumed too many tokens (input and output) with your API requests over a given minute
+- You consumed too many tokens (input and output) with your API requests over a given minute
 
 ### Solution
-- [Ask our support](https://console.scaleway.com/support/tickets/create) to raise your quota
-- Smooth out your API requests rate by limiting the number of API requests you perform in parallel
-- Reduce the size of the input or output tokens processed by your API requests
+- Smooth out your API requests rate by limiting the number of API requests you perform over a given minute so that you remain below your [organization quotas for Generative APIs](https://www.scaleway.com/en/docs/organizations-and-projects/additional-content/organization-quotas/#generative-apis).
+- [Add a payment method](https://www.scaleway.com/en/docs/billing/how-to/add-payment-method/#how-to-add-a-credit-card) and [validate your identity](https://www.scaleway.com/en/docs/account/how-to/verify-identity/) to increase automatically your quotas [based on standard limits](https://www.scaleway.com/en/docs/organizations-and-projects/additional-content/organization-quotas/#generative-apis).
+- [Ask our support](https://console.scaleway.com/support/tickets/create) to raise your quota.
+- Reduce the size of the input or output tokens processed by your API requests.
 - Use [Managed Inference](/managed-inference/), where these quota do not apply (your throughput will be only limited by the amount of Inference Deployment your provision)
 
+## 429: Too Many Requests - You exceeded your current threshold of concurrent requests
+
+### Cause
+- You kept too many API requests opened at the same time (number of HTTP sessions opened in parallel)
+
+### Solution
+- Smooth out your API requests rate by limiting the number of API requests you perform at the same time (eg. requests which did not receive a complete response and are still opened) so that you remain below your [organization quotas for Generative APIs](https://www.scaleway.com/en/docs/organizations-and-projects/additional-content/organization-quotas/#generative-apis).
+- Use [Managed Inference](/managed-inference/), where concurrent request limit do not apply. Note that exceeding the number of concurrent requests your Inference Deployment can handle may impact performance metrics.
+
 
 ## 504: Gateway Timeout
 
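
The new "concurrent requests" section advises keeping the number of simultaneously open requests below the quota. One common client-side way to do that is a semaphore. The sketch below is illustrative only: `fake_request` is a hypothetical stand-in for a real Generative APIs call, and `MAX_CONCURRENT = 4` is an assumed value, not an actual quota.

```python
import asyncio

MAX_CONCURRENT = 4  # assumed value for illustration; use your own quota


async def fake_request(i: int, stats: dict) -> int:
    """Hypothetical stand-in for one API call; records how many are open."""
    stats["in_flight"] += 1
    stats["peak"] = max(stats["peak"], stats["in_flight"])
    await asyncio.sleep(0.01)  # simulates waiting for the complete response
    stats["in_flight"] -= 1
    return i


async def bounded(sem: asyncio.Semaphore, i: int, stats: dict) -> int:
    # The semaphore blocks here once `limit` requests are already open.
    async with sem:
        return await fake_request(i, stats)


async def run_all(n: int, limit: int = MAX_CONCURRENT):
    sem = asyncio.Semaphore(limit)
    stats = {"in_flight": 0, "peak": 0}
    results = await asyncio.gather(*(bounded(sem, i, stats) for i in range(n)))
    return results, stats["peak"]


results, peak = asyncio.run(run_all(20))
```

All 20 tasks are scheduled at once, but at most `MAX_CONCURRENT` of them hold the semaphore at any moment, so the number of open requests never exceeds the limit.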

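
For the per-minute request quota mentioned in the first "Solution" list, a client can pace its calls with a sliding-window limiter. This is a minimal sketch under stated assumptions: the `MinuteLimiter` class and the limit of 2 calls are hypothetical, and it only counts requests, not tokens.

```python
import time
from collections import deque


class MinuteLimiter:
    """Allow at most `max_calls` calls in any `window` seconds (hypothetical helper)."""

    def __init__(self, max_calls, window=60.0, clock=time.monotonic, sleep=time.sleep):
        self.max_calls = max_calls
        self.window = window
        self.clock = clock  # injectable so the limiter can be tested without waiting
        self.sleep = sleep
        self.stamps = deque()  # start times of calls still inside the window

    def _expire(self, now):
        # Forget calls that have left the sliding window.
        while self.stamps and now - self.stamps[0] >= self.window:
            self.stamps.popleft()

    def wait(self):
        """Block until one more call fits in the window, then record it."""
        now = self.clock()
        self._expire(now)
        if len(self.stamps) >= self.max_calls:
            # Sleep until the oldest recorded call leaves the window.
            self.sleep(self.window - (now - self.stamps[0]))
            now = self.clock()
            self._expire(now)
        self.stamps.append(now)


# Demonstration with a fake clock, so nothing actually sleeps.
fake_time = [0.0]
limiter = MinuteLimiter(
    max_calls=2,
    clock=lambda: fake_time[0],
    sleep=lambda s: fake_time.__setitem__(0, fake_time[0] + s),
)
for _ in range(3):
    limiter.wait()  # call the API right after this returns
```

With the default `clock` and `sleep`, calling `limiter.wait()` before each request would delay the request whenever the assumed per-minute budget is already used up.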