From 2431b3ee146fdef1631385912b16f5d62b89b727 Mon Sep 17 00:00:00 2001
From: fpagny
Date: Thu, 6 Mar 2025 17:41:54 +0100
Subject: [PATCH 1/2] feat(genapi): update generative-apis.mdx

---
 faq/generative-apis.mdx | 6 ++++++
 1 file changed, 6 insertions(+)

diff --git a/faq/generative-apis.mdx b/faq/generative-apis.mdx
index 592fed96c2..9c5c064136 100644
--- a/faq/generative-apis.mdx
+++ b/faq/generative-apis.mdx
@@ -29,6 +29,12 @@ Note that:
 - Cockpits are isolated by Projects, hence you first need to select the right project in the Scaleway console before accessing Cockpit to see your token consumption for this Project (you can see the `project_id` in the Cockpit URL: `https://{project_id}.dashboard.obs.fr-par.scw.cloud/`.
 - Cockpit graphs can take up to 1 hour to update token consumption, see [Troubleshooting](https://www.scaleway.com/en/docs/generative-apis/troubleshooting/fixing-common-issues/#tokens-consumption-is-not-displayed-in-cockpit-metrics) for further details.
 
+## Can I configure a maximum billing threshold?
+Currently, you cannot configure a specific threshold after which your usage will blocked. However:
+- You can [configure billing alerts](https://www.scaleway.com/en/docs/billing/how-to/use-billing-alerts/) to ensure you are warned when you hit specific budget thresholds.
+- Your total billing remains limited by the amount of tokens you can consume within [rate limits](https://www.scaleway.com/en/docs/generative-apis/reference-content/rate-limits/).
+- If you want to ensure a fixed billing, you can use [Managed Inference](https://www.scaleway.com/en/inference/), which provides the same set of OpenAI compatible APIs and a wider range of models.
+
 ## How can I access and use the Generative APIs?
 Access is open to all Scaleway customers. You can start by using the Generative APIs Playground in the Scaleway console to experiment with different models. For integration into applications, you can use the OpenAI-compatible APIs provided by Scaleway. Detailed instructions are available in our [Quickstart guide](/generative-apis/quickstart/).
 

From c60f98ae06ebfe6fba12948ef95cc6773b4b28f4 Mon Sep 17 00:00:00 2001
From: Benedikt Rollik
Date: Fri, 7 Mar 2025 10:34:24 +0100
Subject: [PATCH 2/2] Apply suggestions from code review

Co-authored-by: Rowena Jones <36301604+RoRoJ@users.noreply.github.com>
---
 faq/generative-apis.mdx | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/faq/generative-apis.mdx b/faq/generative-apis.mdx
index 9c5c064136..c24f8ff94f 100644
--- a/faq/generative-apis.mdx
+++ b/faq/generative-apis.mdx
@@ -31,9 +31,9 @@ Note that:
 
 ## Can I configure a maximum billing threshold?
 Currently, you cannot configure a specific threshold after which your usage will blocked. However:
-- You can [configure billing alerts](https://www.scaleway.com/en/docs/billing/how-to/use-billing-alerts/) to ensure you are warned when you hit specific budget thresholds.
-- Your total billing remains limited by the amount of tokens you can consume within [rate limits](https://www.scaleway.com/en/docs/generative-apis/reference-content/rate-limits/).
-- If you want to ensure a fixed billing, you can use [Managed Inference](https://www.scaleway.com/en/inference/), which provides the same set of OpenAI compatible APIs and a wider range of models.
+- You can [configure billing alerts](/billing/how-to/use-billing-alerts/) to ensure you are warned when you hit specific budget thresholds.
+- Your total billing remains limited by the amount of tokens you can consume within [rate limits](/generative-apis/reference-content/rate-limits/).
+- If you want to ensure fixed billing, you can use [Managed Inference](https://www.scaleway.com/en/inference/), which provides the same set of OpenAI-compatible APIs and a wider range of models.
 
 ## How can I access and use the Generative APIs?
 Access is open to all Scaleway customers. You can start by using the Generative APIs Playground in the Scaleway console to experiment with different models. For integration into applications, you can use the OpenAI-compatible APIs provided by Scaleway. Detailed instructions are available in our [Quickstart guide](/generative-apis/quickstart/).