Skip to content

Commit c7aeb1b

Browse files
authored
feat(genapi): add faq about token consumption monitoring
1 parent d9d9030 commit c7aeb1b

File tree

1 file changed

+22
-0
lines changed

1 file changed

+22
-0
lines changed

pages/generative-apis/faq.mdx

Lines changed: 22 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -29,6 +29,28 @@ Note that:
2929
- Cockpits are isolated by Projects, hence you first need to select the right project in the Scaleway console before accessing Cockpit to see your token consumption for this Project (you can see the `project_id` in the Cockpit URL: `https://{project_id}.dashboard.obs.fr-par.scw.cloud/`.
3030
- Cockpit graphs can take up to 1 hour to update token consumption, see [Troubleshooting](https://www.scaleway.com/en/docs/generative-apis/troubleshooting/fixing-common-issues/#tokens-consumption-is-not-displayed-in-cockpit-metrics) for further details.
3131

32+
## How can I give access to token consumption to my users outside of Scaleway?
33+
If your users do not have a Scaleway account, you can still give them access to their Generative API usage consumption by either:
34+
- Providing them an access to Grafana inside [Cockpit](https://console.scaleway.com/cockpit/overview). You can create dedicated [Grafana users](https://console.scaleway.com/cockpit/users) (unrelated to Scaleway IAM currently), with read-only accesses (**Viewer** Role). Note that these users will still have access to all other Cockpit dashboards for this project.
35+
- Collecting consumption data from [Billing API](https://www.scaleway.com/en/developers/api/billing/#path-consumption-get-monthly-consumption) and expose it to your users. Consumption can be detailed by projects.
36+
- Collecting consumption data from [Cockpit Data Source](https://console.scaleway.com/cockpit/dataSource) and expose it to your users. As an example, you can query consumption using the following query:
37+
```curl
38+
curl -G 'https://{data-source-id}.metrics.cockpit.fr-par.scw.cloud/prometheus/api/v1/query_range' \
39+
--data-urlencode 'query=generative_apis_tokens_total{resource_name=~".*",type=~"(input_tokens|output_tokens)"}' \
40+
--data-urlencode 'start=2025-03-15T20:10:51.781Z' \
41+
--data-urlencode 'end=2025-03-20T20:10:51.781Z' \
42+
--data-urlencode 'step=1h' \
43+
-H "Authorization: Bearer $COCKPIT_TOKEN" | jq
44+
```
45+
where:
46+
- `data-source-id` is the id of your [Scaleway Metrics data source](https://console.scaleway.com/cockpit/dataSource)
47+
- $COCKPIT_TOKEN is an environment variable storing your [Cockpit Token](https://console.scaleway.com/cockpit/tokens)
48+
49+
You can see your token consumption in [Scaleway Cockpit](/cockpit/). You can access it from the Scaleway console under the [Metrics tab](https://console.scaleway.com/generative-api/metrics).
50+
Note that:
51+
- Cockpits are isolated by Projects, hence you first need to select the right project in the Scaleway console before accessing Cockpit to see your token consumption for this Project (you can see the `project_id` in the Cockpit URL: `https://{project_id}.dashboard.obs.fr-par.scw.cloud/`.
52+
- Cockpit graphs can take up to 1 hour to update token consumption, see [Troubleshooting](https://www.scaleway.com/en/docs/generative-apis/troubleshooting/fixing-common-issues/#tokens-consumption-is-not-displayed-in-cockpit-metrics) for further details.
53+
3254
## Can I configure a maximum billing threshold?
3355
Currently, you cannot configure a specific threshold after which your usage will blocked. However:
3456
- You can [configure billing alerts](/billing/how-to/use-billing-alerts/) to ensure you are warned when you hit specific budget thresholds.

0 commit comments

Comments
 (0)