-
Notifications
You must be signed in to change notification settings - Fork 275
metrics: Add request-level token histograms #157
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
metrics: Add request-level token histograms #157
Conversation
✅ Deploy Preview for vllm-semantic-router ready!
To edit notification comments on pull requests, go to your Netlify project configuration. |
👥 vLLM Semantic Team NotificationThe following members have been identified for the changed files in this PR and have been automatically assigned: 📁
|
|
I don’t think the CI error is related to the changes in this PR. |
Signed-off-by: Jintao Zhang <[email protected]>
Signed-off-by: Jintao Zhang <[email protected]>
ad70826 to
15374f0
Compare
* metrics: Add request-level token histograms Signed-off-by: Jintao Zhang <[email protected]> * add unknown const Signed-off-by: Jintao Zhang <[email protected]> --------- Signed-off-by: Jintao Zhang <[email protected]>
* metrics: Add request-level token histograms Signed-off-by: Jintao Zhang <[email protected]> * add unknown const Signed-off-by: Jintao Zhang <[email protected]> --------- Signed-off-by: Jintao Zhang <[email protected]>

What type of PR is this?
metrics: Add request-level token histograms
What this PR does / why we need it:
Add request-level token histograms:
• llm_prompt_tokens_per_request (HistogramVec; labels: model)
• llm_completion_tokens_per_request (HistogramVec; labels: model)
Introduce a Counter for category mix:
• llm_category_classifications_count (CounterVec; labels: category)
Remove existing Gauge
llm_category_classifications_totalWhich issue(s) this PR fixes:
Fixes #123
Release Notes: Yes/No