@tao12345666333
Contributor

What type of PR is this?

metrics: Add request-level token histograms

What this PR does / why we need it:

Add request-level token histograms:
• llm_prompt_tokens_per_request (HistogramVec; labels: model)
• llm_completion_tokens_per_request (HistogramVec; labels: model)

Introduce a Counter for category mix:
• llm_category_classifications_count (CounterVec; labels: category)

Remove the existing Gauge llm_category_classifications_total, which the new Counter replaces.
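For reviewers less familiar with these metric types, here is a minimal, stdlib-only sketch of what the new metrics record. It is not the code in this PR: it only mimics the behaviour of a Prometheus HistogramVec (cumulative buckets per `model` label) and a CounterVec (a monotonically increasing count per `category` label). The type names, the bucket boundaries, and the model and category values are all assumptions made for illustration.

```go
package main

import (
	"fmt"
	"sort"
)

// tokenHistogram mimics a Prometheus-style histogram for one label value.
// counts[i] is the number of observations <= bounds[i] (cumulative buckets).
type tokenHistogram struct {
	bounds []float64 // bucket upper bounds, ascending
	counts []uint64  // cumulative counts, one per bound
	sum    float64   // sum of all observed values
	total  uint64    // number of observations (the implicit +Inf bucket)
}

// histogramVec keeps one histogram per model label (hypothetical type).
type histogramVec struct {
	bounds  []float64
	byModel map[string]*tokenHistogram
}

func newHistogramVec(bounds []float64) *histogramVec {
	b := append([]float64(nil), bounds...)
	sort.Float64s(b)
	return &histogramVec{bounds: b, byModel: map[string]*tokenHistogram{}}
}

// observe records the token count of a single request for the given model.
func (v *histogramVec) observe(model string, tokens float64) {
	h, ok := v.byModel[model]
	if !ok {
		h = &tokenHistogram{bounds: v.bounds, counts: make([]uint64, len(v.bounds))}
		v.byModel[model] = h
	}
	for i, ub := range h.bounds {
		if tokens <= ub {
			h.counts[i]++
		}
	}
	h.sum += tokens
	h.total++
}

// categoryCounter mimics a CounterVec: it only ever increases, per category.
type categoryCounter map[string]uint64

func (c categoryCounter) inc(category string) { c[category]++ }

func main() {
	// Bucket boundaries are assumed values, not the ones used in the PR.
	prompt := newHistogramVec([]float64{16, 64, 256, 1024})
	categories := categoryCounter{}

	prompt.observe("model-a", 12)
	prompt.observe("model-a", 300)
	prompt.observe("model-b", 64)
	categories.inc("math")
	categories.inc("math")
	categories.inc("unknown")

	h := prompt.byModel["model-a"]
	fmt.Println(h.counts, h.total)                         // prints "[1 1 1 2] 2"
	fmt.Println(categories["math"], categories["unknown"]) // prints "2 1"
}
```

A counter fits the category mix better than a gauge because classifications only accumulate; rates and per-category shares can then be derived at query time from the increasing totals.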

Which issue(s) this PR fixes:

Fixes #123

Release Notes: Yes/No

@netlify

netlify bot commented Sep 17, 2025

Deploy Preview for vllm-semantic-router ready!

Name Link
🔨 Latest commit 15374f0
🔍 Latest deploy log https://app.netlify.com/projects/vllm-semantic-router/deploys/68cb798fb86db800081cb4bf
😎 Deploy Preview https://deploy-preview-157--vllm-semantic-router.netlify.app

@github-actions

github-actions bot commented Sep 17, 2025

👥 vLLM Semantic Team Notification

The following members have been identified for the changed files in this PR and have been automatically assigned:

📁 src

Owners: @rootfs, @Xunzhuo, @wangchen615
Files changed:

  • src/semantic-router/pkg/consts/consts.go
  • src/semantic-router/pkg/extproc/metrics_integration_test.go
  • src/semantic-router/pkg/extproc/reason_mode_selector.go
  • src/semantic-router/pkg/metrics/metrics.go

📁 deploy

Owners: @rootfs, @Xunzhuo
Files changed:

  • deploy/llm-router-dashboard.json

🎉 Thanks for your contributions!

This comment was automatically generated based on the OWNER files in the repository.

@tao12345666333
Contributor Author

I don’t think the CI error is related to the changes in this PR.
Could you help me rerun the CI? Thanks. @Xunzhuo

@tao12345666333 force-pushed the o11y-metrics-token-histograms-category branch from ad70826 to 15374f0 on September 18, 2025 03:16
@rootfs merged commit 9f41254 into vllm-project:main on Sep 18, 2025
9 checks passed
yossiovadia pushed a commit to yossiovadia/semantic-router that referenced this pull request Sep 22, 2025
* metrics: Add request-level token histograms

Signed-off-by: Jintao Zhang <[email protected]>

* add unknown const

Signed-off-by: Jintao Zhang <[email protected]>

---------

Signed-off-by: Jintao Zhang <[email protected]>
yossiovadia pushed a commit to yossiovadia/semantic-router that referenced this pull request Oct 8, 2025
Development

Successfully merging this pull request may close these issues.

Metrics: Prompt/completion length histograms and category mix counters