
Conversation

@yossiovadia
Collaborator

Summary

This PR fixes the PII API endpoint to return actual confidence scores from the LoRA model instead of hardcoded 0.9 values.

Fixes #717

Changes

  1. Added ClassifyPIIWithDetails() method - New classifier method that returns full entity details, including actual confidence scores (see the sketch after this list)
  2. Updated DetectPII() service - Now calls ClassifyPIIWithDetails() instead of ClassifyPII() so it can access the real confidence values
  3. Updated config.e2e.yaml - Configured the LoRA PII model and added a default catch-all decision for E2E testing
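
For orientation, here is a minimal sketch of the shape changes 1 and 2 suggest on the classifier side. The receiver type, struct fields, and signature below are assumptions for illustration, not the actual code in classifier.go.

```go
// Hypothetical sketch of the new classifier return type and method signature.
// Names and fields are illustrative assumptions, not the repository's code.
package classification

// PIIDetection carries full per-entity detail, including the model's actual
// confidence score rather than a hardcoded 0.9.
type PIIDetection struct {
	EntityType string  // e.g. "B-EMAIL_ADDRESS", "B-PERSON"
	Confidence float64 // raw score from the LoRA PII model, e.g. 0.9869
}

// Classifier stands in for the real classifier type.
type Classifier struct{}

// ClassifyPIIWithDetails returns one PIIDetection per entity found in text,
// preserving each entity's real confidence score. The existing ClassifyPII()
// (entity types only) remains available for current callers.
func (c *Classifier) ClassifyPIIWithDetails(text string) ([]PIIDetection, error) {
	// Model inference elided in this sketch: run the LoRA PII token
	// classifier and map its output into []PIIDetection.
	return []PIIDetection{}, nil
}
```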

Before vs After

Before (Hardcoded)

```json
{
  "entities": [
    {"type": "B-EMAIL_ADDRESS", "confidence": 0.9},
    {"type": "B-PERSON", "confidence": 0.9}
  ]
}
```

After (Actual Scores)

```json
{
  "entities": [
    {"type": "B-EMAIL_ADDRESS", "confidence": 0.9869},
    {"type": "B-PERSON", "confidence": 0.9991}
  ]
}
```

Test Results

Local testing shows the API now correctly returns actual model confidence scores:

| Test Case       | Before          | After                         |
|-----------------|-----------------|-------------------------------|
| Email detection | 0.9 (hardcoded) | 0.9869 (actual)               |
| SSN detection   | 0.9 (hardcoded) | 0.9916, 0.8326 (actual)       |
| Multiple PII    | 0.9 (hardcoded) | 0.7734-0.9991 range (actual)  |

Technical Details

The underlying LoRA PII model was already producing accurate confidence scores (fixed in #709), but the API layer was discarding them. This change passes through the actual scores to API consumers.
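
As a rough illustration of that pass-through at the service layer, here is a sketch; the type names, fields, and the classifier interface are assumptions for illustration, not the actual classification.go.

```go
// Hypothetical sketch of the service-layer pass-through; type and field
// names are assumptions for illustration.
package services

// PIIDetection mirrors the classifier-side sketch: entity type plus the
// model's real confidence score.
type PIIDetection struct {
	EntityType string
	Confidence float64
}

// piiClassifier abstracts the new classifier method used by the service.
type piiClassifier interface {
	ClassifyPIIWithDetails(text string) ([]PIIDetection, error)
}

// PIIEntity and PIIResponse model the API payload shown above.
type PIIEntity struct {
	Type       string  `json:"type"`
	Confidence float64 `json:"confidence"`
}

type PIIResponse struct {
	Entities []PIIEntity `json:"entities"`
}

type Service struct {
	classifier piiClassifier
}

// DetectPII consumes the detailed classifier results, so each entity in the
// API response carries the model's actual confidence instead of 0.9.
func (s *Service) DetectPII(text string) (*PIIResponse, error) {
	detections, err := s.classifier.ClassifyPIIWithDetails(text)
	if err != nil {
		return nil, err
	}
	resp := &PIIResponse{Entities: make([]PIIEntity, 0, len(detections))}
	for _, d := range detections {
		resp.Entities = append(resp.Entities, PIIEntity{
			Type:       d.EntityType,
			Confidence: d.Confidence, // real model score, no longer hardcoded
		})
	}
	return resp, nil
}
```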

The new ClassifyPIIWithDetails() method returns []PIIDetection with full entity information, while maintaining backward compatibility by keeping the existing ClassifyPII() method for callers that only need entity types.
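
One plausible way to keep ClassifyPII() backward compatible is to implement it as a thin wrapper over the detailed method; this is a guess at the approach, not the actual diff.

```go
// Hypothetical sketch: the entity-types-only ClassifyPII() delegating to
// ClassifyPIIWithDetails(), reusing the types from the earlier sketch.
package classification

type PIIDetection struct {
	EntityType string
	Confidence float64
}

type Classifier struct{}

func (c *Classifier) ClassifyPIIWithDetails(text string) ([]PIIDetection, error) {
	// Model inference elided; see the earlier sketch.
	return []PIIDetection{}, nil
}

// ClassifyPII keeps its original return type for existing callers by
// dropping everything except the entity types.
func (c *Classifier) ClassifyPII(text string) ([]string, error) {
	detections, err := c.ClassifyPIIWithDetails(text)
	if err != nil {
		return nil, err
	}
	entityTypes := make([]string, 0, len(detections))
	for _, d := range detections {
		entityTypes = append(entityTypes, d.EntityType)
	}
	return entityTypes, nil
}
```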

Testing

  • Local testing with make run-router-e2e
  • Manual API testing with curl (see the sketch after this list)
  • Pre-commit checks passed
  • DCO sign-off added
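
For reference, a small illustrative check of the kind used when eyeballing curl output: decode a response of the shape shown above and confirm the confidences are no longer uniformly 0.9. The response body here is the "After" example from this PR; the check itself is not part of the change.

```go
// Illustrative check (not from the PR): decode a PII API response of the
// shape shown above and confirm confidences are no longer all 0.9.
package main

import (
	"encoding/json"
	"fmt"
	"log"
)

type piiEntity struct {
	Type       string  `json:"type"`
	Confidence float64 `json:"confidence"`
}

type piiResponse struct {
	Entities []piiEntity `json:"entities"`
}

func main() {
	// In manual testing this body would come from a curl call against the
	// router's PII endpoint; here it is the "After" example from this PR.
	body := []byte(`{"entities":[
		{"type":"B-EMAIL_ADDRESS","confidence":0.9869},
		{"type":"B-PERSON","confidence":0.9991}]}`)

	var resp piiResponse
	if err := json.Unmarshal(body, &resp); err != nil {
		log.Fatal(err)
	}
	for _, e := range resp.Entities {
		if e.Confidence == 0.9 {
			fmt.Printf("%s: still looks hardcoded (0.9)\n", e.Type)
			continue
		}
		fmt.Printf("%s: actual model score %.4f\n", e.Type, e.Confidence)
	}
}
```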

Fixes vllm-project#717

Changes:
- Add ClassifyPIIWithDetails() method to classifier that returns full
  entity details including actual confidence scores
- Update DetectPII() service to use ClassifyPIIWithDetails() instead
  of ClassifyPII() to access real confidence values
- Update config.e2e.yaml to use LoRA PII model and add default
  catch-all decision for E2E testing

Before: API returned hardcoded confidence=0.9 for all PII entities
After: API returns actual model confidence scores (0.77-0.99 range)

Test results:
- Email detection: 0.9869 (was 0.9)
- SSN detection: 0.9916, 0.8326 (was 0.9)
- Multiple PII: 0.7734-0.9991 range (was all 0.9)

The underlying LoRA PII model was already producing accurate confidence
scores (fixed in vllm-project#709), but the API layer was discarding them. This
change passes through the actual scores to API consumers.

Signed-off-by: Yossi Ovadia <[email protected]>
@netlify

netlify bot commented Nov 22, 2025

Deploy Preview for vllm-semantic-router ready!

| Name | Link |
|------|------|
| 🔨 Latest commit | f63cf1e |
| 🔍 Latest deploy log | https://app.netlify.com/projects/vllm-semantic-router/deploys/692117dc83299f0008e4c278 |
| 😎 Deploy Preview | https://deploy-preview-718--vllm-semantic-router.netlify.app |

To edit notification comments on pull requests, go to your Netlify project configuration.

@github-actions

👥 vLLM Semantic Team Notification

The following members have been identified for the changed files in this PR and have been automatically assigned:

📁 config

Owners: @rootfs, @Xunzhuo
Files changed:

  • config/testing/config.e2e.yaml

📁 src

Owners: @rootfs, @Xunzhuo, @wangchen615
Files changed:

  • src/semantic-router/pkg/classification/classifier.go
  • src/semantic-router/pkg/services/classification.go


🎉 Thanks for your contributions!

This comment was automatically generated based on the OWNER files in the repository.

@github-actions github-actions bot deleted a comment from codecov-commenter Nov 22, 2025


Development

Successfully merging this pull request may close these issues.

PII API endpoint hardcodes confidence to 0.9 instead of using actual model scores
