Skip to content

Conversation

@yossiovadia
Copy link
Collaborator

All 3 tests now pass successfully with verified classification routing:

  • Category Classification: Math→Model-B, CS→Model-B, Business→Model-A, History→Model-A ✅
  • Classification Consistency: Same query routes to same model ✅
    Release Notes: No

…ssification-test.py

- Fix import path: change from 'tests.test_base' to 'test_base'
- Add missing 'import unittest' statement
- Update DEFAULT_MODEL from 'qwen2.5:32b' to 'Model-A' to match e2e config
- Increase timeout from 10s to 60s to accommodate LLM Katan response times
- Use 'model: auto' to trigger category-based classification routing
- Add 4 comprehensive test cases: math, computer science, business, history
- Add expected_model field to verify correct routing
- Add assertions to verify actual model matches expected model
- Enhance test output to show expected vs actual routing
- Fix metrics test to check for actual exposed metrics (entropy classification, cache)
- Update README to mark 01 and 02 tests as completed with descriptions

All 3 tests now pass successfully with verified classification routing:
- Category Classification: Math→Model-B, CS→Model-B, Business→Model-A, History→Model-A ✅
- Classification Consistency: Same query routes to same model ✅
- Router Metrics: Entropy classification, cache hits/misses tracked ✅

Signed-off-by: Yossi Ovadia <[email protected]>
@yossiovadia yossiovadia requested a review from rootfs as a code owner October 1, 2025 19:38
@netlify
Copy link

netlify bot commented Oct 1, 2025

Deploy Preview for vllm-semantic-router ready!

Name Link
🔨 Latest commit a316521
🔍 Latest deploy log https://app.netlify.com/projects/vllm-semantic-router/deploys/68dd87d51b04da0008122117
😎 Deploy Preview https://deploy-preview-302--vllm-semantic-router.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@github-actions
Copy link

github-actions bot commented Oct 1, 2025

👥 vLLM Semantic Team Notification

The following members have been identified for the changed files in this PR and have been automatically assigned:

📁 e2e-tests

Owners: @yossiovadia
Files changed:

  • e2e-tests/02-router-classification-test.py
  • e2e-tests/README.md

vLLM

🎉 Thanks for your contributions!

This comment was automatically generated based on the OWNER files in the repository.

rootfs
rootfs previously approved these changes Oct 1, 2025
@rootfs
Copy link
Collaborator

rootfs commented Oct 1, 2025

Remove trailing whitespace from lines 239, 297, and 298 to pass
pre-commit checks.

Signed-off-by: Yossi Ovadia <[email protected]>
Add trailing comma after last argument in assertGreater call to comply
with black formatting standards.

Signed-off-by: Yossi Ovadia <[email protected]>
@rootfs rootfs merged commit 5141a00 into vllm-project:main Oct 1, 2025
9 checks passed
Aias00 pushed a commit to Aias00/semantic-router that referenced this pull request Oct 4, 2025
…m-project#302)

* fix: enable and verify router classification testing in 02-router-classification-test.py

- Fix import path: change from 'tests.test_base' to 'test_base'
- Add missing 'import unittest' statement
- Update DEFAULT_MODEL from 'qwen2.5:32b' to 'Model-A' to match e2e config
- Increase timeout from 10s to 60s to accommodate LLM Katan response times
- Use 'model: auto' to trigger category-based classification routing
- Add 4 comprehensive test cases: math, computer science, business, history
- Add expected_model field to verify correct routing
- Add assertions to verify actual model matches expected model
- Enhance test output to show expected vs actual routing
- Fix metrics test to check for actual exposed metrics (entropy classification, cache)
- Update README to mark 01 and 02 tests as completed with descriptions

All 3 tests now pass successfully with verified classification routing:
- Category Classification: Math→Model-B, CS→Model-B, Business→Model-A, History→Model-A ✅
- Classification Consistency: Same query routes to same model ✅
- Router Metrics: Entropy classification, cache hits/misses tracked ✅

Signed-off-by: Yossi Ovadia <[email protected]>

* fix: remove trailing whitespace in 02-router-classification-test.py

Remove trailing whitespace from lines 239, 297, and 298 to pass
pre-commit checks.

Signed-off-by: Yossi Ovadia <[email protected]>

* style: apply black formatter to 02-router-classification-test.py

Add trailing comma after last argument in assertGreater call to comply
with black formatting standards.

Signed-off-by: Yossi Ovadia <[email protected]>

---------

Signed-off-by: Yossi Ovadia <[email protected]>
Signed-off-by: liuhy <[email protected]>
Aias00 pushed a commit to Aias00/semantic-router that referenced this pull request Oct 4, 2025
…m-project#302)

* fix: enable and verify router classification testing in 02-router-classification-test.py

- Fix import path: change from 'tests.test_base' to 'test_base'
- Add missing 'import unittest' statement
- Update DEFAULT_MODEL from 'qwen2.5:32b' to 'Model-A' to match e2e config
- Increase timeout from 10s to 60s to accommodate LLM Katan response times
- Use 'model: auto' to trigger category-based classification routing
- Add 4 comprehensive test cases: math, computer science, business, history
- Add expected_model field to verify correct routing
- Add assertions to verify actual model matches expected model
- Enhance test output to show expected vs actual routing
- Fix metrics test to check for actual exposed metrics (entropy classification, cache)
- Update README to mark 01 and 02 tests as completed with descriptions

All 3 tests now pass successfully with verified classification routing:
- Category Classification: Math→Model-B, CS→Model-B, Business→Model-A, History→Model-A ✅
- Classification Consistency: Same query routes to same model ✅
- Router Metrics: Entropy classification, cache hits/misses tracked ✅

Signed-off-by: Yossi Ovadia <[email protected]>

* fix: remove trailing whitespace in 02-router-classification-test.py

Remove trailing whitespace from lines 239, 297, and 298 to pass
pre-commit checks.

Signed-off-by: Yossi Ovadia <[email protected]>

* style: apply black formatter to 02-router-classification-test.py

Add trailing comma after last argument in assertGreater call to comply
with black formatting standards.

Signed-off-by: Yossi Ovadia <[email protected]>

---------

Signed-off-by: Yossi Ovadia <[email protected]>
Signed-off-by: liuhy <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants