You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat: enable E2E testing with LLM Katan - 00-client-request-test (vllm-project#290)
* feat: enable E2E testing with LLM Katan and fix configuration
- Remove Ollama dependencies from E2E config as requested
- Update config.e2e.yaml to use only LLM Katan models (Qwen/Qwen2-0.5B-Instruct, TinyLlama/TinyLlama-1.1B-Chat-v1.0)
- Fix bash 3.2 compatibility in start-llm-katan.sh (replace associative arrays)
- Add required use_reasoning fields to all model entries for validation
- Fix zero scores in model configurations (0.0 → 0.1)
Testing Status:
- ✅ Router: Successfully starts with E2E config (ExtProc on :50051, API on :8080)
- ✅ LLM Katan: Running on ports 8000/8001 with correct model mapping
- ✅ Envoy: Running on port 8801
- ✅ Test: 00-client-request-test.py passes with 200 OK responses
- ✅ Pipeline: Full end-to-end flow working (Client → Envoy → ExtProc → LLM Katan)
🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
Signed-off-by: Yossi Ovadia <[email protected]>
* fix: apply pre-commit formatting fixes
Apply black and isort formatting to LLM Katan Python files
as required by pre-commit hooks.
🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
Signed-off-by: Yossi Ovadia <[email protected]>
* refactor: simplify model names to Model-A and Model-B for E2E testing
- Update LLM Katan configuration to use simplified model names
- Simplify 00-client-request-test.py to use Model-A as default
- Update documentation to reflect math → Model-B, creative → Model-A routing
- Improve test readability and maintainability
🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
Signed-off-by: Yossi Ovadia <[email protected]>
* fix: apply pre-commit formatting fixes
- Fix markdown linting issues in CLAUDE.md files
- Apply black formatting to Python files
🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
Signed-off-by: Yossi Ovadia <[email protected]>
---------
Signed-off-by: Yossi Ovadia <[email protected]>
Co-authored-by: Claude <[email protected]>
Signed-off-by: liuhy <[email protected]>
0 commit comments