-
Notifications
You must be signed in to change notification settings - Fork 276
Commit 3230c35
Merge candle refactoring 3 (#525)
* Update test description from Math to General (#483)
Signed-off-by: carlory <[email protected]>
* feat: add HuggingChat support (#477)
* add chat ui to dashboard and docker compose & refactor dashboard/backend/
Signed-off-by: JaredforReal <[email protected]>
* try fix network error
Signed-off-by: JaredforReal <[email protected]>
* more
---------
Signed-off-by: JaredforReal <[email protected]>
Co-authored-by: bitliu <[email protected]>
* project: 2025 Q4 roadmap (#487)
* project: q4 roadmap
* project: q4 roadmap
* project: q4 roadmap
* more
* more
* more
* more
* feat: add shelleck precommit hook (#488)
* feat: add shelleck precommit hook
Signed-off-by: yuluo-yx <[email protected]>
* feat: add shelleck precommit hook
Signed-off-by: yuluo-yx <[email protected]>
* feat: add shelleck precommit hook
Signed-off-by: yuluo-yx <[email protected]>
---------
Signed-off-by: yuluo-yx <[email protected]>
* project: add q4 roadmap news (#495)
* fix missing shellcheck in pre-commit image (#497)
Signed-off-by: carlory <[email protected]>
* infra: update tools (#501)
Signed-off-by: yuluo-yx <[email protected]>
* feat(demo): enhance OpenShift demo scripts with improved UX (#478)
- Reduce model selection test to 4 categories (2×Model-A, 2×Model-B)
- Add new "Classification Examples" option calling curl-examples.sh
- Update reasoning examples to avoid cache hits from previous tests
- Remove benign examples from PII and Jailbreak tests (show only attacks)
- Enhance live-semantic-router-logs.sh with better color visibility:
- Fix duplicate "WITH SCORE" text in classification output
- Fix CACHE HIT background color extending over timestamp
- Distinguish reasoning enabled vs disabled messages
- Remove redundant "(standard routing)" text
- Add background colors for Model-A/Model-B routing display
These improvements make the live demo clearer and more impactful for
presentations and demonstrations.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Signed-off-by: Yossi Ovadia <[email protected]>
Co-authored-by: Claude <[email protected]>
* fix: fix precommit Argument list too long error (#502)
Signed-off-by: yuluo-yx <[email protected]>
* feat: enforce milvus dial timeout if set (#503)
Signed-off-by: cryo <[email protected]>
* Add IETF draft publication: Multi-Provider Extensions for Agentic AI Inference APIs (#506)
* Initial plan
* Add new IETF draft publication for Multi-Provider Extensions for Agentic AI Inference APIs
Co-authored-by: rootfs <[email protected]>
---------
Co-authored-by: copilot-swe-agent[bot] <[email protected]>
Co-authored-by: rootfs <[email protected]>
* Allow semantic cache similarity threshold to be set at the category level (#493)
* Initial plan
* Add category-level cache settings: enabled and similarity_threshold
Co-authored-by: rootfs <[email protected]>
* Add comprehensive tests for category-level cache settings
Co-authored-by: rootfs <[email protected]>
* Update config files and documentation for category-level cache settings
- Updated 7 config YAML files (development, production, testing, e2e, and 3 recipes) with commented examples of category-level cache settings
- Added comprehensive documentation section explaining category-level cache configuration
- Updated semantic cache overview and in-memory cache docs with category-level examples
- Added best practices for threshold selection and privacy considerations
Co-authored-by: rootfs <[email protected]>
* Remove duplicate code in FindSimilar functions
Refactored FindSimilar() to delegate to FindSimilarWithThreshold() with default threshold instead of duplicating the entire implementation. This eliminates 226 lines of duplicate code across inmemory_cache.go and milvus_cache.go.
Co-authored-by: rootfs <[email protected]>
* Update src/semantic-router/pkg/extproc/request_handler.go
Co-authored-by: Copilot <[email protected]>
* Revert changes from unsigned commit ae39fe2
Restored the classificationText empty check that was removed in the previous commit.
Co-authored-by: rootfs <[email protected]>
---------
Co-authored-by: copilot-swe-agent[bot] <[email protected]>
Co-authored-by: rootfs <[email protected]>
Co-authored-by: Huamin Chen <[email protected]>
Co-authored-by: Copilot <[email protected]>
* Allow jailbreak detection and threshold to be configured at the category level (#508)
* Initial plan
* Add category-level jailbreak detection configuration
Co-authored-by: Xunzhuo <[email protected]>
* Add documentation for category-level jailbreak settings
Co-authored-by: Xunzhuo <[email protected]>
* Update documentation for category-level jailbreak detection
- Add category-level jailbreak configuration to jailbreak-protection.md
- Update category configuration docs with jailbreak_enabled parameter
- Add security-focused configuration example
- Update global configuration docs with category override notes
- Update README to mention fine-grained security control
Co-authored-by: Xunzhuo <[email protected]>
* Add category-level jailbreak threshold configuration
- Add JailbreakThreshold field to Category struct
- Add GetJailbreakThresholdForCategory helper method
- Create CheckForJailbreakWithThreshold and AnalyzeContentForJailbreakWithThreshold methods
- Update performSecurityChecks to use category-specific threshold
- Add 5 comprehensive tests for threshold configuration
- Update example configs with threshold tuning examples
- Update documentation with threshold configuration and tuning guidelines
- Add threshold tuning guide with recommendations for different category types
Co-authored-by: Xunzhuo <[email protected]>
---------
Co-authored-by: copilot-swe-agent[bot] <[email protected]>
Co-authored-by: Xunzhuo <[email protected]>
* Allow PII detection threshold to be set at the category level (#510)
* Initial plan
* Add category-level PII threshold support
Co-authored-by: Xunzhuo <[email protected]>
* Update documentation with API integration notes
Co-authored-by: Xunzhuo <[email protected]>
* Fix markdown linting issues
Co-authored-by: Xunzhuo <[email protected]>
---------
Co-authored-by: copilot-swe-agent[bot] <[email protected]>
Co-authored-by: Xunzhuo <[email protected]>
* Fix: The caller information points to the wrapper function instead of the actual call location (#518)
Signed-off-by: carlory <[email protected]>
* feat: Implement hybrid cache that use in-memory index and milvus based doc store (#504)
* feat: add HNSW index to inmemory semantic cache and implement hybrid cache that use in-memory index and milvus based doc store
Signed-off-by: Huamin Chen <[email protected]>
* chore: run go mod tidy to clean up module dependencies
Signed-off-by: Huamin Chen <[email protected]>
* conditionally build candle cuda support
Signed-off-by: Huamin Chen <[email protected]>
* rebuild index upon restart
Signed-off-by: Huamin Chen <[email protected]>
* precommit fix
Signed-off-by: Huamin Chen <[email protected]>
* fix precommit
Signed-off-by: Huamin Chen <[email protected]>
* fix precommit
Signed-off-by: Huamin Chen <[email protected]>
* fix precommit
Signed-off-by: Huamin Chen <[email protected]>
* disable cuda build on ci
Signed-off-by: Huamin Chen <[email protected]>
* review feedback
Signed-off-by: Huamin Chen <[email protected]>
* review feedback
Signed-off-by: Huamin Chen <[email protected]>
* review feedback
Signed-off-by: Huamin Chen <[email protected]>
* review feedback
Signed-off-by: Huamin Chen <[email protected]>
---------
Signed-off-by: Huamin Chen <[email protected]>
* merge main to feat branch
Signed-off-by: Huamin Chen <[email protected]>
---------
Signed-off-by: carlory <[email protected]>
Signed-off-by: JaredforReal <[email protected]>
Signed-off-by: yuluo-yx <[email protected]>
Signed-off-by: Yossi Ovadia <[email protected]>
Signed-off-by: cryo <[email protected]>
Signed-off-by: Huamin Chen <[email protected]>
Co-authored-by: 杨朱 · Kiki <[email protected]>
Co-authored-by: Jared <[email protected]>
Co-authored-by: bitliu <[email protected]>
Co-authored-by: shown <[email protected]>
Co-authored-by: Yossi Ovadia <[email protected]>
Co-authored-by: Claude <[email protected]>
Co-authored-by: cryo <[email protected]>
Co-authored-by: Copilot <[email protected]>
Co-authored-by: rootfs <[email protected]>
Co-authored-by: Copilot <[email protected]>
Co-authored-by: Xunzhuo <[email protected]>1 parent b425a6f commit 3230c35Copy full SHA for 3230c35
File tree
Expand file treeCollapse file tree
0 file changed
+0
-0
lines changedOpen diff view settings
Filter options
Expand file treeCollapse file tree
0 file changed
+0
-0
lines changedOpen diff view settings
0 commit comments