feat: add milvus persistent storage support #105

rootfs · 2025-09-09T16:39:33Z

What type of PR is this?

This is a work in progress (WIP) that adds persistent storage backends for the semantic cache.

What this PR does / why we need it:

Which issue(s) this PR fixes:

Fixes #94 #95

Release Notes: Yes/No

netlify · 2025-09-09T16:39:38Z

✅ Deploy Preview for vllm-semantic-router ready!

Name	Link
🔨 Latest commit	`c8e9266`
🔍 Latest deploy log	https://app.netlify.com/projects/vllm-semantic-router/deploys/68c20e7676ad2b00087dd040
😎 Deploy Preview	https://deploy-preview-105--vllm-semantic-router.netlify.app

To edit notification comments on pull requests, go to your Netlify project configuration.

github-actions · 2025-09-09T16:41:58Z

👥 vLLM Semantic Team Notification

The following members have been identified for the changed files in this PR and have been automatically assigned:

📁 `config`

Owners: @rootfs
Files changed:

config/cache/milvus.yaml
config/config.yaml

📁 `src`

Owners: @rootfs, @Xunzhuo, @wangchen615
Files changed:

src/semantic-router/pkg/cache/cache_factory.go
src/semantic-router/pkg/cache/cache_interface.go
src/semantic-router/pkg/cache/inmemory_cache.go
src/semantic-router/pkg/cache/milvus_cache.go
src/semantic-router/go.mod
src/semantic-router/go.sum
src/semantic-router/pkg/cache/cache.go
src/semantic-router/pkg/cache/cache_test.go
src/semantic-router/pkg/config/config.go
src/semantic-router/pkg/config/config_test.go
src/semantic-router/pkg/extproc/caching_test.go
src/semantic-router/pkg/extproc/router.go
src/semantic-router/pkg/extproc/test_utils_test.go
src/semantic-router/pkg/metrics/metrics.go

📁 `Root Directory`

Owners: @rootfs, @Xunzhuo
Files changed:

Makefile

🎉 Thanks for your contributions!

This comment was automatically generated based on the OWNER files in the repository.

rootfs · 2025-09-09T17:23:42Z

CI failed because no Milvus instance is running there. For now, the Milvus tests are skipped on CI.
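One way to gate such tests is an opt-in environment variable. The following is a minimal sketch of that idea, not the mechanism used in this PR: the variable name `MILVUS_TESTS_ENABLED` and the helper `milvusTestsEnabled` are hypothetical names chosen for illustration.

```go
package main

import (
	"fmt"
	"os"
)

// milvusTestsEnabled reports whether Milvus-backed tests should run.
// MILVUS_TESTS_ENABLED is a hypothetical variable name used only in this sketch.
// The lookup function is injected so the check can be exercised without
// touching the real process environment.
func milvusTestsEnabled(lookup func(string) (string, bool)) bool {
	v, ok := lookup("MILVUS_TESTS_ENABLED")
	return ok && (v == "1" || v == "true")
}

func main() {
	if !milvusTestsEnabled(os.LookupEnv) {
		fmt.Println("skipping Milvus tests: no Milvus instance configured")
		return
	}
	fmt.Println("running Milvus tests")
}
```

Inside a real Go test, the same check would typically sit at the top of the test function and call `t.Skip` from the standard `testing` package when it returns false.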

rootfs · 2025-09-09T17:51:39Z

@Xunzhuo There are no doc changes in this PR. I'll add docs on how to set up Milvus and in-memory caching in a follow-up PR.

- Create CacheBackend interface with pluggable architecture
- Refactor existing in-memory cache to implement new interface
- Add cache factory pattern for backend selection
- Support configurable similarity thresholds and TTL
- Add comprehensive cache metrics and observability

Addresses vllm-project#94

Signed-off-by: Huamin Chen <[email protected]>
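The interface-plus-factory shape described in this commit message can be sketched as follows. Only the name `CacheBackend` comes from the commit message. The method set, the `inMemoryCache` type, the `NewCacheBackend` function, and the `"memory"` backend string are assumptions made for illustration and may differ from the code in `cache_interface.go` and `cache_factory.go`. For brevity the sketch uses exact-match lookup rather than semantic similarity.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
)

// CacheBackend is named in the commit message; this method set is an assumption.
type CacheBackend interface {
	Add(query, response string) error
	Find(query string) (string, bool)
}

// inMemoryCache is a hypothetical exact-match backend guarded by a mutex.
type inMemoryCache struct {
	mu      sync.RWMutex
	entries map[string]string
}

func (c *inMemoryCache) Add(query, response string) error {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.entries[query] = response
	return nil
}

func (c *inMemoryCache) Find(query string) (string, bool) {
	c.mu.RLock()
	defer c.mu.RUnlock()
	resp, ok := c.entries[query]
	return resp, ok
}

// NewCacheBackend is a hypothetical factory that selects a backend by name.
// A persistent backend would be added as another case.
func NewCacheBackend(backendType string) (CacheBackend, error) {
	switch backendType {
	case "", "memory":
		return &inMemoryCache{entries: map[string]string{}}, nil
	default:
		return nil, errors.New("unsupported cache backend: " + backendType)
	}
}

func main() {
	cache, err := NewCacheBackend("memory")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	_ = cache.Add("what is milvus?", "a vector database")
	resp, ok := cache.Find("what is milvus?")
	fmt.Println(resp, ok)
}
```

Callers depend only on the interface, so switching backends becomes a configuration change rather than a code change.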

- Implement MilvusCache backend with persistent storage
- Add Milvus configuration file and connection management
- Support vector similarity search with configurable indexing
- Add TTL support and collection lifecycle management
- Include Milvus dependencies and build configuration

Addresses vllm-project#95

Signed-off-by: Huamin Chen <[email protected]>

Signed-off-by: Huamin Chen <[email protected]>

rootfs · 2025-09-11T00:02:09Z

Merging this PR now. cc @Xunzhuo @aeft

rootfs requested review from Xunzhuo and wangchen615 as code owners September 9, 2025 16:39

rootfs force-pushed the semcaching branch 2 times, most recently from 3eec72b to 9efcc06 Compare September 9, 2025 16:41

github-actions bot assigned rootfs, wangchen615 and Xunzhuo Sep 9, 2025

Xunzhuo changed the title ~~feat: Semantic Cache Refactoring and Milvus Persistent Storage Support~~ feat: add milvus persistent storage support Sep 10, 2025

rootfs mentioned this pull request Sep 10, 2025

Semantic Cache: Support More Cache Cleanup Operations #109

Closed

rootfs added 5 commits September 10, 2025 23:45

toggle milvus unit test

43a5949

Signed-off-by: Huamin Chen <[email protected]>

pre-commit fix

2fb44f5

Signed-off-by: Huamin Chen <[email protected]>

rebase

c8e9266

Signed-off-by: Huamin Chen <[email protected]>

rootfs force-pushed the semcaching branch from cfbb6dc to c8e9266 Compare September 10, 2025 23:49

rootfs merged commit 3ce8a6e into vllm-project:main Sep 11, 2025
9 checks passed

This was referenced Sep 12, 2025

Semantic Cache Refactoring: Support Milvus VectorDB Backend #95

Closed

docs: add semantic cache doc to explain how to use in-memory and milvus in the config #139

Closed

3 participants
