Open
Labels
Complexity: M (1-8 hours), Priority: Low (1-7 days), Type: Enhancement (New feature or request)
Description
The current vCache implementation evicts based on the number of embeddings in the cache. To enable more fine-grained eviction, vCache should also be able to evict based on a target cache size in megabytes (MB).
Impact
- Who: End user
- What: Fine-grained eviction
- Why: Real-world applicability
Proposed Solution
In addition to the existing eviction threshold on the number of embeddings, a user should be able to set a maximum cache size in MB. The MB calculation must include each embedding's metadata, not just the embedding vector.
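A rough sketch of what this could look like, to make the proposal concrete. Everything here is hypothetical and not taken from the vCache codebase: the names `Entry`, `entry_size_bytes`, and `SizeBoundedCache`, the `max_mb` and `max_items` parameters, the assumption of 4 bytes per embedding component (float32), the assumption that metadata is JSON-serializable (its size is estimated from the UTF-8 JSON encoding), and the oldest-first (FIFO) eviction order, which is only a stand-in for whatever policy vCache actually uses.

```python
import json
from collections import OrderedDict
from dataclasses import dataclass, field
from typing import Any, Dict, List, Optional

BYTES_PER_MB = 1024 * 1024


@dataclass
class Entry:
    # Hypothetical cache entry: an embedding plus its metadata.
    embedding: List[float]
    metadata: Dict[str, Any] = field(default_factory=dict)


def entry_size_bytes(entry: Entry) -> int:
    # Assumption: embeddings are stored as float32 (4 bytes per component).
    embedding_bytes = 4 * len(entry.embedding)
    # Assumption: metadata is JSON-serializable; estimate its size from the
    # UTF-8 encoded JSON string so the metadata counts toward the budget.
    metadata_bytes = len(json.dumps(entry.metadata).encode("utf-8"))
    return embedding_bytes + metadata_bytes


class SizeBoundedCache:
    def __init__(self, max_mb: float, max_items: Optional[int] = None):
        self.max_bytes = int(max_mb * BYTES_PER_MB)
        self.max_items = max_items  # existing count-based threshold, optional
        self.total_bytes = 0
        self._entries: "OrderedDict[str, Entry]" = OrderedDict()
        self._sizes: Dict[str, int] = {}

    def put(self, key: str, entry: Entry) -> None:
        if key in self._entries:
            self._remove(key)  # replace: drop the old size first
        size = entry_size_bytes(entry)
        self._entries[key] = entry
        self._sizes[key] = size
        self.total_bytes += size
        self._evict()

    def _remove(self, key: str) -> None:
        del self._entries[key]
        self.total_bytes -= self._sizes.pop(key)

    def _over_limit(self) -> bool:
        over_bytes = self.total_bytes > self.max_bytes
        over_items = (
            self.max_items is not None and len(self._entries) > self.max_items
        )
        return over_bytes or over_items

    def _evict(self) -> None:
        # Evict oldest-inserted entries until both thresholds are satisfied.
        # An entry larger than the whole budget ends up evicting itself.
        while self._entries and self._over_limit():
            oldest_key = next(iter(self._entries))
            self._remove(oldest_key)

    def __contains__(self, key: str) -> bool:
        return key in self._entries

    def __len__(self) -> int:
        return len(self._entries)
```

Keeping a running `total_bytes` avoids re-measuring every entry on each insert. A real implementation would also need to decide how to account for index overhead in the vector store, which this sketch ignores.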
Acceptance Criteria
- Followed coding conventions
- Implemented or updated tests
- Support maximum MB eviction threshold
- MB calculation includes embedding metadata
Risks & Dependencies
--
Additional Context
--