Skip to content

v0.2.0-RC2

Pre-release
Pre-release

Choose a tag to compare

@elevran elevran released this 21 Jul 13:30
· 102 commits to main since this release
v0.2.0-rc.2
eabb332

Image:

ghcr.io/llm-d/llm-d-inference-scheduler:v0.2.0-rc.2

What's Changed

  • bump GIE to v0.5.0-RC3
  • Disable Prefix-Cache-Aware decision, making P/D the default
  • build: change epp-config default manifests` image pull policy
  • Update Prefix-Cache-Scorer cache_tracking mode to use v0.2 KVCache.Indexer

Full Changelog: v0.2.0-RC1...v0.2.0-rc.2