Releases: llm-d/llm-d-inference-scheduler

v0.3.2

09 Oct 00:08

In addition to the changes below, these patches include fixes to the kv-cache-manager dependency.

What's Changed

New Contributors

Full Changelog: v0.2.1...v0.3.2

v0.3.2-rc.1

03 Oct 18:51
Pre-release

Small fixes in kv-cache-manager required dependency updates.

v0.3.1

29 Sep 20:52

Small patch that updates the kv-cache-manager dependency to include support in v0.3.

See the full v0.3 changes here:

What's Changed

New Contributors

Full Changelog: v0.2.1...v0.3.1

v0.3.1-rc.1

26 Sep 02:03
Pre-release

Full Changelog: v0.3.0...v0.3.1-rc.1

v0.3.0

24 Sep 19:30
1889019

Image pull example: docker pull ghcr.io/llm-d/llm-d-inference-scheduler:v0.3.0

What's Changed

New Contributors

Full Changelog: v0.2.1...v0.3.0

v0.3.0-rc.2

17 Sep 11:06
1889019
Pre-release

Image is available here: docker pull ghcr.io/llm-d/llm-d-inference-scheduler:v0.3.0-rc.2

v0.3.0-rc.1

05 Sep 07:49
92619ae
Pre-release

Image is available here: docker pull ghcr.io/llm-d/llm-d-inference-scheduler:v0.3.0-rc.1

What's Changed

New Contributors

Full Changelog: v0.2.0-rc.2...v0.3.0-rc.1

v0.2.1

24 Jul 05:58
c97e2ea

Image is available here: docker pull ghcr.io/llm-d/llm-d-inference-scheduler:v0.2.1

This patch release is intended to resolve a few bugs.
Justification & breakdown here: kubernetes-sigs/gateway-api-inference-extension#1215

Full Changelog: v0.2.0...v0.2.1

v0.2.1-rc.1

23 Jul 08:41
4a86b36
Pre-release

Image is available here: docker pull ghcr.io/llm-d/llm-d-inference-scheduler:v0.2.1-rc.1

This patch release is intended to resolve a few bugs.
Justification & breakdown here: kubernetes-sigs/gateway-api-inference-extension#1215

v0.2.0

21 Jul 19:48
f3e2779

Overview

Image is available here: docker pull ghcr.io/llm-d/llm-d-inference-scheduler:v0.2.0

Major Highlights

  • Enhanced Scheduling Framework: the scheduling framework now includes additional extension points and a well-defined mechanism for communication between scheduler plugins.

  • New Config API: a new Config API allows plugins to be configured through a config file without touching core code.

  • Helm Charts: the Helm chart was updated to make reusing the Config API easy.

  • Plugin Improvements: improved several plugin implementations and consolidated some of the GIE and llm-d plugins into a single robust implementation.
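
As a rough illustration of the Config API idea in the highlights above (plugins selected and parameterized from configuration rather than in core code), here is a minimal Python sketch. Everything in it is hypothetical: the plugin type names, the `weight` and `parameters` keys, the endpoint fields, and the registry itself are invented for illustration and are not the project's actual API or config schema.

```python
# Hypothetical registry mapping a plugin type name to a factory.
# This is NOT the llm-d or GIE API; it only illustrates config-driven plugins.
PLUGIN_FACTORIES = {}


def register(type_name):
    """Decorator that records a plugin factory under a type name."""
    def decorator(factory):
        PLUGIN_FACTORIES[type_name] = factory
        return factory
    return decorator


@register("queue-scorer")  # hypothetical plugin type
def make_queue_scorer(params):
    max_queue = params.get("maxQueue", 10)

    def score(endpoint):
        # Fewer queued requests gives a higher score, clamped at 0.
        return max(0.0, 1.0 - endpoint["queue"] / max_queue)
    return score


@register("prefix-scorer")  # hypothetical plugin type
def make_prefix_scorer(params):
    def score(endpoint):
        # Use the endpoint's (assumed) prefix-cache hit ratio directly.
        return endpoint.get("prefix_hit", 0.0)
    return score


def build_scheduler(config):
    """Instantiate the plugins named in the config and return a picker."""
    weighted = []
    for entry in config["plugins"]:
        factory = PLUGIN_FACTORIES[entry["type"]]
        scorer = factory(entry.get("parameters", {}))
        weighted.append((entry.get("weight", 1.0), scorer))

    def pick(endpoints):
        # Choose the endpoint with the highest weighted score sum.
        return max(endpoints, key=lambda e: sum(w * s(e) for w, s in weighted))
    return pick


# In practice this dict would be loaded from a config file.
CONFIG = {
    "plugins": [
        {"type": "queue-scorer", "weight": 1.0, "parameters": {"maxQueue": 10}},
        {"type": "prefix-scorer", "weight": 2.0},
    ]
}

ENDPOINTS = [
    {"name": "pod-a", "queue": 2, "prefix_hit": 0.1},
    {"name": "pod-b", "queue": 5, "prefix_hit": 0.9},
]

pick = build_scheduler(CONFIG)
chosen = pick(ENDPOINTS)
print(chosen["name"])
```

In this sketch, changing which scorers run or how they are weighted only requires editing `CONFIG`; the picker code stays the same, which is the kind of separation the Config API highlight describes.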

What's Changed

New Contributors

Full Changelog: v0.1.0...v0.2.0