v0.2.1-rc.1
Pre-release
The image can be pulled with: docker pull ghcr.io/llm-d/llm-d-inference-scheduler:v0.2.1-rc.1
This patch release resolves a few bugs.
Justification & breakdown here: kubernetes-sigs/gateway-api-inference-extension#1215
- Helm chart configurability: kubernetes-sigs/gateway-api-inference-extension#1211
- TLS metric scraping: kubernetes-sigs/gateway-api-inference-extension#1190
- Max score picker fix: kubernetes-sigs/gateway-api-inference-extension#1205