Skip to content

Conversation

@RishabhSaini
Copy link

@RishabhSaini RishabhSaini commented Jan 14, 2026

@github-actions
Copy link

🚨 Unsigned commits detected! Please sign your commits.

For instructions on how to set up GPG/SSH signing and verify your commits, please see GitHub Documentation.

@RishabhSaini RishabhSaini changed the title LatencyPredictionScorer for SLO Aware routing LatencyPredictionScorer for PD Jan 14, 2026
@elevran elevran added this to the v0.6 milestone Jan 22, 2026
  - Add PDPredictionRequestBuilder to populate PodType from llm-d.ai/role labels
  - Add pd-slo-aware-scorer plugin wrapping slo_aware_router with P/D builder
  - Register pd-slo-aware-scorer in plugin registry
  - Add example EPP config for P/D SLO-aware scheduling (pd-slo-epp-config.yaml)
  - Add comprehensive guide on P/D SLO scheduling (docs/pd-slo-aware-scheduling.md)

  Enables separate latency prediction models for prefill vs decode workloads.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: No status

Development

Successfully merging this pull request may close these issues.

2 participants