Labels
kind/feature: Categorizes issue or PR as related to a new feature.
Description
What would you like to be added:
A new DisaggDeployment CRD that manages two LeaderWorkerSets (prefill + decode) as a single logical unit, with coordinated rolling updates and
service orchestration.
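To make the request concrete, here is a minimal Go sketch of what the spec for such a resource might look like. Everything in it is an assumption for illustration: the type names `SideSpec` and `DisaggDeploymentSpec`, the fields `Replicas`, `Size`, and `Image`, and the validation rules are hypothetical and are not taken from the LeaderWorkerSet API or from any KEP.

```go
package main

import (
	"errors"
	"fmt"
)

// SideSpec is a hypothetical description of one side (prefill or decode).
// Each side would be realized as one LeaderWorkerSet.
type SideSpec struct {
	Replicas int32  // number of leader/worker groups (assumed field)
	Size     int32  // pods per group, leader included (assumed field)
	Image    string // serving container image (assumed field)
}

// DisaggDeploymentSpec is a hypothetical spec that groups both sides
// so they can be managed as a single logical unit.
type DisaggDeploymentSpec struct {
	Prefill SideSpec
	Decode  SideSpec
}

// validateSide checks the minimal invariants assumed for one side.
func validateSide(name string, side SideSpec) error {
	if side.Replicas < 1 {
		return fmt.Errorf("%s: replicas must be at least 1, got %d", name, side.Replicas)
	}
	if side.Size < 1 {
		return fmt.Errorf("%s: size must be at least 1, got %d", name, side.Size)
	}
	if side.Image == "" {
		return fmt.Errorf("%s: image must not be empty", name)
	}
	return nil
}

// Validate requires both sides to be configured; errors from the two
// sides are combined. It returns nil when both sides are valid.
func (s DisaggDeploymentSpec) Validate() error {
	return errors.Join(
		validateSide("prefill", s.Prefill),
		validateSide("decode", s.Decode),
	)
}

func main() {
	spec := DisaggDeploymentSpec{
		Prefill: SideSpec{Replicas: 2, Size: 4, Image: "example.invalid/llm:v1"},
		Decode:  SideSpec{Replicas: 4, Size: 2, Image: "example.invalid/llm:v1"},
	}
	fmt.Println("validation error:", spec.Validate())
}
```

Keeping both sides in one spec is what would let a controller treat them as a unit; the actual field layout would be decided in the KEP.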
Why is this needed:
Disaggregated LLM inference (prefill/decode separation) requires coordinating two LeaderWorkerSets. Currently users must manually manage both
resources and handle rolling updates across them, which is error-prone and complex.
Completion requirements:
- DisaggDeployment CRD with prefill and decode side configurations
- Two-dimensional coordinated rolling updates across both sides
- Automatic Service creation when both sides are ready
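The last two requirements could be sketched roughly as follows. This is one possible interpretation, not the proposed design: `SideStatus`, `shouldCreateService`, and `nextSideToUpdate` are hypothetical names, and the policy of advancing whichever side has the smaller updated fraction is an assumption chosen only to illustrate keeping the two rollouts in step.

```go
package main

import "fmt"

// SideStatus is a hypothetical summary of one side's rollout state.
type SideStatus struct {
	Desired int32 // replicas requested
	Ready   int32 // replicas currently ready
	Updated int32 // replicas already on the new revision
}

// ready reports whether every desired replica of this side is ready.
func (s SideStatus) ready() bool {
	return s.Desired > 0 && s.Ready >= s.Desired
}

// shouldCreateService gates Service creation on both sides being ready.
func shouldCreateService(prefill, decode SideStatus) bool {
	return prefill.ready() && decode.ready()
}

// nextSideToUpdate picks the side to advance by one replica. It chooses
// the side with the smaller updated fraction (ties go to prefill) and
// returns "" once both sides are fully updated. This policy is assumed.
func nextSideToUpdate(prefill, decode SideStatus) string {
	pDone := prefill.Updated >= prefill.Desired
	dDone := decode.Updated >= decode.Desired
	switch {
	case pDone && dDone:
		return ""
	case pDone:
		return "decode"
	case dDone:
		return "prefill"
	}
	// Compare Updated/Desired fractions by cross-multiplying (no floats).
	if int64(decode.Updated)*int64(prefill.Desired) <
		int64(prefill.Updated)*int64(decode.Desired) {
		return "decode"
	}
	return "prefill"
}

func main() {
	prefill := SideStatus{Desired: 2, Ready: 2}
	decode := SideStatus{Desired: 4, Ready: 4}
	fmt.Println("create Service:", shouldCreateService(prefill, decode))

	// Simulate a rollout in which each step updates one replica.
	for side := nextSideToUpdate(prefill, decode); side != ""; side = nextSideToUpdate(prefill, decode) {
		if side == "prefill" {
			prefill.Updated++
		} else {
			decode.Updated++
		}
		fmt.Printf("updated %s: prefill %d/%d, decode %d/%d\n",
			side, prefill.Updated, prefill.Desired, decode.Updated, decode.Desired)
	}
}
```

A real controller would read these counts from each LeaderWorkerSet's status and would also need to handle unavailable replicas and rollback, which this sketch leaves out.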
This enhancement requires the following artifacts:
- KEP
- Implementation
- Docs update