update pd disaggregation templates and example #133

elieserr · 2025-10-15T01:54:39Z

This includes various changes:

- use ghcr.io/llm-d/llm-d-cuda:v0.3.0 for prefill and decode deployments
- use ghcr.io/llm-d/llm-d-inference-scheduler:v0.3.1
- update the inference scheduler template to match v0.3.1 args
- use default-pd-config.yaml for the EPP config
- set the VLLM_NIXL_SIDE_CHANNEL_PORT variable to the default vLLM port
- disable the default, deprecated inferencemodels
- replace the RBAC rules for inferencemodels with rules for inferenceobjectives
- reduce CPU requirements, since the example's model fits on a smaller node

Fixes #130


kalantar · 2025-10-21T12:42:50Z

We think the endpoint picker and InferencePool pieces should all be removed from the modelservice chart. An upstream chart is defined here: https://github.com/kubernetes-sigs/gateway-api-inference-extension/tree/main/config/charts/inferencepool (released versions at oci://registry.k8s.io/gateway-api-inference-extension/charts/inferencepool), and it already has all these updates. Can you try it and see where you get stuck?

See #135.
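For anyone following along, one low-risk way to start evaluating the upstream chart is to inspect its metadata and default values before installing anything. The snippet below is only a sketch: the chart reference comes from the comment above, but the `VERSION` value is an assumption and should be replaced with whichever released version you actually want to test.

```shell
# Chart reference taken from the comment above.
CHART="oci://registry.k8s.io/gateway-api-inference-extension/charts/inferencepool"
# Assumed version, for illustration only; check the upstream releases.
VERSION="v1.0.0"

# Only call helm if it is installed; this needs registry access.
if command -v helm >/dev/null 2>&1; then
  # Print the chart metadata (name, version, appVersion).
  helm show chart "$CHART" --version "$VERSION"
  # Print the default values, to compare with the modelservice chart's EPP settings.
  helm show values "$CHART" --version "$VERSION"
else
  echo "helm not found; install Helm 3.8+ for OCI chart support"
fi
```

Comparing the printed defaults against the endpoint picker settings in this chart should show which values would need to move if those pieces are dropped here.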

elieserr mentioned this pull request Oct 15, 2025

vLLM and llm-d PD disaggregation autoscaling kedify/examples#83

Merged

elieserr changed the title ~~update pd disaggregation example~~ update pd disaggregation templates and example Oct 15, 2025

elieserr added 3 commits October 20, 2025 14:43

update pd dissagregation example to use inference scheduler 0.3.1 and…

d76a008

… other nits Signed-off-by: elieser pereira <[email protected]>

fix linter error + include output-pd.yaml

0265cf6

Signed-off-by: elieser pereira <[email protected]>

include generated files

1254d08

Signed-off-by: elieser pereira <[email protected]>

elieserr force-pushed the update-pd-disagregation-example branch from c6f5b5e to 1254d08 Compare October 20, 2025 18:48

use proper image

7d78630

Signed-off-by: Elieser Pereira <[email protected]>

elieserr force-pushed the update-pd-disagregation-example branch from c7ecf31 to 7d78630 Compare October 20, 2025 23:13
