-
Notifications
You must be signed in to change notification settings - Fork 182
Fixed epp pod starting but not working when using multiple schedulingProfiles #1698
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Fixed epp pod starting but not working when using multiple schedulingProfiles #1698
Conversation
✅ Deploy Preview for gateway-api-inference-extension ready!
To edit notification comments on pull requests, go to your Netlify project configuration. |
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: learner0810 The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Hi @learner0810. Thanks for your PR. I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
f445ba5
to
f441bfd
Compare
ping @ahg-g |
@learner0810 not sure if we want to test that on startup. we had that discussion in #1169, where I was actually suggesting to check it but was convinced by others that we shouldn’t. there might be plugins combinations that work only if they are set together (e.g., in llm-d, pd profile handler will work correctly only if p filter is defined in prefill scheduling profile and d filter in decode profile). I think we don’t want to validate these kind of things cause it has no end. |
Even if misconfigured plugins can successfully route requests (even if routed to the wrong Pod), this may be tolerable. However, if misconfiguration causes requests to fail, we must test this during startup. Here is my understanding . Please correct me if I'm wrong. |
I think I'm okay with flagging this edge case. Esp because it references the default handler. Making our framework easier to use is a big plus wrt adoption. /ok-to-test |
This is a fair point.... but the profile handler is kind of nuanced and a more complex part of our system. It is probably a real gotcha to try and do something sophisticated (multiple profiles) and then cut yourself on the no handler implementation. |
I’m fine with that. for me it feels like testing the internals of plugins, but if you feel we want it on startup let’s go for it. |
What type of PR is this?
/kind bug
What this PR does / why we need it:
Fixed epp pod starting but not working when using multiple schedulingProfiles
Which issue(s) this PR fixes:
Fixes # #1697
Does this PR introduce a user-facing change?: