-
Notifications
You must be signed in to change notification settings - Fork 287
feat: support kubernetes environment #245
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
✅ Deploy Preview for vllm-semantic-router ready!
To edit notification comments on pull requests, go to your Netlify project configuration. |
👥 vLLM Semantic Team NotificationThe following members have been identified for the changed files in this PR and have been automatically assigned: 📁
|
c8d9b80 to
2afeebd
Compare
2afeebd to
2ebe3e8
Compare
|
Blocked by envoyproxy/ai-gateway#1239 |
d396250 to
b6a96de
Compare
Signed-off-by: bitliu <[email protected]>
b6a96de to
b85c2c5
Compare
|
/hold cancel |
| @@ -0,0 +1,60 @@ | |||
| apiVersion: inference.networking.x-k8s.io/v1alpha2 | |||
| kind: InferencePool | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This use of InferencePool does not seem to match with the Gateway API Inference SIG's design intent for InferencePool. I do see some reasoning why you chose to do this. However this can potentially cause issues down the road. Is there a design document where you provide some background for this design choice ?
Signed-off-by: bitliu <[email protected]> Signed-off-by: liuhy <[email protected]>
Signed-off-by: bitliu <[email protected]>

What type of PR is this?
feat: support kubernetes environment
What this PR does / why we need it:
This PR added kubernetes support, as well as support in Kind.
Release Notes: Yes