feat: support kubernetes environment #245

Xunzhuo · 2025-09-27T10:09:56Z

What type of PR is this?

feat: support kubernetes environment

What this PR does / why we need it:

This PR adds Kubernetes support, including support for running in Kind.

Release Notes: Yes

netlify · 2025-09-27T10:10:02Z

✅ Deploy Preview for vllm-semantic-router ready!

Name	Link
🔨 Latest commit	`b85c2c5`
🔍 Latest deploy log	https://app.netlify.com/projects/vllm-semantic-router/deploys/68d8efc6fa68ff0008a97fea
😎 Deploy Preview	https://deploy-preview-245--vllm-semantic-router.netlify.app

github-actions · 2025-09-27T10:10:11Z

👥 vLLM Semantic Team Notification

The following members have been identified for the changed files in this PR and have been automatically assigned:

📁 `deploy`

Owners: @rootfs, @Xunzhuo
Files changed:

deploy/kubernetes/ai-gateway/README.md
deploy/kubernetes/ai-gateway/configuration/config.yaml
deploy/kubernetes/ai-gateway/configuration/rbac.yaml
deploy/kubernetes/ai-gateway/configuration/redis.yaml
deploy/kubernetes/ai-gateway/inference-pool/inference-pool.yaml
deploy/kubernetes/README.md
deploy/kubernetes/config.yaml
deploy/kubernetes/deployment.yaml
deploy/kubernetes/kustomization.yaml
deploy/kubernetes/namespace.yaml
deploy/kubernetes/pvc.yaml
deploy/kubernetes/service.yaml

📁 `src`

Owners: @rootfs, @Xunzhuo, @wangchen615
Files changed:

src/semantic-router/pkg/utils/tls/tls.go
src/semantic-router/cmd/main.go
src/semantic-router/pkg/extproc/endpoint_selection_test.go
src/semantic-router/pkg/extproc/request_handler.go
src/semantic-router/pkg/extproc/server.go

📁 `tools`

Owners: @yuluo-yx, @rootfs, @Xunzhuo
Files changed:

tools/kind/kind-config.yaml
tools/make/kube.mk

📁 `website`

Owners: @Xunzhuo
Files changed:

website/docs/installation/docker-compose.md
website/docs/installation/kubernetes.md
website/docs/api/router.md
website/docs/overview/architecture/envoy-extproc.md
website/docs/overview/architecture/system-architecture.md
website/docs/tutorials/intelligent-route/reasoning.md
website/sidebars.js

📁 `Root Directory`

Owners: @rootfs, @Xunzhuo
Files changed:

Makefile
scripts/entrypoint.sh

📁 `config`

Owners: @rootfs
Files changed:

config/envoy-docker.yaml
config/envoy.yaml

🎉 Thanks for your contributions!

This comment was automatically generated based on the OWNER files in the repository.

deploy/kubernetes/ai-gateway/configuration/config.yaml

rootfs · 2025-09-27T14:45:25Z

ref: #89 #90

Xunzhuo · 2025-09-28T05:40:32Z

Blocked by envoyproxy/ai-gateway#1239

Signed-off-by: bitliu <[email protected]>

Xunzhuo · 2025-09-28T13:31:09Z

/hold cancel

srampal · 2025-09-29T06:50:16Z

deploy/kubernetes/ai-gateway/inference-pool/inference-pool.yaml

@@ -0,0 +1,60 @@
+apiVersion: inference.networking.x-k8s.io/v1alpha2
+kind: InferencePool


This use of InferencePool does not seem to match the Gateway API Inference SIG's design intent for InferencePool. I do see some reasoning for why you chose to do this. However, this could cause issues down the road. Is there a design document that provides some background for this design choice?

github-actions bot assigned rootfs, wangchen615 and Xunzhuo Sep 27, 2025

Xunzhuo force-pushed the support-k8s branch 2 times, most recently from c8d9b80 to 2afeebd on September 27, 2025 11:56

rootfs reviewed Sep 27, 2025

deploy/kubernetes/ai-gateway/configuration/config.yaml (outdated, resolved)

Xunzhuo force-pushed the support-k8s branch from 2afeebd to 2ebe3e8 on September 28, 2025 04:58

Xunzhuo force-pushed the support-k8s branch from d396250 to b6a96de on September 28, 2025 06:55

Xunzhuo marked this pull request as ready for review September 28, 2025 08:13

Xunzhuo requested a review from wangchen615 as a code owner September 28, 2025 08:13

feat: support running vsr in kubernetes environment

b85c2c5

Signed-off-by: bitliu <[email protected]>

Xunzhuo force-pushed the support-k8s branch from b6a96de to b85c2c5 on September 28, 2025 08:20

Xunzhuo added the priority/P0 Critical / Must-Have label Sep 28, 2025

Xunzhuo added this to the v0.1 milestone Sep 28, 2025

rootfs approved these changes Sep 28, 2025

rootfs merged commit ede160f into main Sep 28, 2025
9 checks passed

This was referenced Sep 28, 2025

[v0.1] Envoy ExtProc integration for GIE #89

Closed

[v0.1] Envoy ExtProc integration for Envoy AI Gateway #90

Closed

srampal reviewed Sep 29, 2025

srampal mentioned this pull request Sep 29, 2025

feat(Istio): integrate with Istio gateway via extproc #229

Merged

Aias00 pushed a commit to Aias00/semantic-router that referenced this pull request Oct 4, 2025

feat: support running vsr in kubernetes environment (vllm-project#245)

c344f5b

Signed-off-by: bitliu <[email protected]> Signed-off-by: liuhy <[email protected]>

Xunzhuo deleted the support-k8s branch October 7, 2025 06:48

yossiovadia pushed a commit to yossiovadia/semantic-router that referenced this pull request Oct 8, 2025

feat: support running vsr in kubernetes environment (vllm-project#245)

92dead5

Signed-off-by: bitliu <[email protected]>
