Skip to content

Conversation

@Xunzhuo
Copy link
Member

@Xunzhuo Xunzhuo commented Sep 27, 2025

What type of PR is this?

feat: support kubernetes environment

What this PR does / why we need it:

This PR added kubernetes support, as well as support in Kind.

Release Notes: Yes

@netlify
Copy link

netlify bot commented Sep 27, 2025

Deploy Preview for vllm-semantic-router ready!

Name Link
🔨 Latest commit b85c2c5
🔍 Latest deploy log https://app.netlify.com/projects/vllm-semantic-router/deploys/68d8efc6fa68ff0008a97fea
😎 Deploy Preview https://deploy-preview-245--vllm-semantic-router.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@github-actions
Copy link

github-actions bot commented Sep 27, 2025

👥 vLLM Semantic Team Notification

The following members have been identified for the changed files in this PR and have been automatically assigned:

📁 deploy

Owners: @rootfs, @Xunzhuo
Files changed:

  • deploy/kubernetes/ai-gateway/README.md
  • deploy/kubernetes/ai-gateway/configuration/config.yaml
  • deploy/kubernetes/ai-gateway/configuration/rbac.yaml
  • deploy/kubernetes/ai-gateway/configuration/redis.yaml
  • deploy/kubernetes/ai-gateway/inference-pool/inference-pool.yaml
  • deploy/kubernetes/README.md
  • deploy/kubernetes/config.yaml
  • deploy/kubernetes/deployment.yaml
  • deploy/kubernetes/kustomization.yaml
  • deploy/kubernetes/namespace.yaml
  • deploy/kubernetes/pvc.yaml
  • deploy/kubernetes/service.yaml

📁 src

Owners: @rootfs, @Xunzhuo, @wangchen615
Files changed:

  • src/semantic-router/pkg/utils/tls/tls.go
  • src/semantic-router/cmd/main.go
  • src/semantic-router/pkg/extproc/endpoint_selection_test.go
  • src/semantic-router/pkg/extproc/request_handler.go
  • src/semantic-router/pkg/extproc/server.go

📁 tools

Owners: @yuluo-yx, @rootfs, @Xunzhuo
Files changed:

  • tools/kind/kind-config.yaml
  • tools/make/kube.mk

📁 website

Owners: @Xunzhuo
Files changed:

  • website/docs/installation/docker-compose.md
  • website/docs/installation/kubernetes.md
  • website/docs/api/router.md
  • website/docs/overview/architecture/envoy-extproc.md
  • website/docs/overview/architecture/system-architecture.md
  • website/docs/tutorials/intelligent-route/reasoning.md
  • website/sidebars.js

📁 Root Directory

Owners: @rootfs, @Xunzhuo
Files changed:

  • Makefile
  • scripts/entrypoint.sh

📁 config

Owners: @rootfs
Files changed:

  • config/envoy-docker.yaml
  • config/envoy.yaml

vLLM

🎉 Thanks for your contributions!

This comment was automatically generated based on the OWNER files in the repository.

@Xunzhuo Xunzhuo force-pushed the support-k8s branch 2 times, most recently from c8d9b80 to 2afeebd Compare September 27, 2025 11:56
@rootfs
Copy link
Collaborator

rootfs commented Sep 27, 2025

ref: #89 #90

@Xunzhuo
Copy link
Member Author

Xunzhuo commented Sep 28, 2025

Blocked by envoyproxy/ai-gateway#1239

@Xunzhuo Xunzhuo marked this pull request as ready for review September 28, 2025 08:13
@Xunzhuo Xunzhuo added the priority/P0 Critical / Must-Have label Sep 28, 2025
@Xunzhuo
Copy link
Member Author

Xunzhuo commented Sep 28, 2025

/hold cancel

@Xunzhuo Xunzhuo added this to the v0.1 milestone Sep 28, 2025
@rootfs rootfs merged commit ede160f into main Sep 28, 2025
9 checks passed
@@ -0,0 +1,60 @@
apiVersion: inference.networking.x-k8s.io/v1alpha2
kind: InferencePool
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This use of InferencePool does not seem to match with the Gateway API Inference SIG's design intent for InferencePool. I do see some reasoning why you chose to do this. However this can potentially cause issues down the road. Is there a design document where you provide some background for this design choice ?

Aias00 pushed a commit to Aias00/semantic-router that referenced this pull request Oct 4, 2025
@Xunzhuo Xunzhuo deleted the support-k8s branch October 7, 2025 06:48
yossiovadia pushed a commit to yossiovadia/semantic-router that referenced this pull request Oct 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

priority/P0 Critical / Must-Have

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants