feat: add llm-katan to k8s deploy #466

JaredforReal · 2025-10-17T14:35:47Z

What this PR does / why we need it:

add llm-katan to k8s deploy
separate core and llm-katan mode, as what we do in Docker Compose deploy
update README and docs
expand pvc size
fix inference-pool selector error

netlify · 2025-10-17T14:35:53Z

✅ Deploy Preview for vllm-semantic-router ready!

Name	Link
🔨 Latest commit	`320d9d9`
🔍 Latest deploy log	https://app.netlify.com/projects/vllm-semantic-router/deploys/68f256244403ca00087df27d
😎 Deploy Preview	https://deploy-preview-466--vllm-semantic-router.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

Signed-off-by: JaredforReal <[email protected]>

github-actions · 2025-10-17T14:44:00Z

👥 vLLM Semantic Team Notification

The following members have been identified for the changed files in this PR and have been automatically assigned:

📁 `deploy`

Owners: @rootfs, @Xunzhuo
Files changed:

deploy/kubernetes/base/kustomization.yaml
deploy/kubernetes/deployment.katan.yaml
deploy/kubernetes/overlays/core/kustomization.yaml
deploy/kubernetes/overlays/llm-katan/kustomization.yaml
deploy/kubernetes/README.md
deploy/kubernetes/ai-gateway/inference-pool/inference-pool.yaml
deploy/kubernetes/config.yaml
deploy/kubernetes/deployment.yaml
deploy/kubernetes/kustomization.yaml
deploy/kubernetes/pvc.yaml

📁 `website`

Owners: @Xunzhuo, @rootfs, @yuluo-yx
Files changed:

website/docs/installation/kubernetes.md

🎉 Thanks for your contributions!

This comment was automatically generated based on the OWNER files in the repository.

yossiovadia

look solid for moving forward, but consider adding validation for the Qwen model download and possibly making the init container more robust in handling download failures, but it can be a future enhancement.

JaredforReal requested review from Xunzhuo and rootfs as code owners October 17, 2025 14:35

JaredforReal added 4 commits October 17, 2025 22:43

expand pvc size & fix inference-pool selector error

e9d38c0

Signed-off-by: JaredforReal <[email protected]>

add llm-katan to k8s

a6f1c3d

Signed-off-by: JaredforReal <[email protected]>

seperate core and llm-katan

415ab52

Signed-off-by: JaredforReal <[email protected]>

update k8s install docs

320d9d9

Signed-off-by: JaredforReal <[email protected]>

JaredforReal force-pushed the fix-k8s branch from 4faa35d to 320d9d9 Compare October 17, 2025 14:43

github-actions bot assigned rootfs and Xunzhuo Oct 17, 2025

JaredforReal marked this pull request as draft October 17, 2025 14:59

yossiovadia self-requested a review October 17, 2025 17:20

yossiovadia approved these changes Oct 17, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add llm-katan to k8s deploy #466

feat: add llm-katan to k8s deploy #466

Uh oh!

JaredforReal commented Oct 17, 2025

Uh oh!

netlify bot commented Oct 17, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Oct 17, 2025

Uh oh!

yossiovadia left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

feat: add llm-katan to k8s deploy #466

Are you sure you want to change the base?

feat: add llm-katan to k8s deploy #466

Uh oh!

Conversation

JaredforReal commented Oct 17, 2025

Uh oh!

netlify bot commented Oct 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for vllm-semantic-router ready!

Uh oh!

github-actions bot commented Oct 17, 2025

👥 vLLM Semantic Team Notification

📁 deploy

📁 website

🎉 Thanks for your contributions!

Uh oh!

yossiovadia left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

netlify bot commented Oct 17, 2025 •

edited

Loading

📁 `deploy`

📁 `website`