generated from kubernetes/kubernetes-template-project
-
Notifications
You must be signed in to change notification settings - Fork 184
Closed
Labels
kind/bugCategorizes issue or PR as related to a bug.Categorizes issue or PR as related to a bug.needs-triageIndicates an issue or PR lacks a `triage/foo` label and requires one.Indicates an issue or PR lacks a `triage/foo` label and requires one.
Description
What happened:
Pods not coming up when for CPU deployment:
$ k get pods
NAME READY STATUS RESTARTS AGE
vllm-llama3-8b-instruct-586778f79c-hb4vp 0/1 Init:0/1 0 6m15s
vllm-llama3-8b-instruct-586778f79c-qjk94 0/1 Init:0/1 0 6m15s
vllm-llama3-8b-instruct-586778f79c-w6pjh 0/1 Init:0/1 0 6m15s
Upon looking further, it seems the container lora is not starting:
$ kubectl logs vllm-llama3-8b-instruct-586778f79c-hb4vp
Defaulted container "lora" out of: lora, lora-adapter-syncer (init)
Error from server (BadRequest): container "lora" in pod "vllm-llama3-8b-instruct-586778f79c-7gmsb" is waiting to start: PodInitializing
What you expected to happen:
How to reproduce it (as minimally and precisely as possible):
Try steps for CPU deployment from getting started guide: https://gateway-api-inference-extension.sigs.k8s.io/guides/
Anything else we need to know?:
Environment:
- Kubernetes version (use
kubectl version): - Inference extension version (use
git describe --tags --dirty --always): - Cloud provider or hardware configuration:
- Install tools:
- Others:
Metadata
Metadata
Assignees
Labels
kind/bugCategorizes issue or PR as related to a bug.Categorizes issue or PR as related to a bug.needs-triageIndicates an issue or PR lacks a `triage/foo` label and requires one.Indicates an issue or PR lacks a `triage/foo` label and requires one.