Skip to content

Pods not coming up for CPU deployment #1535

@rahulgurnani

Description

@rahulgurnani

What happened:

Pods not coming up when for CPU deployment:

$ k get pods
NAME                                       READY   STATUS     RESTARTS   AGE
vllm-llama3-8b-instruct-586778f79c-hb4vp   0/1     Init:0/1   0          6m15s
vllm-llama3-8b-instruct-586778f79c-qjk94   0/1     Init:0/1   0          6m15s
vllm-llama3-8b-instruct-586778f79c-w6pjh   0/1     Init:0/1   0          6m15s

Upon looking further, it seems the container lora is not starting:

$ kubectl logs vllm-llama3-8b-instruct-586778f79c-hb4vp
Defaulted container "lora" out of: lora, lora-adapter-syncer (init)
Error from server (BadRequest): container "lora" in pod "vllm-llama3-8b-instruct-586778f79c-7gmsb" is waiting to start: PodInitializing

What you expected to happen:

How to reproduce it (as minimally and precisely as possible):

Try steps for CPU deployment from getting started guide: https://gateway-api-inference-extension.sigs.k8s.io/guides/

Anything else we need to know?:

Environment:

  • Kubernetes version (use kubectl version):
  • Inference extension version (use git describe --tags --dirty --always):
  • Cloud provider or hardware configuration:
  • Install tools:
  • Others:

Metadata

Metadata

Assignees

No one assigned

    Labels

    kind/bugCategorizes issue or PR as related to a bug.needs-triageIndicates an issue or PR lacks a `triage/foo` label and requires one.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions