WIP - Add vLLM readiness probe script to sim container image #210

rajinator · 2025-10-09T02:32:57Z

This change integrates the readiness probe script from the llm-d repository PR #330 into llm-d-inference-sim, following the same Dockerfile-only approach used in llm-d PR #330 without requiring any Go code modifications.

The reason for this change is to address issue #300 on the llm-d/llm-d repo. In PR #330, at the moment, E2E testing with sim images breaks due to readiness-probe script not being available in the image

The readiness probe provides comprehensive 3-stage health validation:

Stage 1: Basic health endpoint (/health) responding
Stage 2: Models endpoint (/v1/models) available
Stage 3: Model metadata properly returned (non-empty data)

Changes included:

Add scripts/readiness_probe.sh: Comprehensive health check script that validates vLLM-compatible API readiness with configurable timeouts and flexible host/port parameters
Update Dockerfile: Install curl runtime dependency and copy the readiness probe script to /usr/local/bin/ with executable permissions

The script is available in the container for users who need advanced readiness validation but does not change existing behavior. Users can optionally configure exec-based probes using this script while the default HTTP-based probes continue to work as before.

This addresses the need for more robust model-loading verification in production deployments where the basic HTTP health check may return success before the model is fully loaded and ready to serve requests.

This change integrates the readiness probe script from the llm-d repository pr#300 into llm-d-inference-sim, following the same Dockerfile-only approach used in llm-d pr#300 without requiring any Go code modifications. The readiness probe provides comprehensive 3-stage health validation: - Stage 1: Basic health endpoint (/health) responding - Stage 2: Models endpoint (/v1/models) available - Stage 3: Model metadata properly returned (non-empty data) Changes included: - Add scripts/readiness_probe.sh: Comprehensive health check script that validates vLLM-compatible API readiness with configurable timeouts and flexible host/port parameters - Update Dockerfile: Install curl runtime dependency and copy the readiness probe script to /usr/local/bin/ with executable permissions The script is available in the container for users who need advanced readiness validation but does not change existing behavior. Users can optionally configure exec-based probes using this script while the default HTTP-based probes continue to work as before. This addresses the need for more robust model-loading verification in production deployments where the basic HTTP health check may return success before the model is fully loaded and ready to serve requests. Signed-off-by: rajinator <[email protected]>

mayabar · 2025-10-23T07:13:26Z

@rajinator is it still WIP or ready for review?

rajinator · 2025-10-23T15:24:00Z

@mayabar it's still WIP, waiting on @Gregory-Pereira to test the updated alternate approach in llm-d/llm-d#330

The testing will let us know whether to close or move forward with this

rajinator · 2025-10-25T22:26:10Z

Closing this because llm-d/llm-d#330 has been tested and merged with the alternate approach. Thank you @Gregory-Pereira !

rajinator changed the title ~~WIP - Add vLLM readiness probe script to container image for supporting~~ WIP - Add vLLM readiness probe script to sim container image Oct 9, 2025

rajinator closed this Oct 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

WIP - Add vLLM readiness probe script to sim container image #210

WIP - Add vLLM readiness probe script to sim container image #210

Uh oh!

rajinator commented Oct 9, 2025 •

edited

Loading

Uh oh!

mayabar commented Oct 23, 2025

Uh oh!

rajinator commented Oct 23, 2025

Uh oh!

rajinator commented Oct 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

WIP - Add vLLM readiness probe script to sim container image #210

WIP - Add vLLM readiness probe script to sim container image #210

Uh oh!

Conversation

rajinator commented Oct 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mayabar commented Oct 23, 2025

Uh oh!

rajinator commented Oct 23, 2025

Uh oh!

rajinator commented Oct 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rajinator commented Oct 9, 2025 •

edited

Loading