WIP - Add vLLM readiness probe script to sim container image #210
This change integrates the readiness probe script from llm-d PR #330 into llm-d-inference-sim. It follows the same Dockerfile-only approach used in that PR and requires no Go code modifications.
This change addresses issue #300 on the llm-d/llm-d repo. Currently, E2E testing with sim images in PR #330 breaks because the readiness-probe script is not available in the image.
The readiness probe provides comprehensive 3-stage health validation:
Changes included:
The script is available in the container for users who need advanced readiness validation but does not change existing behavior. Users can optionally configure exec-based probes using this script while the default HTTP-based probes continue to work as before.
This addresses the need for more robust model-loading verification in production deployments where the basic HTTP health check may return success before the model is fully loaded and ready to serve requests.
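To illustrate why a basic health check alone can be insufficient, here is a minimal Python sketch of a staged readiness check. It is not the script from PR #330, and its stages are an assumption rather than that script's actual stages. It assumes the server exposes `/health` and an OpenAI-compatible `/v1/models` endpoint, as vLLM's server does. `BASE_URL`, `http_fetch`, and `check_ready` are hypothetical names chosen for this example.

```python
import json
import urllib.error
import urllib.request

# Assumed server address for illustration only; adjust to the actual deployment.
BASE_URL = "http://localhost:8000"


def http_fetch(path, timeout=5.0):
    """GET BASE_URL + path and return (status, body); status is 0 on connection failure."""
    try:
        with urllib.request.urlopen(BASE_URL + path, timeout=timeout) as resp:
            return resp.status, resp.read().decode("utf-8", errors="replace")
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status.
        return err.code, ""
    except (urllib.error.URLError, OSError):
        # Connection refused, DNS failure, timeout, and similar.
        return 0, ""


def check_ready(fetch=http_fetch):
    """Return (ok, reason). Passes only if the server is up AND lists a model."""
    # Stage A (assumed): the basic HTTP health check.
    status, _ = fetch("/health")
    if status != 200:
        return False, "health endpoint not OK"

    # Stage B (assumed): the models endpoint must answer and list a model.
    status, body = fetch("/v1/models")
    if status != 200:
        return False, "models endpoint not OK"
    try:
        models = json.loads(body).get("data", [])
    except (ValueError, AttributeError):
        return False, "models response not valid JSON"
    if not models:
        return False, "no models loaded yet"
    return True, "ready"
```

A wrapper used as an exec-based probe would call `check_ready()` and exit non-zero when the first element of the result is `False`. The `fetch` parameter exists so the logic can be exercised without a running server.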