Adding ci_calibration_smoke_tests.sh into v0.16.0 #1042

Merged

mgawarkiewicz-intel merged 1 commit into vllm-project:releases/v0.16.0 from iboiko-habana:ci_v0.16.0_calibration

Feb 25, 2026

Collaborator

iboiko-habana commented Feb 25, 2026

No description provided.


          Adding ci_calibration_smoke_tests.sh into v0.16.0

1b1a7c8

Signed-off-by: Iryna Boiko <iboiko@habana.ai>

Copilot AI review requested due to automatic review settings

February 25, 2026 13:08

iboiko-habana requested review from mgawarkiewicz-intel, piotrbocian and wpyszka as code owners

February 25, 2026 13:08

iboiko-habana added the skip-gaudi-tests label

Copilot started reviewing on behalf of iboiko-habana

February 25, 2026 13:08

Copilot AI reviewed


Contributor

Copilot AI left a comment

Pull request overview

Adds a new CI-oriented smoke-test script to validate that the FP8 calibration workflow can run end-to-end for a couple of representative models.

Changes:

Introduces tests/calibration_tests/ci_calibration_smoke_tests.sh to run lightweight calibration runs (batch=1, limit=1).
Adds helper functions for per-test temp output cleanup and a simple function-dispatch entrypoint.
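
The script is not reproduced in full on this page, so the following is only a minimal sketch of what a function-dispatch entrypoint of this kind might look like. `launch_all_tests` is named in the review comments; `run_example_test` and `main` are hypothetical names chosen for illustration, and the real script may dispatch differently.

```shell
#!/usr/bin/env bash
# Sketch only: run_example_test and main are hypothetical names.

run_example_test() {
    echo "ran example test"
}

# Default target when no function name is given.
launch_all_tests() {
    run_example_test
}

main() {
    if [ "$#" -eq 0 ]; then
        launch_all_tests
    elif declare -F "$1" > /dev/null; then
        # Call the named function, forwarding any remaining arguments.
        "$@"
    else
        echo "Unknown function: $1" >&2
        return 1
    fi
}
```

A script built this way would typically end with `main "$@"`, so that running it with no arguments executes every test and running it with a function name executes only that test.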


tests/calibration_tests/ci_calibration_smoke_tests.sh

+                      rm -rf "${CALIBRATION_OUTPUT_DIR}"
+                  fi
+              }

Copilot AI Feb 25, 2026

On failure, set -e will exit before any later cleanup runs, which can leave tmp-calibration-output behind. Consider registering cleanup_calibration_output via a trap (e.g., on EXIT) so artifacts are removed even when a calibration step fails.

Suggested change

      
+              # Ensure calibration output is cleaned on any script exit (including failures)
+              trap cleanup_calibration_output EXIT
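
To see why the trap matters, the following self-contained sketch runs a deliberately failing script in a child bash process and checks whether the output directory is gone afterwards. The cleanup function body mirrors the lines quoted above; `DEMO_ROOT`, `demo_status`, `cleanup_result`, and the use of `false` as a stand-in for a failing calibration step are assumptions made for the demo.

```shell
#!/usr/bin/env bash
# Sketch only: DEMO_ROOT and the failing step are hypothetical stand-ins.
DEMO_ROOT="$(mktemp -d)"
export CALIBRATION_OUTPUT_DIR="${DEMO_ROOT}/tmp-calibration-output"

# Run the failing script in a child bash so this demo can inspect the result.
demo_status=0
bash -s <<'EOF' || demo_status=$?
set -e

cleanup_calibration_output() {
    if [ -d "${CALIBRATION_OUTPUT_DIR}" ]; then
        rm -rf "${CALIBRATION_OUTPUT_DIR}"
    fi
}
# Registered before any work starts, so it also fires when set -e aborts.
trap cleanup_calibration_output EXIT

mkdir -p "${CALIBRATION_OUTPUT_DIR}"
false                 # stand-in for a failing calibration step
echo "not reached"    # set -e has already ended the child script
EOF

if [ -d "${CALIBRATION_OUTPUT_DIR}" ]; then
    cleanup_result="leftover"
else
    cleanup_result="cleaned"
fi
echo "${cleanup_result} (child exit status: ${demo_status})"
rm -rf "${DEMO_ROOT}"
```

The child still exits with a non-zero status, so CI sees the failure; the trap only guarantees that the temporary directory does not outlive the run.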

Copilot uses AI. Check for mistakes.

tests/calibration_tests/ci_calibration_smoke_tests.sh

+                  echo "If no function_name is provided, all tests will be run."
+                  echo ""
+                  echo "Available functions:"
+                  declare -F | awk '{print "  - " $3}' | grep --color=never "run_"

Copilot AI Feb 25, 2026

usage() only lists functions matching run_, but users can also invoke launch_all_tests (and it’s the default). Consider including launch_all_tests in the output (or update the help text) so the usage info matches actual behavior.

Suggested change

      
-                  declare -F | awk '{print "  - " $3}' | grep --color=never "run_"
+                  declare -F | awk '{print "  - " $3}' | grep --color=never "run_"
+                  echo "  - launch_all_tests (default: runs all tests sequentially)"
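
An alternative to the extra `echo` is to widen the filter so the listing is derived entirely from the defined functions. This is a sketch under assumptions: `run_example_calibration` and `run_other_calibration` are hypothetical placeholders for the script's real `run_` functions, which are not shown on this page.

```shell
#!/usr/bin/env bash
# Sketch only: the two run_ functions below are hypothetical placeholders.
run_example_calibration() { :; }
run_other_calibration() { :; }
launch_all_tests() { :; }

usage() {
    echo "If no function_name is provided, all tests will be run."
    echo ""
    echo "Available functions:"
    # declare -F prints "declare -f NAME"; field 3 is the function name.
    declare -F | awk '{print "  - " $3}' \
        | grep --color=never -E "run_|launch_all_tests"
}
```

The trade-off is that the "(default)" annotation from the suggested change is lost, so the explicit `echo` may still be preferable when the default needs to be called out.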


tests/calibration_tests/ci_calibration_smoke_tests.sh

Comment on lines +36 to +40

+                      -d "${CALIBRATION_DATASET}" \
+                      -o "${CALIBRATION_OUTPUT_DIR}" \
+                      -b ${BATCH_SIZE} \
+                      -l ${LIMIT} \
+                      -t 1

Copilot AI Feb 25, 2026

calibrate_model.sh resolves -o via realpath, which fails if the output directory doesn’t exist. Since cleanup_calibration_output deletes the directory, recreate it (e.g., mkdir -p) before invoking calibration so this smoke test doesn’t fail immediately.
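
Whether `realpath` rejects a missing directory depends on the implementation and the flags in use, so creating the directory up front sidesteps the question. A minimal sketch, assuming a hypothetical helper named `prepare_calibration_output` and a scratch location created with `mktemp`:

```shell
#!/usr/bin/env bash
# Sketch only: prepare_calibration_output and the mktemp location are
# hypothetical; the real script defines its own CALIBRATION_OUTPUT_DIR.
CALIBRATION_OUTPUT_DIR="$(mktemp -d)/tmp-calibration-output"

cleanup_calibration_output() {
    if [ -d "${CALIBRATION_OUTPUT_DIR}" ]; then
        rm -rf "${CALIBRATION_OUTPUT_DIR}"
    fi
}

# Remove stale output, then recreate an empty directory so that any path
# resolution performed on the -o argument has something to resolve.
prepare_calibration_output() {
    cleanup_calibration_output
    mkdir -p "${CALIBRATION_OUTPUT_DIR}"
}

prepare_calibration_output
# The calibrate_model.sh call with -o "${CALIBRATION_OUTPUT_DIR}" would follow.
```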


tests/calibration_tests/ci_calibration_smoke_tests.sh

Comment on lines +42 to +45

+                  if [ $? -ne 0 ]; then
+                      echo "Error: Calibration failed for ibm-granite/granite-3.3-2b-instruct" >&2
+                      exit 1
+                  fi

Copilot AI Feb 25, 2026

This script uses set -e, so if calibrate_model.sh fails the script will exit immediately and this $? check is dead code. If you want a custom error message, wrap the command in if ! ...; then ...; fi; otherwise remove the explicit status check.
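
The `if ! ...; then` form can be sketched as follows. `run_calibration_step` is a hypothetical stand-in for the `calibrate_model.sh` invocation and always fails here; the sketch also uses `return 1` inside a function rather than the script's `exit 1`, purely so the demo can keep running and inspect the result.

```shell
#!/usr/bin/env bash
set -e

# Hypothetical stand-in for the calibrate_model.sh call; always fails.
run_calibration_step() {
    return 1
}

calibrate_or_report() {
    model_name="$1"
    # A command tested by `if` does not trigger set -e, so the error
    # branch below is reachable.
    if ! run_calibration_step; then
        echo "Error: Calibration failed for ${model_name}" >&2
        return 1
    fi
}

status=0
message="$(calibrate_or_report "ibm-granite/granite-3.3-2b-instruct" 2>&1)" || status=$?
echo "status=${status} message=${message}"
```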


tests/calibration_tests/ci_calibration_smoke_tests.sh

Comment on lines +63 to +66

+                  if [ $? -ne 0 ]; then
+                      echo "Error: Calibration failed for Qwen/Qwen2.5-0.5B-Instruct" >&2
+                      exit 1
+                  fi

Copilot AI Feb 25, 2026

Same issue as above: with set -e enabled, this $? check will never run on failure. Prefer if ! ...; then ...; fi (for a custom error message) or drop the check.
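
The dead-code behavior is easy to confirm in isolation. The sketch below writes a tiny script to a temporary file and runs it in a child bash; `false` stands in for a failing `calibrate_model.sh` call, and `demo_script`, `child_status`, and `child_output` are hypothetical names used only for this demo.

```shell
#!/usr/bin/env bash
# Sketch only: demo_script and the failing step are hypothetical stand-ins.
demo_script="$(mktemp)"
cat > "${demo_script}" <<'EOF'
set -e
false   # stand-in for a failing calibrate_model.sh call
if [ $? -ne 0 ]; then
    echo "Error: Calibration failed" >&2
    exit 1
fi
EOF

child_status=0
child_output="$(bash "${demo_script}" 2>&1)" || child_status=$?
rm -f "${demo_script}"

# The child fails, but the custom message is never printed.
echo "status=${child_status} output='${child_output}'"
```

The child exits at the failing command, so the `$?` branch contributes nothing: the exit status is non-zero either way, and the error message never appears.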


afierka-intel approved these changes


mgawarkiewicz-intel enabled auto-merge (squash)

February 25, 2026 13:29

mgawarkiewicz-intel approved these changes


mgawarkiewicz-intel merged commit 988ecd2 into vllm-project:releases/v0.16.0

13 of 28 checks passed


Reviewers

Copilot code review: left review comments

mgawarkiewicz-intel: approved these changes

afierka-intel: approved these changes

wpyszka: awaiting requested review (code owner)

piotrbocian: awaiting requested review (code owner)

Labels

skip-gaudi-tests