Adding ci_calibration_smoke_tests.sh into v0.16.0 #1042

Merged
mgawarkiewicz-intel merged 1 commit into vllm-project:releases/v0.16.0 from iboiko-habana:ci_v0.16.0_calibration
Feb 25, 2026

Conversation

@iboiko-habana
Collaborator

No description provided.

Signed-off-by: Iryna Boiko <iboiko@habana.ai>
Contributor

Copilot AI left a comment

Pull request overview

Adds a new CI-oriented smoke-test script to validate that the FP8 calibration workflow can run end-to-end for a couple of representative models.

Changes:

  • Introduces tests/calibration_tests/ci_calibration_smoke_tests.sh to perform lightweight calibration runs (batch=1, limit=1).
  • Adds helper functions for per-test temp output cleanup and a simple function-dispatch entrypoint.

rm -rf "${CALIBRATION_OUTPUT_DIR}"
fi
}

Copilot AI Feb 25, 2026

On failure, set -e will exit before any later cleanup runs, which can leave tmp-calibration-output behind. Consider registering cleanup_calibration_output via a trap (e.g., on EXIT) so artifacts are removed even when a calibration step fails.

Suggested change
# Ensure calibration output is cleaned on any script exit (including failures)
trap cleanup_calibration_output EXIT
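
The suggestion above can be sketched in isolation. This is a minimal demonstration, not the PR's script: DEMO_ROOT and the `false` command standing in for a failing calibration step are assumptions made for illustration, and the cleanup function body only mirrors the fragment quoted in the review.

```shell
#!/usr/bin/env bash
# Sketch: an EXIT trap removes temporary output even when `set -e` aborts the run.
# DEMO_ROOT and the failing `false` step are hypothetical stand-ins.

DEMO_ROOT="$(mktemp -d)"

# Run the "test" in a child shell so that its failure does not end this demo.
status=0
DEMO_ROOT="${DEMO_ROOT}" bash -c '
    set -e
    CALIBRATION_OUTPUT_DIR="${DEMO_ROOT}/tmp-calibration-output"

    cleanup_calibration_output() {
        if [ -d "${CALIBRATION_OUTPUT_DIR}" ]; then
            rm -rf "${CALIBRATION_OUTPUT_DIR}"
        fi
    }
    # Registered before any work starts, so it fires on every exit path.
    trap cleanup_calibration_output EXIT

    mkdir -p "${CALIBRATION_OUTPUT_DIR}"
    false   # stand-in for a failing calibration step; set -e exits here
    echo "not reached"
' || status=$?

# Record whether the child left its output directory behind.
if [ -d "${DEMO_ROOT}/tmp-calibration-output" ]; then
    leftover="yes"
else
    leftover="no"
fi
echo "child exit status: ${status}, leftover output dir: ${leftover}"
rmdir "${DEMO_ROOT}"
```

Registering the trap once near the top of the script keeps cleanup independent of which test function fails.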

echo "If no function_name is provided, all tests will be run."
echo ""
echo "Available functions:"
declare -F | awk '{print " - " $3}' | grep --color=never "run_"
Copilot AI Feb 25, 2026

usage() lists only functions matching run_, but users can also invoke launch_all_tests, which is the default. Consider including launch_all_tests in the output (or updating the help text) so the usage information matches actual behavior.

Suggested change
- declare -F | awk '{print " - " $3}' | grep --color=never "run_"
+ declare -F | awk '{print " - " $3}' | grep --color=never "run_"
+ echo " - launch_all_tests (default: runs all tests sequentially)"
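
As a rough illustration of how the help text and the entrypoint could stay in sync, the sketch below filters the function list for both run_ functions and launch_all_tests and dispatches only to those names. The two run_demo_* functions and the dispatch helper are hypothetical. Only the `declare -F | awk | grep` pipeline and the launch_all_tests default come from the review. This variant extends the grep pattern instead of adding an echo line, which is one possible alternative to the suggestion.

```shell
#!/usr/bin/env bash
# Sketch of a function-dispatch entrypoint whose usage text matches what can be invoked.
# run_demo_model_a, run_demo_model_b and dispatch are hypothetical names.

run_demo_model_a() { echo "ran demo model A"; }
run_demo_model_b() { echo "ran demo model B"; }

launch_all_tests() {
    run_demo_model_a
    run_demo_model_b
}

usage() {
    echo "Usage: $0 [function_name]"
    echo "If no function_name is provided, all tests will be run."
    echo ""
    echo "Available functions:"
    # List the default entry as well as the individual run_ functions.
    declare -F | awk '{print " - " $3}' | grep --color=never -E "run_|launch_all_tests"
}

dispatch() {
    local target="${1:-launch_all_tests}"
    case "${target}" in
        run_*|launch_all_tests)
            # Only call names that are actually defined as shell functions.
            if declare -F "${target}" >/dev/null; then
                "${target}"
                return
            fi
            ;;
    esac
    usage >&2
    return 1
}
```

A script built this way would end with a call such as `dispatch "$@"`, so that running it without arguments executes every test.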

Comment on lines +36 to +40
-d "${CALIBRATION_DATASET}" \
-o "${CALIBRATION_OUTPUT_DIR}" \
-b ${BATCH_SIZE} \
-l ${LIMIT} \
-t 1
Copilot AI Feb 25, 2026

calibrate_model.sh resolves -o via realpath, which fails if the output directory doesn’t exist. Since cleanup_calibration_output deletes the directory, recreate it (e.g., mkdir -p) before invoking calibration so this smoke test doesn’t fail immediately.
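
One way to follow this advice is to recreate the directory as part of the per-test preparation step. The sketch below assumes the behavior of calibrate_model.sh described in the comment and does not verify it. The prepare_calibration_output helper and the DEMO_ROOT path are hypothetical.

```shell
#!/usr/bin/env bash
# Sketch: start each test from an empty output directory that is guaranteed to exist.
# prepare_calibration_output and DEMO_ROOT are hypothetical.

prepare_calibration_output() {
    local output_dir="$1"
    rm -rf "${output_dir}"      # drop artifacts left by a previous test
    mkdir -p "${output_dir}"    # recreate it, including missing parent directories
}

DEMO_ROOT="$(mktemp -d)"
CALIBRATION_OUTPUT_DIR="${DEMO_ROOT}/nested/tmp-calibration-output"

prepare_calibration_output "${CALIBRATION_OUTPUT_DIR}"

# The path can now be resolved, because every component exists.
resolved="$(realpath "${CALIBRATION_OUTPUT_DIR}")"
resolved_name="$(basename "${resolved}")"

if [ -d "${CALIBRATION_OUTPUT_DIR}" ]; then
    created="yes"
else
    created="no"
fi
echo "created: ${created}, resolved directory name: ${resolved_name}"
rm -rf "${DEMO_ROOT}"
```

Calling the helper immediately before each calibration run keeps cleanup and setup in a single place.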

Comment on lines +42 to +45
if [ $? -ne 0 ]; then
echo "Error: Calibration failed for ibm-granite/granite-3.3-2b-instruct" >&2
exit 1
fi
Copilot AI Feb 25, 2026

This script uses set -e, so if calibrate_model.sh fails the script will exit immediately and this $? check is dead code. If you want a custom error message, wrap the command in if ! ...; then ...; fi; otherwise remove the explicit status check.
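
The `if ! ...; then ...; fi` form mentioned above can be sketched as follows. A command used as an `if` condition is exempt from `set -e`, so the error branch is reachable. fake_calibrate is a hypothetical stand-in for calibrate_model.sh, and the sketch uses `return 1` where the reviewed script uses `exit 1` so that both outcomes can be shown in a single run.

```shell
#!/usr/bin/env bash
# Sketch: a custom error message that still works when `set -e` is active.
# fake_calibrate is a hypothetical stand-in that fails when passed "fail".

fake_calibrate() {
    [ "$1" != "fail" ]
}

run_with_message() {
    local model="$1"
    local mode="$2"
    # The command is the `if` condition, so set -e does not abort before the message.
    if ! fake_calibrate "${mode}"; then
        echo "Error: Calibration failed for ${model}" >&2
        return 1
    fi
    echo "Calibration finished for ${model}"
}

# Each case runs in its own subshell with set -e enabled.
ok_output="$(set -e; run_with_message "demo/model" ok)"

fail_status=0
fail_output="$( (set -e; run_with_message "demo/model" fail) 2>&1 )" || fail_status=$?

echo "success case: ${ok_output}"
echo "failure case (status ${fail_status}): ${fail_output}"
```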

Comment on lines +63 to +66
if [ $? -ne 0 ]; then
echo "Error: Calibration failed for Qwen/Qwen2.5-0.5B-Instruct" >&2
exit 1
fi
Copilot AI Feb 25, 2026

Same issue as above: with set -e enabled, this $? check will never run on failure. Prefer if ! ...; then ...; fi (for a custom error message) or drop the check.

@mgawarkiewicz-intel enabled auto-merge (squash) February 25, 2026 13:29
@mgawarkiewicz-intel merged commit 988ecd2 into vllm-project:releases/v0.16.0 Feb 25, 2026
13 of 28 checks passed

4 participants