Remove all lazy execution mode from the codebase -> eager is the new default by afierka-intel · Pull Request #996 · vllm-project/vllm-gaudi

afierka-intel · 2026-02-20T09:00:30Z

No description provided.

Signed-off-by: Artur Fierka <artur.fierka@intel.com>

github-actions · 2026-02-20T09:01:26Z

🚧 CI Blocked

The main CI workflow was not started for the following reason:

This is a Draft PR. Please mark it as 'Ready for Review' to trigger the CI.

Copilot

Pull request overview

This pull request removes lazy execution mode from the codebase, making eager execution the new default for the vLLM-HPU (Habana Processing Unit) plugin. The change eliminates all references to PT_HPU_LAZY_MODE=1 (lazy mode) throughout the test scripts, CI/CD configurations, examples, and documentation, effectively standardizing on eager mode (PT_HPU_LAZY_MODE=0 or unset).

Changes:

Removed PT_HPU_LAZY_MODE=1 environment variable settings from all test scripts, examples, and CI/CD workflows
Updated calibration scripts to remove conditional logic that differentiated between lazy and eager modes
Modified test expectations to reflect eager mode as the default

Reviewed changes

Copilot reviewed 16 out of 16 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
tests/upstream_tests/ci_tests.sh	Removed PT_HPU_LAZY_MODE=1 from basic model test commands
tests/unit_tests/run_accuracy_test.sh	Removed export of PT_HPU_LAZY_MODE=1
tests/full_tests/ci_perf_tests.sh	Removed PT_HPU_LAZY_MODE=1 from performance benchmark tests
tests/full_tests/ci_e2e_discoverable_tests.sh	Removed PT_HPU_LAZY_MODE=1 from 30+ test functions covering various model configurations
examples/nixl/run_hpu_disagg_accuracy_test.sh	Removed commented-out PT_HPU_LAZY_MODE export
examples/nixl/run_benchmark_test.sh	Removed PT_HPU_LAZY_MODE=1 export from benchmark configuration
examples/nixl/run_benchmark_profile.sh	Removed PT_HPU_LAZY_MODE=1 export from profiling configuration
examples/nixl/run_accuracy_test.sh	Removed PT_HPU_LAZY_MODE=1 export from accuracy test setup
docs/configuration/warm-up/managing_warm-up.md	Removed PT_HPU_LAZY_MODE=1 from documentation example command
docs/configuration/calibration/calibration_one_node.md	Removed PT_HPU_LAZY_MODE=1 from calibration example for DeepSeek models
calibration/calibrate_model.sh	Simplified conditional logic to always use CONST scale_format for mixtral models (previously conditional on lazy mode)
.jenkins/test_config.yaml	Removed PT_HPU_LAZY_MODE=1 from Jenkins CI test configurations
.github/workflows/pre-merge.yaml	Removed PT_HPU_LAZY_MODE=1 from pre-merge GitHub Actions workflow
.github/workflows/hourly-ci.yaml	Removed PT_HPU_LAZY_MODE=1 from hourly CI GitHub Actions workflow
.github/workflows/create-release-branch.yaml	Removed PT_HPU_LAZY_MODE=1 from release branch creation workflow
.cd/tests/test_vllm_autocalc.py	Updated test to expect PT_HPU_LAZY_MODE=0 instead of =1

Comments suppressed due to low confidence (4)

tests/full_tests/ci_e2e_discoverable_tests.sh:105

The comment "The lazy mode works on 1.24.0-272" is now obsolete since PT_HPU_LAZY_MODE=0 has been removed from the command on line 108. This comment should be updated or removed to reflect that eager mode is now the default and these tests run with eager mode.

# The lazy mode works on 1.24.0-272

.cd/tests/test_vllm_autocalc.py:41

The test is being updated to use PT_HPU_LAZY_MODE=0 instead of PT_HPU_LAZY_MODE=1, which correctly aligns with making eager mode the default. However, this test file still references PT_HPU_LAZY_MODE in the varlist configuration (line 18) and in the test assertion. If lazy mode is being completely removed from the codebase, consider whether this test should be updated to test a different environment variable, or if the PT_HPU_LAZY_MODE variable should be removed from the test configuration entirely.

    monkeypatch.setenv("PT_HPU_LAZY_MODE", "0")
    vg = VarsGenerator(**minimal_config)
    vg.overwrite_params()
    assert vg.context["PT_HPU_LAZY_MODE"] == 0

tests/full_tests/ci_e2e_discoverable_tests.sh:89

The comment "The lazy mode works on 1.24.0-272" is now obsolete since PT_HPU_LAZY_MODE=0 has been removed from the command on line 92. This comment should be updated or removed to reflect that eager mode is now the default and these tests run with eager mode.

# The lazy mode works on 1.24.0-272

tests/full_tests/ci_e2e_discoverable_tests.sh:97

The comment "The lazy mode works on 1.24.0-272" is now obsolete since PT_HPU_LAZY_MODE=0 has been removed from the command on line 100. This comment should be updated or removed to reflect that eager mode is now the default and these tests run with eager mode.

# The lazy mode works on 1.24.0-272

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Signed-off-by: Artur Fierka <artur.fierka@intel.com>

github-actions · 2026-02-20T10:11:43Z

🚧 CI Blocked

The main CI workflow was not started for the following reason:

This is a Draft PR. Please mark it as 'Ready for Review' to trigger the CI.

Signed-off-by: Artur Fierka <artur.fierka@intel.com>

…tching envs Signed-off-by: Artur Fierka <artur.fierka@intel.com>

Remove all lazy execution mode from the codebase -> eager is new default

05c1b64

Signed-off-by: Artur Fierka <artur.fierka@intel.com>

Copilot AI review requested due to automatic review settings February 20, 2026 09:00

Copilot started reviewing on behalf of afierka-intel February 20, 2026 09:01 View session

Copilot AI reviewed Feb 20, 2026

View reviewed changes

Switch default CI docker images to vllm upstream versions

9536cc3

Signed-off-by: Artur Fierka <artur.fierka@intel.com>

afierka-intel marked this pull request as ready for review February 20, 2026 10:13

afierka-intel requested review from adobrzyn, iboiko-habana, kamil-kaczor, ksmusz, mgawarkiewicz-intel, michalkuligowski and xuechendi as code owners February 20, 2026 10:13

github-actions bot mentioned this pull request Feb 20, 2026

🚦 Team Review Dashboard #701

Open

afierka-intel and others added 8 commits February 20, 2026 15:43

Fix base images for GH CI

d2fda11

Signed-off-by: Artur Fierka <artur.fierka@intel.com>

Merge branch 'main' into dev/afierka/eager-migration

371be23

Merge branch 'main' into dev/afierka/eager-migration

bee32df

Merge branch 'main' into dev/afierka/eager-migration

52396df

Merge branch 'main' into dev/afierka/eager-migration

1369724

Merge branch 'main' into dev/afierka/eager-migration

505688c

Disable gemma and deepseek ocr vl tests; add missing runtime_scale_pa…

bc63901

…tching envs Signed-off-by: Artur Fierka <artur.fierka@intel.com>

Merge branch 'main' into dev/afierka/eager-migration

d5cc312

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove all lazy execution mode from the codebase -> eager is the new default#996

Remove all lazy execution mode from the codebase -> eager is the new default#996
afierka-intel wants to merge 10 commits intomainfrom
dev/afierka/eager-migration

afierka-intel commented Feb 20, 2026

Uh oh!

github-actions bot commented Feb 20, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

github-actions bot commented Feb 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

afierka-intel commented Feb 20, 2026

Uh oh!

github-actions bot commented Feb 20, 2026

🚧 CI Blocked

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

github-actions bot commented Feb 20, 2026

🚧 CI Blocked

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants