Conversation
Pull request overview
This PR fixes a bug where preempted prompts were incorrectly classified as decode requests, causing runtime errors. The fix introduces a dedicated method to properly identify prompt requests by considering both computed tokens and scheduled tokens, which is especially important for handling preempted sequences.
Changes:
- Added `_is_prompt()` method to accurately determine whether a request is a prompt, accounting for preempted prompts
- Refactored prompt/decode classification logic to use the new helper method
- Added preemption handling test to CI
Reviewed changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| vllm_gaudi/v1/worker/hpu_model_runner.py | Implements _is_prompt() helper method and refactors prompt/decode classification logic to fix preempted prompt handling |
| tests/full_tests/preemption.py | Adds new test file to verify preemption handling with high memory pressure conditions |
| tests/full_tests/ci_gsm8k_tests.sh | Adds preemption test function to CI test suite |
Force-pushed from b0f4a6e to 06b9b4a.
✅ CI Passed: all checks passed successfully against the referenced vllm commit.
Force-pushed from f414a28 to a456bef.
✅ CI Passed: all checks passed successfully against the referenced vllm commit.
Force-pushed from 6bcc307 to 2b0f48c.
The preempted prompts might fail to match the `num_computed_tokens < num_prompt_tokens` test and be treated as decodes, which then causes a runtime error.
- Add `_is_prompt()` to check whether a request is a prompt.
- Consider `num_scheduled_tokens` to handle the preempted prompts.
- Add a test for preemption handling to the CI.
---------
Signed-off-by: Youlei Yang <youlei.yang@intel.com>
a86d05d to
f1bf91d
Compare
Motivation
The preempted prompts might fail to meet the `num_computed_tokens < num_prompt_tokens` test and be treated as decodes, which then causes a runtime error.
Changes
- Added `_is_prompt()` to check whether a request is a prompt.
- Considered `num_scheduled_tokens` to handle the preempted prompts.