Commit 4708f13

[Bugfix] Fix default behavior/fallback for pp in v1 (#16057)
Signed-off-by: mgoin <[email protected]>
1 parent a6d042d commit 4708f13

1 file changed: +3 -2 lines changed

vllm/engine/arg_utils.py

Lines changed: 3 additions & 2 deletions
@@ -1521,8 +1521,9 @@ def _is_v1_supported_oracle(self, model_config: ModelConfig) -> bool:
         # PP is supported on V1 with Ray distributed executor,
         # but off for MP distributed executor for now.
         if (self.pipeline_parallel_size > 1
-                and self.distributed_executor_backend == "mp"
-                and _warn_or_fallback("PP (MP distributed executor)")):
+                and self.distributed_executor_backend != "ray"):
+            name = "Pipeline Parallelism without Ray distributed executor"
+            _raise_or_fallback(feature_name=name, recommend_to_remove=False)
             return False
 
         # ngram is supported on V1, but off by default for now.
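
The effect of the changed condition can be illustrated with a minimal, standalone sketch. The two functions below are hypothetical stand-ins, not vLLM code: they reproduce only the boolean conditions from the diff and leave out the `_warn_or_fallback` / `_raise_or_fallback` side effects. The sketch also assumes that `distributed_executor_backend` may still be unset (`None`) when the oracle runs, which is what the "default behavior" in the commit title appears to refer to.

```python
from typing import Optional


def old_pp_blocks_v1(pipeline_parallel_size: int,
                     distributed_executor_backend: Optional[str]) -> bool:
    """Hypothetical stand-in for the condition before this commit.

    Returns True when the check would reject V1 for this configuration.
    Only an explicit "mp" backend was rejected.
    """
    return (pipeline_parallel_size > 1
            and distributed_executor_backend == "mp")


def new_pp_blocks_v1(pipeline_parallel_size: int,
                     distributed_executor_backend: Optional[str]) -> bool:
    """Hypothetical stand-in for the condition after this commit.

    Any backend other than "ray" is rejected, including an unset one.
    """
    return (pipeline_parallel_size > 1
            and distributed_executor_backend != "ray")


if __name__ == "__main__":
    # Compare both conditions for an unset, an "mp", and a "ray" backend.
    for backend in (None, "mp", "ray"):
        print(backend,
              old_pp_blocks_v1(2, backend),
              new_pp_blocks_v1(2, backend))
```

Under these assumptions, the only case whose outcome changes is the unset backend: the old condition let it pass the check, while the new one treats it like any other non-Ray executor.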
