fix multi-GPU validation to include pipeline parallelism by nightflight-dk · Pull Request #116 · triton-inference-server/vllm_backend

nightflight-dk · 2025-12-29T07:06:29Z

This PR updates the validation logic in TritonPythonModel to correctly identify multi-GPU configurations that use Pipeline Parallelism (PP).

Previously, the check only considered Tensor Parallelism (TP) size (tensor_parallel_size > 1) when verifying if a model configured with KIND_GPU (intended for single-GPU) was attempting to run in a multi-GPU setup. This meant that models using Pipeline Parallelism (where pipeline_parallel_size > 1) could potentially bypass this check if tensor_parallel_size was 1.

Changes:

Retrieve pipeline_parallel_size from vllm_engine_config (defaulting to 1).
Update the validation condition to check if the total parallelism (tp_size * pp_size) exceeds 1.
Ensures that users are correctly prompted to use KIND_MODEL for any multi-GPU configuration, whether it involves TP, PP, or both.
Testing:

Verified that setting pipeline_parallel_size > 1 with
KIND_GPU now raises the expected ValueError.

nightflight-dk · 2026-01-06T03:58:04Z

@yinggeh @mc-nv pls review, thanks

yinggeh

LGTM

yinggeh · 2026-01-07T06:49:58Z

Can you also update the copyright to 2026 in the first line?

nightflight-dk · 2026-01-09T03:32:33Z

@yinggeh done, ready to merge

yinggeh · 2026-01-09T05:17:29Z

@nightflight-dk You should only update copyrights of the file you edited, which is src/model.py.

nightflight-dk · 2026-01-09T05:21:50Z

@nightflight-dk You should only update copyrights of the file you edited, which is src/model.py.
done

yinggeh · 2026-01-09T05:27:22Z

Thanks for contributing. This change will be included in Triton 26.01 release.

Fix multi-GPU validation to include pipeline parallelism

d932fa0

yinggeh previously approved these changes Jan 7, 2026

View reviewed changes

yinggeh added the bug Something isn't working label Jan 7, 2026

yinggeh self-requested a review January 7, 2026 07:24

copyright update

d5edd1b

nightflight-dk dismissed yinggeh’s stale review via d5edd1b January 9, 2026 03:31

revert cp

df40cc2

yinggeh approved these changes Jan 9, 2026

View reviewed changes

yinggeh merged commit 41cb2c6 into triton-inference-server:main Jan 9, 2026
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix multi-GPU validation to include pipeline parallelism#116

fix multi-GPU validation to include pipeline parallelism#116
yinggeh merged 3 commits intotriton-inference-server:mainfrom
nightflight-dk:fix/pipeline-parallel-validation

nightflight-dk commented Dec 29, 2025

Uh oh!

nightflight-dk commented Jan 6, 2026

Uh oh!

yinggeh left a comment

Uh oh!

yinggeh commented Jan 7, 2026

Uh oh!

nightflight-dk commented Jan 9, 2026

Uh oh!

yinggeh commented Jan 9, 2026

Uh oh!

nightflight-dk commented Jan 9, 2026

Uh oh!

Uh oh!

yinggeh commented Jan 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

nightflight-dk commented Dec 29, 2025

Uh oh!

nightflight-dk commented Jan 6, 2026

Uh oh!

yinggeh left a comment

Choose a reason for hiding this comment

Uh oh!

yinggeh commented Jan 7, 2026

Uh oh!

nightflight-dk commented Jan 9, 2026

Uh oh!

yinggeh commented Jan 9, 2026

Uh oh!

nightflight-dk commented Jan 9, 2026

Uh oh!

Uh oh!

yinggeh commented Jan 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants