We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent f0d610a commit 8dd0671Copy full SHA for 8dd0671
vllm/v1/worker/gpu_model_runner.py
@@ -1107,7 +1107,7 @@ def execute_model(
1107
else:
1108
mm_embeds = []
1109
1110
- if self.is_multimodal_model:
+ if self.is_multimodal_model and get_pp_group().is_first_rank:
1111
# NOTE(woosuk): To unify token ids and soft tokens (vision
1112
# embeddings), we always use embeddings (rather than token ids)
1113
# as input to the multimodal model, even when the input is text.
0 commit comments