Commit e40376b

fix: in distributed runs, the other GPUs must synchronize ahead of time and wait for GPU 0 to finish
Signed-off-by: YdrMaster <[email protected]>
1 parent 91ddf73 commit e40376b

1 file changed: 2 additions, 0 deletions

models/llama/cuda/src/nccl_parallel.rs

Lines changed: 2 additions & 0 deletions
@@ -172,6 +172,8 @@ fn test_infer() {
                 );

                 next.send(pair.idx() as _).unwrap()
+            } else {
+                stream.synchronize();
             }
         }
     });
