Skip to content

Commit e47927e

Browse files
authored
[None][fix] change allreduce workspace dtype to torch.int64 to avoid overflow (NVIDIA#9479)
Signed-off-by: Zhenhuan Chen <[email protected]>
1 parent 3ada0bf commit e47927e

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

tensorrt_llm/plugin/plugin.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -738,7 +738,7 @@ def allocate_allreduce_fusion_workspace(
738738
3 * lamport_buffers_size,
739739
)
740740
flag_buffer = torch.tensor([0, 0, 0, lamport_buffers_size, 0],
741-
dtype=torch.int,
741+
dtype=torch.int64,
742742
device="cuda")
743743
buffers = [ipc_buffers, ipc_barriers, lamport_buffers, flag_buffer]
744744

0 commit comments

Comments
 (0)