Commit 6eb84a0

nanz-nv authored and vasunvidia committed
Reenable overlapping of stashing kernels
1 parent a53a36e commit 6eb84a0

1 file changed, +1 -1 lines changed

megatron/core/transformer/moe/paged_stash.py

Lines changed: 1 addition & 1 deletion
@@ -525,7 +525,7 @@ def __init__(self):
         """Initialize the manager with queues and dedicated CUDA streams."""
         # allocate streams and events for synchronization
         self.enabled = False
-        self._pack_stream = torch.cuda.current_stream()#torch.cuda.Stream()
+        self._pack_stream = torch.cuda.Stream()
         # Currently paged stashing is not stream-safe, so use the same stream for packing
         # and unpacking
         self._unpack_stream = self._pack_stream
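
For readers unfamiliar with why the stream choice matters: work queued on `torch.cuda.current_stream()` is serialized with the compute kernels already on that stream, while work on a separate `torch.cuda.Stream()` may run concurrently once the two streams are synchronized explicitly. The sketch below illustrates that general pattern only. It is not code from `paged_stash.py`; the helper name `stash_on_side_stream` is hypothetical, and the synchronization shown is an assumption about how a side-stream copy is typically ordered, not a description of what the manager does.

```python
import torch


def stash_on_side_stream(src: torch.Tensor) -> torch.Tensor:
    """Copy `src` on a dedicated CUDA stream (hypothetical helper, not Megatron code)."""
    if not src.is_cuda:
        # No CUDA tensor means there is no stream to overlap with: plain copy.
        return src.clone()

    main_stream = torch.cuda.current_stream(src.device)
    side_stream = torch.cuda.Stream(device=src.device)

    # Allocate the destination on the main stream, before switching streams.
    dst = torch.empty_like(src)

    # The side stream must not read `src` before the main stream has produced it.
    side_stream.wait_stream(main_stream)
    with torch.cuda.stream(side_stream):
        dst.copy_(src, non_blocking=True)

    # Mark the point on the side stream at which the copy is complete.
    done = torch.cuda.Event()
    done.record(side_stream)

    # Order the main stream after the copy before anything on it touches `dst`
    # or reuses the memory behind `src`.
    main_stream.wait_event(done)
    return dst
```

In this sketch the main stream waits on the event immediately, which keeps the example safe but leaves little room for overlap. A real pipeline would presumably defer the `wait_event` call to the point where the stashed data is consumed, so that kernels launched on the main stream in between can run alongside the copy; doing so also requires care with the caching allocator (for example `Tensor.record_stream`) for tensors shared across streams.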
