Commit f077eca

soulitzer authored and pytorchmergebot committed
Fix inductor collective runtime units (pytorch#168055)
Pull Request resolved: pytorch#168055
Approved by: https://github.com/eellison
ghstack dependencies: pytorch#166536
1 parent 57f36c9 commit f077eca

1 file changed, +2 -2 lines changed

torch/_inductor/comm_analysis.py

Lines changed: 2 additions & 2 deletions
@@ -321,7 +321,7 @@ def estimate_nccl_collective_runtime_impl(
 
 def estimate_nccl_collective_runtime(node: ir.IRNode) -> float:
     """
-    Returns estimated NCCL collective runtime in nanoseconds (ns).
+    Returns estimated NCCL collective runtime in milliseconds (ms).
 
     The following heuristics are copied from https://github.com/NVIDIA/nccl/blob/master/src/graph/tuning.cc.
     We aim to estimate the runtime as accurately as possible.
@@ -355,7 +355,7 @@ def estimate_nccl_collective_runtime_from_fx_node(
     use_nccl_estimator: bool = True,
 ) -> float:
     """
-    Returns estimated NCCL collective runtime in nanoseconds (ns).
+    Returns estimated NCCL collective runtime in milliseconds (ms).
 
     The following heuristics are copied from https://github.com/NVIDIA/nccl/blob/master/src/graph/tuning.cc.
     We aim to estimate the runtime as accurately as possible.
