You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
topo: add NIC to numa domain on g5.48xlarge platforms
Since G5 platforms did not support PCIe technologies properly,
the NIC generated by NCCL in the topology file reported a numa
domain of -1. NCCL further assigns it the same system ID as the CPU.
CPUs with the same system ID are not connected to each other
during topology generation. This results in a seg fault during
topology path computation as GPUs are not able to find a
path to the NIC.
Signed-off-by: Sunita Bhaskaran <[email protected]>
0 commit comments