
Commit 8cdf492

topo: add NIC to numa domain on g5.48xlarge platforms
Since G5 platforms did not support PCIe technologies properly, the NIC generated by NCCL in the topology file reported a numa domain of -1. NCCL further assigns it the same system ID as the CPU. CPUs with the same system ID are not connected to each other during topology generation. This results in a seg fault during topology path computation as GPUs are not able to find a path to the NIC.

Signed-off-by: Sunita Bhaskaran <[email protected]>
1 parent 8df99dc commit 8cdf492

1 file changed: +1 -0 lines changed

topology/g5.48xl-topo.xml

Lines changed: 1 addition & 0 deletions
@@ -21,5 +21,6 @@ virtual machine.
 <pci busid="0000:00:1b.0"/>
 <pci busid="0000:00:1c.0"/>
 <pci busid="0000:00:1d.0"/>
+<pci busid="0000:00:15.0"/>
 </cpu>
 </system>
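
As a rough way to check the effect of this change, the sketch below looks up which `<cpu>` element a given PCI bus ID is nested under in a topology XML. It is only an illustration: the `numa_domain_of` helper and the `SAMPLE_TOPO` string are hypothetical, the `numaid` attribute name is assumed rather than taken from this diff, and the sample is a trimmed stand-in, not the real contents of `topology/g5.48xl-topo.xml`.

```python
import xml.etree.ElementTree as ET

# Hypothetical, trimmed stand-in for a topology file. The numaid attribute
# name and value are assumptions, not copied from g5.48xl-topo.xml.
SAMPLE_TOPO = """
<system version="1">
  <cpu numaid="0">
    <pci busid="0000:00:1b.0"/>
    <pci busid="0000:00:1c.0"/>
    <pci busid="0000:00:1d.0"/>
    <pci busid="0000:00:15.0"/>
  </cpu>
</system>
"""


def numa_domain_of(topo_xml, busid):
    """Return the numaid of the <cpu> element containing the given PCI bus ID.

    Returns None when the bus ID is not nested under any <cpu> element.
    """
    root = ET.fromstring(topo_xml.strip())
    for cpu in root.iter("cpu"):
        for pci in cpu.iter("pci"):
            if pci.get("busid") == busid:
                return cpu.get("numaid")
    return None


if __name__ == "__main__":
    # Bus ID of the entry added by this commit.
    print(numa_domain_of(SAMPLE_TOPO, "0000:00:15.0"))
```

A `None` result for the NIC's bus ID would mean the device is not listed under any `<cpu>` element in the file being checked.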
