Skip to content

Commit 4d95323

Browse files
committed
Update NCCL_NCHANNELS_PER_NET_PEER section and add "recv" to whitelist
1 parent f6587ec commit 4d95323

File tree

2 files changed

+3
-2
lines changed

2 files changed

+3
-2
lines changed

.github/actions/spelling/allow.txt

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -273,6 +273,7 @@ pytorch
273273
quantumespresso
274274
quasiparticles
275275
quickstart
276+
recv
276277
rgw
277278
ripgrep
278279
rocm

docs/software/communication/nccl.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -22,13 +22,13 @@ While the container engine sets these automatically when using the NCCL hook, th
2222

2323
[_Demystifying NCCL: An In-depth Analysis of GPU Communication Protocols and Algorithms_](https://arxiv.org/abs/2507.04786v2) contains detailed information about NCCL algorithms and protocols, which can be helpful for deciding if your application could benefit from an alternative configuration.
2424

25-
In addition to the above variables, setting `NCCL_NCHANNELS_PER_NET_PEER` can improve send/recv performance:
25+
In addition to the above variables, setting `NCCL_NCHANNELS_PER_NET_PEER` can improve point-to-point performance (operations based directly on send/recv):
2626

2727
```bash
2828
export NCCL_NCHANNELS_PER_NET_PEER=4
2929
```
3030

31-
A value of 4 is generally a good compromise to improve send/recv performance without affecting collectives performance.
31+
A value of 4 is generally a good compromise to improve point-to-point performance without affecting collectives performance.
3232
Setting it to a higher value such as 16 or 32 can still further improve send/recv performance, but may degrade collectives performance, so the optimal value depends on the mix of operations used in an application.
3333
The option is undocumented, but [this issue](https://github.com/NVIDIA/nccl/issues/1272) contains additional details.
3434

0 commit comments

Comments
 (0)