Skip to content

Commit 3d6934e

Browse files
authored
update benchmark doc (#10995)
* update benchmark doc * update by comment
1 parent ab953ba commit 3d6934e

File tree

1 file changed

+11
-0
lines changed

1 file changed

+11
-0
lines changed

benchmark/fluid/README.md

Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -58,3 +58,14 @@ kubectl create -f myjob/
5858
```
5959

6060
The job shall start.
61+
62+
63+
## Notes for Run Fluid Distributed with NCCL2 and RDMA
64+
65+
Before running NCCL2 distributed jobs, please check that whether your node has multiple network
66+
interfaces, try to add the environment variable `export NCCL_SOCKET_IFNAME=eth0` to use your actual
67+
network device.
68+
69+
To run high-performance distributed training, you must prepare your hardware environment to be
70+
able to run RDMA enabled network communication, please check out [this](https://github.com/PaddlePaddle/Paddle/blob/develop/doc/fluid/howto/cluster/nccl2_rdma_training.md)
71+
note for details.

0 commit comments

Comments
 (0)