-
Hello all. Here is the info about the HPC hardware: Intel Xeon CPU E5-2690 v2 @ 3.00GHz; Mem 124GB; 20 cores per node available; 49 nodes available. The dataset is the same, and the batch size is all set to be 1 for testing. I have attached the two log files and input scripts, and I really appreciate your help! log8nodes-20coreseach-btch1.txt |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
MPI training applies data parallelism. The time for each step will not be reduced but the batch size will be increased. The performance of multiple nodes may be limited by your network. |
Beta Was this translation helpful? Give feedback.
MPI training applies data parallelism. The time for each step will not be reduced but the batch size will be increased.
The performance of multiple nodes may be limited by your network.