Why is my gpu-util low? #7082
Answered
by
awaelchli
yllgl
asked this question in
DDP / multi-GPU / multi-node
-
Beta Was this translation helpful? Give feedback.
Answered by
awaelchli
Apr 18, 2021
Replies: 1 comment 5 replies
-
When you compare the two implementations, make sure to leave out as many changing variables as possible. For example, since you train with DDP, run it only on 2 GPUs so that you can be sure it's not bottlenecked by CPU. I don't know the Dali data loader very well, but I doubt that they can guarantee a throughput increase for all use cases. |
Beta Was this translation helpful? Give feedback.
5 replies
Answer selected by
yllgl
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
When you compare the two implementations, make sure to leave out as many changing variables as possible. For example, since you train with DDP, run it only on 2 GPUs so that you can be sure it's not bottlenecked by CPU. I don't know the Dali data loader very well, but I doubt that they can guarantee a throughput increase for all use cases.