Single GPU DDP Lightning implementation is 3x slower than Pytorch - Help needed #8211
Unanswered
sreeaurovindh
asked this question in
DDP / multi-GPU / multi-node
Replies: 1 comment 2 replies
-
What effect would DDP have on a single GPU? Did you try training with multiple GPUs? |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I am trying to convert my Pytorch code into Lightning code with the goal of scaling up and reducing my training time. I am using DDP with single GPU and comparing it against a non lightning Pytorch implementation. The lightning implementation is 3x slower.The following is the simple profiler output.
If you have any recommendations on what I can look out for to debug , please let me know or any logs that may help, please let me know
Beta Was this translation helpful? Give feedback.
All reactions