Data loader fails with multi-gpu #17935
Unanswered
moradza
asked this question in
DDP / multi-GPU / multi-node
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
My model gets trained on a single gpu but when I want to scale it up to multi-gpu it gets killed with no obvious error message. I am feeling it has to do with data loader memory use. Anybody has similar experience?
Strategy = DDP
Beta Was this translation helpful? Give feedback.
All reactions