num_workers > 0 gets stuck #11447
Unanswered
mlstoic asked this question in Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Replies: 3 comments 1 reply
-
Could you share part of your code?
1 reply
-
Hi @mlstoic, did you fix the problem? Can you help me? I think I have the same problem as you.
0 replies
-
@mlstoic, did you ever solve the problem? I have the same problem here.
0 replies
-
When I use num_workers = 0 for train_dataloader, val_dataloader, and test_dataloader, training finishes one epoch (100%) quickly, although I get loss = NaN and have not figured out why. It also emits a warning that I should use a larger num_workers, and it suggests num_workers = 16.
However, if I use num_workers > 0, it gets stuck at the validation sanity check and does not progress.
Can someone please shed some light on what the issue might be? Thank you.
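For anyone trying to reproduce this, here is a minimal sketch of a check that may help separate a DataLoader problem from a Trainer problem. It is not taken from the original code, which was not shared. The helper name `probe_loader`, the stand-in `TensorDataset`, and the 30-second timeout are all assumptions for illustration. It fetches a single batch with a given num_workers, lets PyTorch raise an error instead of hanging forever if a worker never responds, and reports whether the batch contains NaN or inf values.

```python
import time

import torch
from torch.utils.data import DataLoader, TensorDataset


def probe_loader(dataset, num_workers, timeout_s=30, batch_size=8):
    """Fetch one batch; return (seconds taken, whether all values are finite)."""
    loader = DataLoader(
        dataset,
        batch_size=batch_size,
        num_workers=num_workers,
        # The single-process loader requires timeout == 0, so only set a
        # timeout when worker processes are actually used.
        timeout=timeout_s if num_workers > 0 else 0,
    )
    start = time.perf_counter()
    # With num_workers > 0 and a positive timeout, this raises RuntimeError
    # if no batch arrives from the workers in time.
    batch = next(iter(loader))
    elapsed = time.perf_counter() - start
    # TensorDataset batches are a list of tensors; check each for NaN/inf.
    all_finite = all(torch.isfinite(t).all().item() for t in batch)
    return elapsed, all_finite


# The main guard matters on platforms that start workers with "spawn"
# (Windows, macOS): without it, each worker re-imports and re-runs the script.
if __name__ == "__main__":
    # Hypothetical stand-in data; replace with the real dataset.
    data = TensorDataset(torch.randn(64, 3), torch.randint(0, 2, (64,)))
    for workers in (0, 2):
        elapsed, all_finite = probe_loader(data, workers)
        print(f"num_workers={workers}: {elapsed:.2f}s, finite={all_finite}")
```

If this also stalls or times out with the real dataset and num_workers > 0, the problem is likely in the dataset or the environment rather than in the Trainer. If the batch is not finite, the NaN loss may originate in the input data.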