Anyone know why the Fault-Tolerant Training document was removed from the latest PyTorch Lightning documentation? #17811
Answered by justusschock
Keiku asked this question in Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Does anyone know why the Fault-Tolerant Training page was removed from the latest PyTorch Lightning documentation? It seems to have existed at least as of version 1.7.7.
Answered by justusschock, Jun 12, 2023
Hey, this feature was removed in PL 2.0.0 because it added too much complexity and thereby hurt training speed. You can still do this manually by saving the random seed (or random state) of your dataloaders/datasets and restoring it when you load the checkpoint.
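For anyone who wants a starting point, here is a minimal sketch of that manual approach in plain PyTorch. It is not the removed Lightning feature, only an illustration. It assumes shuffling is driven by a `torch.Generator` passed to the `DataLoader`, and the names `make_loader`, `epoch_order`, and the checkpoint key `loader_rng_state` are made up for this example. It only restores the shuffle order at an epoch boundary; resuming mid-epoch, worker processes, and augmentation RNGs would need extra handling.

```python
import torch
from torch.utils.data import DataLoader, TensorDataset


def make_loader(generator: torch.Generator) -> DataLoader:
    # Toy dataset of the integers 0..9; shuffling draws from `generator`.
    dataset = TensorDataset(torch.arange(10))
    return DataLoader(dataset, batch_size=2, shuffle=True, generator=generator)


def epoch_order(loader: DataLoader) -> list:
    # Iterate over one full epoch and record the sample order per batch.
    return [batch[0].tolist() for batch in loader]


# --- original run ---
gen = torch.Generator()
gen.manual_seed(1234)
loader = make_loader(gen)
epoch_order(loader)  # epoch 0 advances the generator

# Snapshot the generator state right before epoch 1 starts.
# (Hypothetical checkpoint layout; the state is a tensor, so torch.save works.)
checkpoint = {"epoch": 1, "loader_rng_state": gen.get_state()}
expected = epoch_order(loader)  # what epoch 1 looks like without interruption

# --- resumed run ---
resumed_gen = torch.Generator()
resumed_gen.set_state(checkpoint["loader_rng_state"])
resumed_loader = make_loader(resumed_gen)
resumed = epoch_order(resumed_loader)  # same order as the uninterrupted epoch 1

print(resumed == expected)
```

In a Lightning project, one option would be to store and restore this state through the `on_save_checkpoint` / `on_load_checkpoint` hooks, which both receive the checkpoint dictionary.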
Answer selected by Keiku