How to retrive training states (epoch, ‘loss_val’, etc..) from a checkpoint? #13701
Unanswered
acercyc
asked this question in
Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Replies: 1 comment
-
you can load the checkpoint manually using |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I setup a wandb logger and a checkpoint_callback to save model only when the ‘loss_val’ is minimal.
Of course, the traning continues and stops after several epochs from the checkpoint with the lowest ‘loss_val’
How can I find out what the epoch, loss_val, etc. values are for that checkpoint?
It is possible to use wandb log and find it. It would be nice if there were a native way to get the state of a particular checkpoint.
Beta Was this translation helpful? Give feedback.
All reactions