Resuming model training from last checkpoint #9848
Replies: 1 comment
-
Hi @crtnx ,
Resuming training shouldn't lose the progress but yes, the epoch count in the UI starts over. So if your
Personally I use Also, I'm taking a look at your results and it seems unnecessary to resume training further at 99.7 (it's pretty high already and you might want to check overfitting instead) 🤔? Unless you're running this on a toy dataset. 😉 |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
I am aware of the discussion in this post #8176 however I'd like to clarify some points. When I do resume the training from the last checkpoint, the epoch count starts over and potentially the resumed training losing a part of a progress done before? Here the output when the training just started:
Here is the resumed training:
Another point which is not clear - should I specify 'model-last' or 'model-best' as a source for resume?
Beta Was this translation helpful? Give feedback.
All reactions