Is there an easy way to test checkpoints every epoch and draw a line chart using a wandb or tensorboard logger? #16762

Instantiating a single trainer and validating every checkpoint with it solved my problem. This way, the epoch and global step from training time are restored for each checkpoint as well.
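For anyone landing here later, a minimal sketch of that approach might look like the following. It assumes PyTorch Lightning, checkpoints saved under the default `epoch=N-step=M.ckpt` naming, and a `model` and `datamodule` of your own. The helper names `sorted_checkpoints` and `validate_all` are made up for illustration and are not part of Lightning. Whether the logged points land on the training-time step axis depends on the checkpoint state being restored as described above, so treat that part as unverified here.

```python
import re
from pathlib import Path


def sorted_checkpoints(ckpt_dir):
    """Return the *.ckpt files in ckpt_dir ordered by the epoch in their file name."""
    def epoch_of(path):
        # Assumes Lightning's default "epoch=N-step=M.ckpt" naming;
        # files without an epoch number sort first.
        match = re.search(r"epoch=(\d+)", path.name)
        return int(match.group(1)) if match else -1

    return sorted(Path(ckpt_dir).glob("*.ckpt"), key=epoch_of)


def validate_all(model, datamodule, ckpt_dir, logger=True):
    """Validate every checkpoint with ONE Trainer (hypothetical helper)."""
    import pytorch_lightning as pl  # imported lazily so the helper above has no dependencies

    # A single Trainer (and therefore a single logger run) is reused for all checkpoints.
    trainer = pl.Trainer(logger=logger, accelerator="auto", devices=1)
    results = {}
    for ckpt in sorted_checkpoints(ckpt_dir):
        # ckpt_path makes Lightning load this checkpoint before running validation.
        results[ckpt.name] = trainer.validate(
            model, datamodule=datamodule, ckpt_path=str(ckpt)
        )
    return results
```

Passing a `WandbLogger` or `TensorBoardLogger` instance as `logger` should send whatever the `validation_step` logs with `self.log` to that backend. The returned dictionary maps each checkpoint file name to the metrics list that `trainer.validate` returns, so the chart can also be drawn by hand from it if the logger output is not what you expect.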

Answer selected by guzy0324