Is there an easy way to test checkpoints every epoch and draw a line chart using a wandb or tensorboard logger? #16762

guzy0324 · 2023-02-15T06:03:25Z

guzy0324
Feb 15, 2023

I have trained my model and saved checkpoints every epoch. After training, I realize that the validation result which is a line chart is not reliable because the size of validation set is too small. Thus, I want to use a larger validation set to validate all the checkpoints and draw a line chart. I have a solution to this problem which is to validate every checkpoint in separate runs then draw a line chart using matplotlib. However, the solution is not graceful. Is there an easy way such as a parameter of Trainer or rewriting methods of LightningModule to do this job?

Answered by guzy0324

Feb 15, 2023

Instantiating 1 trainer and validating all the checkpoints solves my problem. And the epoch and global step of training time can be resumed by this way.

View full answer

guzy0324 · 2023-02-15T10:05:22Z

guzy0324
Feb 15, 2023
Author

Instantiating 1 trainer and validating all the checkpoints solves my problem. And the epoch and global step of training time can be resumed by this way.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Is there an easy way to test checkpoints every epoch and draw a line chart using a wandb or tensorboard logger? #16762

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Is there an easy way to test checkpoints every epoch and draw a line chart using a wandb or tensorboard logger? #16762

Uh oh!

guzy0324 Feb 15, 2023

Replies: 1 comment

Uh oh!

guzy0324 Feb 15, 2023 Author

guzy0324
Feb 15, 2023

guzy0324
Feb 15, 2023
Author