-
Notifications
You must be signed in to change notification settings - Fork 18
Resuming Training #14
Copy link
Copy link
Open
Description
Hello! Thank you for open-sourcing EfficientZeroV2.
I have implemented saving the ReplayBuffer, optimizer, and scheduler every save_ckpt_interval, along with the model's weights. Now, I need to implement resuming training and would like some guidance on the best approach. Specifically, I’m wondering if there’s any other data I should save to ensure training can be resumed seamlessly. What is the proper way to resume training?
Additionally, could you please share some tips on how to debug EfficientZeroV2? I am using PyCharm as my IDE, and I am encountering an error related to closed sockets during debugging. I also tried initializing Ray with local_mode=True, but that led to another error.
Thanks in advance!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels