-
Notifications
You must be signed in to change notification settings - Fork 3.6k
Closed
Description
Hi, I am currently working on an asynchronous version of TorchCheckpointIO
that utilizes multiprocessing
to save the checkpoints without blocking training. During this, I stumbled across this TODO and wondered whether it can be resolved, i.e. whether I need to duplicate that logic in my implementation?
Metadata
Metadata
Labels
checkpointingRelated to checkpointingRelated to checkpointing