Skip to content

"resume from checkpoint" lead to CUDA out of memory #11563

Discussion options

You must be logged in to vote

I solved the problem after setting the strategy to 'ddp'.

Replies: 3 comments 1 reply

Comment options

You must be logged in to vote
1 reply
@Defiler24
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by Defiler24
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment