How to do contrastive learning with accumulate_grad_batches? #19132
Unanswered
yipliu asked this question in DDP / multi-GPU / multi-node
Replies: 0 comments
I have found an excellent solution for contrastive learning with DDP. However, a more difficult scenario is combining `accumulate_grad_batches` with DDP for contrastive learning.

For a clearer discussion, suppose I am training with `accumulate_grad_batches=4` on two GPUs. Can anyone help me?
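To make the difficulty concrete, here is a minimal sketch of the batch-size arithmetic for this setup. It assumes a contrastive loss with in-batch negatives, where embeddings can optionally be gathered across GPUs before the loss is computed. The function name and the per-device batch size of 32 are hypothetical values chosen for illustration; only `accumulate_grad_batches=4` and the two GPUs come from the scenario above.

```python
def contrastive_batch_sizes(per_device_batch_size, num_devices,
                            accumulate_grad_batches, gather_across_devices=True):
    """Return (samples_per_optimizer_step, samples_per_loss_call).

    Hypothetical helper for illustration only; not part of any library.
    """
    # Gradient accumulation sums gradients over several forward/backward
    # passes, so this many samples contribute to one optimizer step.
    samples_per_optimizer_step = (
        per_device_batch_size * num_devices * accumulate_grad_batches
    )
    # Each loss call only sees the embeddings of the current forward pass,
    # possibly gathered across devices. Accumulation does not enlarge it.
    visible_devices = num_devices if gather_across_devices else 1
    samples_per_loss_call = per_device_batch_size * visible_devices
    return samples_per_optimizer_step, samples_per_loss_call


# Assumed per-device batch size of 32, two GPUs, accumulate_grad_batches=4.
step_size, loss_size = contrastive_batch_sizes(32, 2, 4)
print(step_size, loss_size)  # 256 64
```

Under these assumptions, gathering across the two GPUs doubles the pool of negatives per loss call, but the four accumulated micro-batches still never see each other's samples as negatives. That gap is what I would like to close.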