How would one accomplish distributed training for SimCLR? (i.e. the whole batch needs to be aggregated before calculating the loss) #18312
Unanswered · jeffwillette asked this question in DDP / multi-GPU / multi-node · 0 replies
I looked through the SimCLR example here (https://lightning.ai/docs/pytorch/stable/notebooks/course_UvA-DL/13-contrastive-learning.html) and noticed that it trains SimCLR on a single node. PL seems straightforward to extend to multiple GPUs, but that example would compute the loss on each GPU independently, whereas SimCLR should ideally gather all of the output features first and compute the loss over the entire distributed batch.
Is there a straightforward way to accomplish this with PL, e.g. something along the lines of the sketch below?
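To make the question concrete, here is a minimal sketch of what I have in mind, using `LightningModule.all_gather` with `sync_grads=True` so the gathered features stay in the autograd graph. The `DistributedSimCLR` class, the `encoder` attribute, the `nt_xent` helper, and the `(x1, x2)` batch format are all placeholders of my own for illustration, not anything from the linked example:

```python
import torch
import torch.nn.functional as F
import lightning.pytorch as pl


class DistributedSimCLR(pl.LightningModule):
    def __init__(self, encoder, temperature: float = 0.1):
        super().__init__()
        self.encoder = encoder          # backbone + projection head (placeholder)
        self.temperature = temperature

    def training_step(self, batch, batch_idx):
        # Assumes the dataloader yields two augmented views of each image.
        (x1, x2), _ = batch
        z = self.encoder(torch.cat([x1, x2], dim=0))    # (2B, D) on this GPU

        # Gather projections from every process; sync_grads=True keeps the
        # gathered tensors differentiable so the loss sees the full
        # distributed batch instead of just the local shard.
        z_all = self.all_gather(z, sync_grads=True)     # (world_size, 2B, D)
        z_all = z_all.flatten(0, 1)                     # (world_size * 2B, D)

        loss = self.nt_xent(z_all)
        self.log("train_loss", loss, sync_dist=True)
        return loss

    def nt_xent(self, z):
        # NT-Xent over the gathered batch. Within each per-process block of
        # 2B rows, row i and row i + B are the two views of the same image.
        z = F.normalize(z, dim=1)
        sim = z @ z.T / self.temperature
        n = z.size(0)
        mask = torch.eye(n, dtype=torch.bool, device=z.device)
        sim = sim.masked_fill(mask, float("-inf"))      # exclude self-pairs

        b = n // (2 * self.trainer.world_size)          # local batch size B
        idx = torch.arange(n, device=z.device)
        block_start = idx // (2 * b) * (2 * b)
        within = idx % (2 * b)
        pos = block_start + (within + b) % (2 * b)      # index of partner view
        return F.cross_entropy(sim, pos)
```

I'm not sure whether this is the intended pattern, whether `all_gather(sync_grads=True)` scales the gradients the way the single-GPU loss would, or whether Lightning offers a more idiomatic way to do this.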