Compute Loss After Sharing Tensor Across GPUs #7602

Zasder3 · 2021-05-19T01:19:20Z

Zasder3
May 19, 2021

I’m currently attempting to make a Multi-GPU-supported CLIP training script, but am hitting a wall. I need to compute two matrices that are composed of whole batch statistics before I can compute loss. Namely, I need to compute the image and text embeddings of an entire batch. Only then can I compute the sub batch losses.

How can I first calculate and share the whole batch matrices across GPUs before computing losses?

Answered by Zasder3

May 20, 2021

The LightningModule method all_gather(Tensor) solved it all!

View full answer

Zasder3 · 2021-05-20T01:45:36Z

Zasder3
May 20, 2021
Author

The LightningModule method all_gather(Tensor) solved it all!

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Compute Loss After Sharing Tensor Across GPUs #7602

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Compute Loss After Sharing Tensor Across GPUs #7602

Uh oh!

Zasder3 May 19, 2021

Replies: 1 comment

Uh oh!

Zasder3 May 20, 2021 Author

Zasder3
May 19, 2021

Zasder3
May 20, 2021
Author