Understanding how data is split between GPUs when using .predict()
#9773 · Unanswered
aakaashjois asked this question in DDP / multi-GPU / multi-node
Hello,
I have a model trained with PyTorch Lightning and I am trying to use it to make predictions on a set of data. I use the `predict_step` method to handle the work needed to return an output. I have a callback whose `on_predict_batch_end` hook writes the set of inputs and outputs to a file for each GPU. I am running this whole setup on 8 GPUs with `ddp` and a batch size of 1.

Looking at the data that is written, I see that all 8 GPUs receive the same inputs. Based on my understanding of the Multi-GPU Batch Size docs, each GPU should receive a different batch of data.
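For reference, when the DDP strategy replaces a dataloader's sampler with a `DistributedSampler`, each rank should see a disjoint, round-robin shard of the dataset rather than the full set. Here is a minimal pure-Python sketch of that sharding logic (assuming non-shuffled sampling; the function `shard_indices` is my own illustration, not a Lightning API):

```python
import math

def shard_indices(dataset_len, num_replicas, rank):
    """Round-robin sharding in the style of torch's DistributedSampler
    with shuffle=False.

    Indices are padded (by wrapping around to the front) so every rank
    gets the same number of samples, then each rank takes every
    num_replicas-th index starting at its own rank.
    """
    indices = list(range(dataset_len))
    total = math.ceil(dataset_len / num_replicas) * num_replicas
    indices += indices[: total - dataset_len]  # pad by repeating early indices
    return indices[rank:total:num_replicas]

# With 8 replicas and 16 samples, each GPU should get 2 distinct samples:
shards = [shard_indices(16, 8, r) for r in range(8)]
print(shards)  # rank r gets [r, r + 8]
```

If all ranks log identical inputs, the sampler was likely never replaced on the predict dataloader, so it may be worth checking whether a custom sampler or a disabled sampler-replacement setting on the `Trainer` is preventing the substitution.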
Can anyone help me out on trying to understand how the data is being distributed?
Thank you!