How to predict the amount of memory (VRAM) consumed by forward pass? #12289
marloquemegusta started this conversation in General
Replies: 1 comment
-
I don't think that's always the case; it depends on the model and on the computation done during the forward pass. For example, if you have batch-norm layers, they keep some state (running statistics) that doesn't scale with batch size, so even if you change the batch size this constant factor will still be there.
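In other words, a reasonable mental model is total memory = constant term + per-sample term × batch size. A minimal sketch of how you could back out both terms from two measurements (the memory numbers and batch sizes below are made up for illustration; in a real PyTorch run you would obtain them with `torch.cuda.max_memory_allocated()`):

```python
# Hypothetical peak-memory measurements (MiB) at two batch sizes.
# These are assumed numbers, not real profiler output.
mem = {8: 1800.0, 16: 2600.0}

b1, b2 = 8, 16
# Memory model: total(b) = constant + per_sample * b
per_sample = (mem[b2] - mem[b1]) / (b2 - b1)  # MiB per extra sample
constant = mem[b1] - per_sample * b1          # weights, norm-layer state, etc.

print(f"per-sample activation cost: {per_sample:.1f} MiB")
print(f"batch-independent cost:     {constant:.1f} MiB")

# Doubling the batch from 8 to 16 grows memory by 2600/1800 ~ 1.44x, not 2x,
# because the constant term does not scale with batch size.
predicted_32 = constant + per_sample * 32
print(f"predicted at batch 32:      {predicted_32:.1f} MiB")
```

Fitting this line from two (or more) measured points also gives you a quick way to predict the largest batch size that fits in VRAM.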
-
Hi!
I understand that performing the forward pass on a network during training consumes GPU memory, since the activations of the neurons have to be stored. If I'm not mistaken, memory consumption should be proportional to the batch size, as the activations are stored for each data point.
I am trying to estimate memory consumption as a function of batch size, but when I measure it, doubling the batch size does not double the memory consumption.
Is there something I am missing?
Thanks in advance, you all make a great community!
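For the activation part specifically, you can get a rough analytic estimate by counting layer outputs. A toy sketch for a hypothetical MLP (the layer widths and float32 assumption are mine, not from the thread; it also assumes every layer output is retained for the backward pass):

```python
# Rough memory estimate for a toy MLP, assuming float32 (4 bytes per value)
# and that every layer's output is kept for the backward pass.
layer_widths = [784, 512, 256, 10]  # hypothetical layer sizes
BYTES_PER_VALUE = 4

def activation_bytes(batch_size: int) -> int:
    # Per-sample activation count = sum of each layer's output width;
    # this part scales linearly with batch size.
    per_sample = sum(layer_widths[1:])
    return batch_size * per_sample * BYTES_PER_VALUE

def parameter_bytes() -> int:
    # Weights + biases of each linear layer; this part does NOT
    # depend on batch size (nor do optimizer state or framework overhead).
    return sum((n_in * n_out + n_out) * BYTES_PER_VALUE
               for n_in, n_out in zip(layer_widths, layer_widths[1:]))

for b in (32, 64):
    total = parameter_bytes() + activation_bytes(b)
    print(f"batch {b}: {total} bytes")
```

Here `activation_bytes` doubles exactly when the batch doubles, but the total does not, because `parameter_bytes()` (plus, in practice, CUDA context, optimizer state, and allocator caching) stays fixed.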