FP16 does not decrease much GPU memory #13252

Hi @icoz69 AFAIK, the memory usage depends on your model architecture — specifically, on the ratio of the model's parameter size to the size of its activations. This is because, with AMP, the model weights always stay in fp32; only some operations (and their activations) are computed and stored in fp16. So if your parameters dominate memory, AMP won't reduce usage by much.
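To make the ratio argument concrete, here is a rough back-of-the-envelope sketch (not a measurement of actual PyTorch allocations — a simplified model that ignores gradients, optimizer state, and allocator overhead). It assumes weights stay at 4 bytes/param under AMP while fp16 activations drop to 2 bytes/element:

```python
def estimated_memory_bytes(n_params: int, n_activation_elems: int, use_amp: bool) -> int:
    """Crude memory estimate: fp32 master weights + activations.

    Under AMP the weights remain fp32 (4 bytes each); only the
    activations are assumed to be stored in fp16 (2 bytes each).
    """
    weight_bytes = n_params * 4                      # always fp32
    act_bytes = n_activation_elems * (2 if use_amp else 4)
    return weight_bytes + act_bytes


def amp_savings(n_params: int, n_activation_elems: int) -> float:
    """Fraction of memory saved by switching fp32 -> AMP in this model."""
    full = estimated_memory_bytes(n_params, n_activation_elems, use_amp=False)
    amp = estimated_memory_bytes(n_params, n_activation_elems, use_amp=True)
    return 1 - amp / full


# Parameter-heavy model: 100M params, 10M activation elements
# -> only ~4.5% saved, because the fp32 weights dominate.
print(f"param-heavy: {amp_savings(100_000_000, 10_000_000):.1%} saved")

# Activation-heavy model: 10M params, 100M activation elements
# -> ~45% saved, because most memory is in fp16-able activations.
print(f"act-heavy:   {amp_savings(10_000_000, 100_000_000):.1%} saved")
```

This is why two models of similar total size can see very different savings from `precision=16`: the benefit scales with how activation-heavy the workload is.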

Answer selected by akihironitta