Where should I place my optimizer.zero_grad? #534
vence-andersen started this conversation in General
Replies: 1 comment
-
Hi, the thing is you can place optimizer.zero_grad() anywhere, as long as the gradients are cleared before the next backward pass. So placing it at the beginning of the loop right after you iterate over the dataloader, at the very end right after optimizer.step(), or somewhere in between all works. Just make sure the gradients have been zeroed before the next loss.backward() is called, so they don't accumulate across iterations; that is the key point. Hope that helps.
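For example, here is a minimal sketch of a training loop with zero_grad() at the top of each iteration (model, dataloader, loss_fn, and optimizer are assumed to already be defined); moving the call to just after optimizer.step() would work equally well:

```python
for X, y in dataloader:
    optimizer.zero_grad()      # clear gradients left over from the previous iteration
    y_pred = model(X)          # forward pass
    loss = loss_fn(y_pred, y)  # compute the loss
    loss.backward()            # backward pass: populates param.grad for each parameter
    optimizer.step()           # update the parameters using those gradients
    # optimizer.zero_grad() could equally be called here instead of at the top
```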
-
Hi,
After watching your course content, I'm now reading other people's code to see how they solve problems. One thing I noticed is that some of them call optimizer.zero_grad() last: they have loss.backward(), then optimizer.step(), and finally optimizer.zero_grad(). From your unofficial PyTorch song (I sing it every time to remember the steps) and of course your course content, I thought we needed to follow a fixed order. Can I interchange the positions of all three, or can I only move zero_grad() to the end? Could you please clarify?
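In other words, the loop I keep seeing looks roughly like this (just a sketch; model, dataloader, loss_fn, and optimizer stand in for whatever the code actually defines):

```python
for X, y in dataloader:
    y_pred = model(X)          # forward pass
    loss = loss_fn(y_pred, y)  # compute the loss
    loss.backward()            # backward pass
    optimizer.step()           # update the parameters
    optimizer.zero_grad()      # clear gradients at the end, ready for the next iteration
```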
Thanks in advance.