Efficient computation of per-example model parameter gradients. #8264
Unanswered · JulienSiems asked this question in Q&A · Replies: 0 comments
Hey everyone!
I would like to obtain the individual parameter gradients for each example in a mini-batch in JAX.
In PyTorch this is only possible with hacks/packages such as autograd-hacks or backpack.
I saw the example in the docs explaining how to compute the gradient with respect to each batch element, but I noticed that in order to obtain these gradients the example applies the loss function per example (roughly the vmap-over-grad pattern sketched below). I was wondering whether there is a way to get the per-example parameter gradients directly via reverse-mode autodiff on the averaged loss function instead.
That would make it possible to obtain the per-example gradients during, e.g., a normal training routine without having to recompute the backward pass.
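For context, the per-example pattern from the docs looks roughly like the following minimal sketch (the toy linear loss, parameter names, and shapes are placeholders I made up for illustration):

```python
import jax
import jax.numpy as jnp

# Toy linear model and loss; names and shapes are placeholders for illustration.
def per_example_loss(params, x, y):
    pred = jnp.dot(x, params["w"]) + params["b"]
    return (pred - y) ** 2

def batch_loss(params, xs, ys):
    # The usual averaged loss used in a normal training step.
    return jnp.mean(jax.vmap(per_example_loss, in_axes=(None, 0, 0))(params, xs, ys))

params = {"w": jnp.ones(3), "b": jnp.zeros(())}
xs = jnp.ones((8, 3))  # mini-batch of 8 examples
ys = jnp.zeros(8)

# Per-example parameter gradients: vmap the single-example gradient over the
# batch axis of (x, y). Each leaf of the result gets a leading axis of size 8.
per_example_grads = jax.vmap(jax.grad(per_example_loss), in_axes=(None, 0, 0))(params, xs, ys)

# Averaging the per-example gradients recovers the usual batch gradient.
avg_grad = jax.grad(batch_loss)(params, xs, ys)
```

This works, but it runs its own (vectorized) backward pass on the per-example losses, rather than extracting the per-example gradients from the backward pass of the averaged loss that the training step already computes.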
Thanks a lot!
Julien