Compute gradients for a batch of inputs together with the network params in JAX #8481
Unanswered
Behnam-Asadi asked this question in Q&A
As I am new to JAX this might be a naive question, but what is the best way to update a batch of inputs using the value_and_grad function? In my case the inputs to the network are learnable embedding vectors, and since each iteration's input is a batch of these vectors, I need to update the network params together with the input batch (a batch of embedding vectors). I could define all of the vectors as params of the network, but then value_and_grad would compute gradients with respect to all of the vectors instead of just the specific batch.
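To make the concern concrete, here is a minimal sketch (toy shapes, and made-up names like embed_table and loss_fn, none of which come from this thread): when the whole embedding table lives in params, grad returns a gradient for every vector in the table, not just the current batch.

```python
import jax
import jax.numpy as jnp

key = jax.random.PRNGKey(0)
params = {
    # All learnable embedding vectors live in the param tree.
    "embed_table": jax.random.normal(key, (1000, 32)),
    "w": jax.random.normal(key, (32, 1)),
}

def loss_fn(params, idx, targets):
    batch = params["embed_table"][idx]   # the batch of embedding vectors
    preds = batch @ params["w"]
    return jnp.mean((preds - targets) ** 2)

# Differentiating w.r.t. params yields a gradient for the entire table
# (zero outside the gathered rows), not just for the current batch.
grads = jax.grad(loss_fn)(params, jnp.arange(16), jnp.zeros((16, 1)))
print(grads["embed_table"].shape)  # (1000, 32)
```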
Replies: 2 comments

-
https://jax.readthedocs.io/en/latest/jax.html#jax.value_and_grad
It sounds like you want to take the gradient with respect to only certain inputs; that is what the argnums parameter is for:
argnums (Union[int, Sequence[int]]) – Optional, integer or sequence of integers. Specifies which positional argument(s) to differentiate with respect to (default 0).
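A minimal sketch of that suggestion, using a toy linear model and made-up names (loss_fn, embed_table): pass argnums=(0, 1) so value_and_grad differentiates with respect to both the params and the current batch of embeddings, then scatter the batch gradient back into the full table.

```python
import jax
import jax.numpy as jnp

key = jax.random.PRNGKey(0)
embed_table = jax.random.normal(key, (1000, 32))   # full learnable table
params = {"w": jax.random.normal(key, (32, 1)), "b": jnp.zeros((1,))}

def loss_fn(params, embed_batch, targets):
    preds = embed_batch @ params["w"] + params["b"]
    return jnp.mean((preds - targets) ** 2)

idx = jnp.arange(16)                    # indices of this batch in the table
embed_batch = embed_table[idx]          # gather only the current batch
targets = jnp.zeros((16, 1))

# argnums=(0, 1): gradients w.r.t. both params and the embedding batch.
loss, (param_grads, batch_grads) = jax.value_and_grad(
    loss_fn, argnums=(0, 1))(params, embed_batch, targets)

lr = 1e-2
params = jax.tree_util.tree_map(lambda p, g: p - lr * g, params, param_grads)
# Scatter the batch gradient back into the table: only these rows change.
embed_table = embed_table.at[idx].add(-lr * batch_grads)
```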
-
This helped me train the embedding vectors (probably in an inefficient way), but I am looking for a predefined module, like torch.nn.Embedding in PyTorch, to train the embeddings more efficiently.
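For what it's worth, Flax (a neural-network library built on JAX) ships flax.linen.Embed, which plays roughly the role of torch.nn.Embedding. A minimal sketch; the Model class, shapes, and loss_fn below are illustrative assumptions, not code from this thread.

```python
import jax
import jax.numpy as jnp
import flax.linen as nn

class Model(nn.Module):
    @nn.compact
    def __call__(self, idx):
        # nn.Embed is Flax's counterpart of torch.nn.Embedding.
        x = nn.Embed(num_embeddings=1000, features=32)(idx)
        return nn.Dense(features=1)(x)

model = Model()
idx = jnp.arange(16)                          # indices of the current batch
variables = model.init(jax.random.PRNGKey(0), idx)

def loss_fn(variables, idx, targets):
    preds = model.apply(variables, idx)
    return jnp.mean((preds - targets) ** 2)

# The embedding gradient is dense but zero outside the gathered rows, so a
# plain optimizer step effectively updates only the current batch of vectors.
grads = jax.grad(loss_fn)(variables, idx, jnp.zeros((16, 1)))
```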