Reusing computation results across multiple gradients in JAX #16464
-
Hello,
Is there a way to compute the gradients of both values (an objective and a constraint that share the same expensive intermediate computation) together, while reusing the result of that expensive operation? I understand that JAX avoids side effects and mutable state by design, which makes caching results across multiple function calls tricky. Is there a workaround or recommended way to handle this kind of scenario? Or should I simply rely on JIT for common subexpression elimination, as outlined in Discussion #16356?
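
For concreteness, here is a minimal sketch of the kind of setup I mean (the function names are illustrative placeholders, not my actual code):

```python
import jax
import jax.numpy as jnp

def expensive_op(x):
    # Stand-in for the genuinely costly computation
    return jnp.square(x)

def objective(x):
    return jnp.sum(expensive_op(x))

def constraint(x):
    return jnp.sum(expensive_op(x) * 2)

x = jnp.arange(5.0)

# Naively, each grad call re-runs expensive_op in its own forward pass
objective_grad = jax.grad(objective)(x)
constraint_grad = jax.grad(constraint)(x)
```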
-
If you want to put stuff together manually in this case you could use `jax.vjp`, but you'd need to refactor `objective_fn` and `constraint_fn` to take `y` as an argument. Here's what it might look like:

```python
import jax
from jax import grad
import jax.numpy as jnp

# Dummy expensive operation
def expensive_function(x):
    return jnp.square(x)

# Objective, expressed in terms of the intermediate result y
def objective_fn(y):
    return jnp.sum(y)

# Constraint, also expressed in terms of y
def constraint_fn(y):
    return jnp.sum(y * 2)

# Inputs
x = jnp.array([1.0, 2.0, 3.0, 4.0, 5.0], dtype=jnp.float32)

# Run the expensive operation once; vjp_fn can be reused for both gradients
y, vjp_fn = jax.vjp(expensive_function, x)

# Gradients with respect to the intermediate result y
objective_ygrad = grad(objective_fn)(y)
constraint_ygrad = grad(constraint_fn)(y)

# Pull both gradients back through the expensive operation via the same vjp_fn
objective_grad = vjp_fn(objective_ygrad)[0]
constraint_grad = vjp_fn(constraint_ygrad)[0]
```

I'm not 100% sure this is correct since I'm not a JAX expert, but I believe it's basically doing what you want. The Autodiff Cookbook is a great resource which explains the various autodiff options in JAX.
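
As a quick sanity check (not part of the original reply), the `vjp`-based gradients can be compared against differentiating the composed functions directly; the two should agree:

```python
# Differentiate the composed functions directly (recomputes expensive_function)
direct_objective_grad = jax.grad(lambda x: objective_fn(expensive_function(x)))(x)
direct_constraint_grad = jax.grad(lambda x: constraint_fn(expensive_function(x)))(x)

assert jnp.allclose(objective_grad, direct_objective_grad)
assert jnp.allclose(constraint_grad, direct_constraint_grad)
```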