Does jax.value_and_grad imply jit?
#16678
-
From my understanding,
Replies: 2 comments 1 reply
-
Hi. As far as I understand, this is not the case. To be able to calculate the value and gradient of a function, all you need is a 'recipe' for differentiation, which is either inferred from the computational graph or defined with a custom_vjp. I attached an example of a function which cannot be jitted because it uses calls to original numpy, but can be differentiated because a custom_vjp is defined:
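A minimal sketch of that kind of setup (the function np_sin and its VJP rule here are illustrative, not the original attachment):

```python
import numpy as np  # plain NumPy, not jax.numpy
import jax

@jax.custom_vjp
def np_sin(x):
    # Calls plain NumPy, so this body cannot be traced and therefore cannot be jitted.
    return np.sin(x)

def np_sin_fwd(x):
    # Forward pass: compute the primal output and save cos(x) as the residual.
    return np.sin(x), np.cos(x)

def np_sin_bwd(cos_x, g):
    # Backward pass: hand-written VJP rule, since d/dx sin(x) = cos(x).
    return (cos_x * g,)

np_sin.defvjp(np_sin_fwd, np_sin_bwd)

print(jax.grad(np_sin)(0.5))  # works, prints ~0.8776 (= cos(0.5))
# jax.jit(np_sin)(0.5) would fail: plain NumPy cannot convert an abstract tracer to an array
```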
I hope this illustrates the point that you don't need to be able to jit a function to find its gradients. An important point to mention here is that even without jit, each operation is still compiled and executed by XLA, just one at a time. However, if you do use jit (just-in-time compilation), the whole function is compiled together, so you benefit from XLA's whole-program optimizations.
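For the question in the title specifically: if you want the compiled version, you have to ask for it explicitly, e.g. by wrapping value_and_grad in jit. A minimal sketch (the loss function, shapes, and data are made up for illustration):

```python
import jax
import jax.numpy as jnp

def loss(w, x, y):
    pred = x @ w
    return jnp.mean((pred - y) ** 2)

w = jnp.zeros(3)
x = jnp.ones((8, 3))
y = jnp.ones(8)

# value_and_grad alone: differentiable, but executed op by op.
value, grads = jax.value_and_grad(loss)(w, x, y)

# Wrap it in jit explicitly to compile the whole value-and-gradient
# computation as one XLA program.
fast_vg = jax.jit(jax.value_and_grad(loss))
value, grads = fast_vg(w, x, y)
```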
-
There's a more concise way to see this:

```python
import jax

def relu(x):
    return 0.0 if x < 0 else x

print(jax.grad(relu)(1.0))
# 1.0

print(jax.jit(relu)(1.0))
# ConcretizationTypeError: Abstract tracer value encountered...
```
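Continuing the snippet above, value_and_grad follows the same rule as grad, which answers the original question directly:

```python
print(jax.value_and_grad(relu)(1.0))
# (1.0, 1.0)
```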