I have a simple MLP with one input and one output and would like to calculate the first and second order derivatives of the individual outputs with respect to the respective input. Basically I'd like to differentiate the following function twice and evaluate the derivatives at `x_physics`:

```python
def physics_pred(x_physics, params):
    y_physics = net.apply(params, x_physics)
    return y_physics
```

I tried using `jax.grad`, but without success. In the end I'd like to use these derivatives in a physics-informed loss:

```python
def loss_physics(params: hk.Params, x_data: jnp.array, y_data: jnp.array, x_physics: jnp.array):
    y_pred_data = net.apply(params, x_data)
    data_loss = jnp.mean((y_pred_data - y_data)**2)

    # Function to differentiate
    def physics_pred(x_physics, params):
        y_physics = net.apply(params, x_physics)
        return y_physics

    y_pred_physics = physics_pred(x_physics, params)
    df_dx = ...   # TODO: calculate first order derivatives
    df_dx2 = ...  # TODO: calculate second order derivatives
    residual = df_dx2 + mu * df_dx + k * y_pred_physics
    physics_loss = (1e-4) * jnp.mean(residual**2)
    return data_loss + physics_loss
```
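One way the two placeholder lines could be filled in — a sketch rather than a definitive implementation, assuming `x_physics` has shape `(N, 1)` and `net.apply` maps each row independently — is to build a scalar-to-scalar view of the network and push `jax.grad` through it per sample with `jax.vmap`:

```python
# Hypothetical sketch: per-sample first and second derivatives.
# Assumes x_physics has shape (N, 1) and each output row depends
# only on the corresponding input row.
u_scalar = lambda x: net.apply(params, x.reshape(1, 1)).squeeze()

df_dx = jax.vmap(jax.grad(u_scalar))(x_physics.squeeze())             # shape (N,)
df_dx2 = jax.vmap(jax.grad(jax.grad(u_scalar)))(x_physics.squeeze())  # shape (N,)
```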
Edit: Here is a minimal working example of what I'd like to achieve:

```python
import jax.numpy as jnp
from jax import grad, vmap

xs = jnp.linspace(0, 1, 10)

def y(x):
    return x**2

dx = vmap(grad(y))(xs)         # first derivative at each x
dx2 = vmap(grad(grad(y)))(xs)  # second derivative at each x
```

In this example the function `y` maps a scalar to a scalar, so `grad` can be applied directly; my network's output, however, is an array, so the same pattern doesn't apply as-is.

Edit 3: This solution seems to work but I am not sure if it is the most efficient one:

```python
def loss_physics(params: hk.Params, x_data: jnp.array, y_data: jnp.array, x_physics: jnp.array):
    y_pred_data = net.apply(params, x_data)
    data_loss = jnp.mean((y_pred_data - y_data)**2)

    # The solution to the differential equation is represented by our network
    u = lambda x: net.apply(params, x)

    # Calculate first and second derivatives of the network
    u_dx = lambda x: jax.grad(lambda x: jnp.sum(u(x)))(x)
    u_dx2 = lambda x: jax.grad(lambda x: jnp.sum(u_dx(x)))(x)

    # Compute physics loss
    y_pred_physics = net.apply(params, x_physics)
    residual = u_dx2(x_physics) + mu * u_dx(x_physics) + k * y_pred_physics
    physics_loss = (1e-4) * jnp.mean(residual**2)
    return data_loss + physics_loss
```
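The `jnp.sum` trick works here because the network maps each input element to the corresponding output element, so the Jacobian is diagonal: the gradient of `jnp.sum(u(x))` with respect to `x_j` is the sum over `i` of `du_i/dx_j`, which collapses to `du_j/dx_j`. A quick sanity check on the toy example above:

```python
f = lambda x: x**2

# Per-sample derivative via vmap(grad(...)):
d1 = vmap(grad(f))(xs)                  # 2 * xs

# Same values via the sum-then-grad trick, valid because f is element-wise:
d2 = grad(lambda x: jnp.sum(f(x)))(xs)  # also 2 * xs

assert jnp.allclose(d1, d2)
```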
Replies: 1 comment 3 replies
Can you add more information about the shape of the inputs and outputs of `physics_pred`? It's also not clear to me what kind of derivative you're interested in: do you want the element-wise `[dy/dx0, dy/dx1, ...]`, or if `y` is a vector do you want the matrix of derivatives `{dyi/dxj}`? Or, given that `y` might be a vector of the same length as `x`, do you want element-wise derivatives `[dy0/dx0, dy1/dx1, ...]`?

It would be most useful if you could edit your question to add a minimal reproducible example, such that we could run your code and see the same outputs that you're seeing, rather than just guessing at what your function might do. Note that "minimal" here does not imply giving us your entire neural net; it might be sufficient for the sake of the question to replace the network with a simpler stand-in function.
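To illustrate the distinction between these derivative types (a hypothetical sketch, with `f` standing in for the network):

```python
import jax
import jax.numpy as jnp

f = lambda x: jnp.sin(x)  # an element-wise vector -> vector function
x = jnp.arange(3.0)

# Full matrix of derivatives {dyi/dxj}: a (3, 3) matrix, diagonal here.
J = jax.jacobian(f)(x)

# Element-wise derivatives [dy0/dx0, dy1/dx1, ...]: the diagonal of J,
# or computed directly by mapping grad over scalar inputs.
d = jax.vmap(jax.grad(jnp.sin))(x)

assert jnp.allclose(jnp.diag(J), d)
```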