Jacobians of chol solve and lu solve are different #14926

GianmarcoCallegher · 2023-03-10T22:13:41Z


The following code compares the Jacobians, w.r.t. the coefficient matrix, of two different ways of solving a system of linear equations. I get different results depending on the method I use to factorize the coefficient matrix. Is that normal?

from jax import jacfwd, random

import jax.numpy as jnp
import jax.scipy as jsp


def solve_chol(A, b):
    L = jsp.linalg.cho_factor(A)
    return jsp.linalg.cho_solve(L, b)


def solve_lu(A, b):
    LU = jsp.linalg.lu_factor(A)
    return jsp.linalg.lu_solve(LU, b)


seed = 13
key = random.PRNGKey(seed=seed)

key, key_A, key_b = random.split(key, 3)

p = 3

A = random.normal(key=key_A, shape=(p, p))
A = A.T @ A + jnp.eye(p)

b = random.normal(key=key_b, shape=(p,))

# These two tests pass
assert jnp.allclose(solve_chol(A, b), solve_lu(A, b))
assert jnp.allclose(solve_chol(A, b), jnp.linalg.solve(A, b))

# These two tests fail
assert jnp.allclose(jacfwd(solve_chol, 0)(A, b), jacfwd(solve_lu, 0)(A, b))
assert jnp.allclose(jacfwd(solve_chol, 0)(A, b), jacfwd(jnp.linalg.solve, 0)(A, b))

# This test passes
assert jnp.allclose(jacfwd(solve_lu, 0)(A, b), jacfwd(jsp.linalg.solve, 0)(A, b))

Answered by mattjj

mattjj (Maintainer) · 2023-03-10T22:37:33Z

It's expected but surprising. It comes down to whether you think of Cholesky as a function on symmetric square matrices, or as a function on the upper triangles of arbitrary square matrices. JAX's default convention is to choose the former. See this comment on #10815.

One way to make them agree is to make the functions you're calling functions on symmetric matrices (via orthogonal projection onto that subspace):

from jax import jacfwd, random

import jax.numpy as jnp
import jax.scipy as jsp


def solve_chol(A, b):
    A = (A + A.T) / 2.  # NEW
    L = jsp.linalg.cho_factor(A)
    return jsp.linalg.cho_solve(L, b)


def solve_lu(A, b):
    A = (A + A.T) / 2.  # NEW
    LU = jsp.linalg.lu_factor(A)
    return jsp.linalg.lu_solve(LU, b)


seed = 13
key = random.PRNGKey(seed=seed)

key, key_A, key_b = random.split(key, 3)

p = 3

A = random.normal(key=key_A, shape=(p, p))
A = A.T @ A + jnp.eye(p)

b = random.normal(key=key_b, shape=(p,))

print(jacfwd(solve_chol, 0)(A, b))
print(jacfwd(solve_lu, 0)(A, b))
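
As a rough sanity check on the two conventions, the Jacobians can be compared against closed forms. This is only a sketch, not something from the thread: it assumes the standard identity d(A^{-1} b) = -A^{-1} dA A^{-1} b, and the names solve_chol_sym, solve_lu_sym, J_plain and J_sym are made up for illustration. Treating A as an arbitrary square matrix gives J[i, j, k] = -(A^{-1})[i, j] * x[k]; restricting to symmetric inputs averages that over the swap of j and k.

```python
import jax.numpy as jnp
import jax.scipy as jsp
from jax import jacfwd, random


def solve_chol_sym(A, b):
    S = (A + A.T) / 2.0  # project onto symmetric matrices, as in the answer
    return jsp.linalg.cho_solve(jsp.linalg.cho_factor(S), b)


def solve_lu_sym(A, b):
    S = (A + A.T) / 2.0  # same projection for the LU path
    return jsp.linalg.lu_solve(jsp.linalg.lu_factor(S), b)


key = random.PRNGKey(13)
key, key_A, key_b = random.split(key, 3)
p = 3
A = random.normal(key_A, (p, p))
A = A.T @ A + jnp.eye(p)  # symmetric positive definite
b = random.normal(key_b, (p,))

x = jnp.linalg.solve(A, b)
A_inv = jnp.linalg.inv(A)

# A as an arbitrary square matrix: J[i, j, k] = d x_i / d A_jk
J_plain = -jnp.einsum("ij,k->ijk", A_inv, x)
# A restricted to symmetric matrices: average over swapping j and k
J_sym = 0.5 * (J_plain + jnp.swapaxes(J_plain, 1, 2))

tol = dict(rtol=1e-3, atol=1e-4)  # loose tolerances, float32 by default
plain_ok = jnp.allclose(jacfwd(jnp.linalg.solve, 0)(A, b), J_plain, **tol)
chol_sym_ok = jnp.allclose(jacfwd(solve_chol_sym, 0)(A, b), J_sym, **tol)
lu_sym_ok = jnp.allclose(jacfwd(solve_lu_sym, 0)(A, b), J_sym, **tol)
print(plain_ok, chol_sym_ok, lu_sym_ok)
```

If the identity above is applied correctly, all three comparisons should hold up to floating-point error, which is one way to see that the symmetrized functions share a single Jacobian while the unsymmetrized LU path follows the unrestricted formula.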

They could probably be made to agree as functions on just one particular triangle of the input, though that'd take a little more fiddling.
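
One possible reading of "functions on just one particular triangle", offered as a sketch under assumptions rather than what was actually meant: rebuild a symmetric matrix from the upper triangle before factorizing, so both solvers depend only on the entries A[j, k] with j <= k. The helper sym_from_upper and the function names below are hypothetical.

```python
import jax.numpy as jnp
import jax.scipy as jsp
from jax import jacfwd, random


def sym_from_upper(A):
    # Hypothetical helper: keep the upper triangle (with diagonal) and
    # mirror its strictly-upper part into the lower triangle.
    return jnp.triu(A) + jnp.triu(A, 1).T


def solve_chol_upper(A, b):
    S = sym_from_upper(A)
    return jsp.linalg.cho_solve(jsp.linalg.cho_factor(S), b)


def solve_lu_upper(A, b):
    S = sym_from_upper(A)
    return jsp.linalg.lu_solve(jsp.linalg.lu_factor(S), b)


key = random.PRNGKey(13)
key, key_A, key_b = random.split(key, 3)
p = 3
A = random.normal(key_A, (p, p))
A = A.T @ A + jnp.eye(p)  # symmetric positive definite
b = random.normal(key_b, (p,))

J_chol = jacfwd(solve_chol_upper, 0)(A, b)
J_lu = jacfwd(solve_lu_upper, 0)(A, b)

# The two Jacobians should agree up to floating-point error ...
agree = jnp.allclose(J_chol, J_lu, rtol=1e-3, atol=1e-4)
# ... and entries for the strictly lower triangle (j > k) should vanish,
# because the functions never read those inputs.
strict_lower = jnp.tril(jnp.ones((p, p)), -1)
lower_is_zero = jnp.allclose(J_chol * strict_lower, 0.0)
print(agree, lower_is_zero)
```

Compared with the (A + A.T) / 2 projection, this variant puts the whole sensitivity to an off-diagonal pair on the upper entry instead of splitting it evenly between A[j, k] and A[k, j].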

WDYT?


GianmarcoCallegher Mar 14, 2023
Author

Thank you very much. Your explanation in this comment is really clear. I see your point and why it makes sense to project the matrix onto the subspace of symmetric matrices. It is kinda weird, but it probably makes sense to leave things as they are 😄
