Skip to content
Discussion options

You must be logged in to vote

Solved: gradients weren't being propagated through the diffusion loss term properly.

To replace jax.jvp in PyTorch: one should use functorch.jvp to properly propagate gradients, or use torch.autograd.functional.jvp with create_graph=True.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by ehonig
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
1 participant