#555 introduced `DynamicPPL.ReshapeTransform`, which is very nice, but there seems to be a bug in ReverseDiff.jl that causes gradient computation to fail when `ReshapeTransform` is composed with a broadcasted function. I reported the upstream bug at JuliaDiff/ReverseDiff.jl#265. In the context of DynamicPPL, this occurs when we have something like the following:
```julia
using DynamicPPL: invlink_transform, ReshapeTransform
using Distributions
using ReverseDiff

f(x) = invlink_transform(InverseGamma(2, 3))(x)
g(x) = ReshapeTransform(())(x)
h = f ∘ g
ReverseDiff.gradient(h, [1.0])
```
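For what it's worth, if the problem really is just the reshape-to-`()` step composed with a broadcasted function, a DynamicPPL-free version of the same pattern might look roughly like this (a sketch under that assumption; `reshape_to_scalar` and `broadcast_exp` are hypothetical stand-ins, and I'd expect it to hit the same ReverseDiff code path as the example above):

```julia
using ReverseDiff

# Stand-ins (assumptions for the sketch): plain `reshape(x, ())` in place of
# ReshapeTransform(()), and a broadcasted `exp` in place of the inverse link.
reshape_to_scalar(x) = reshape(x, ())   # singleton vector -> zero-dimensional array
broadcast_exp(y) = sum(exp.(y))         # broadcasted function; `sum` keeps the output scalar
ReverseDiff.gradient(broadcast_exp ∘ reshape_to_scalar, [1.0])
```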
I suspect we should be able to change the implementation of `ReshapeTransform` to circumvent this, though. I don't actually know all the possible shapes `ReshapeTransform` has to handle, nor whether different input/output shapes would give different ReverseDiff errors. However, I dug into a couple of the failing tests in Turing.jl, and it seems that both of them stem from `ReshapeTransform` being given singleton arrays (e.g. `[1.0]` above). Furthermore, the error message observed in all the other failing tests is the same (although I didn't verify that they ultimately stem from singleton arrays). So I think we could special-case this behaviour to keep ReverseDiff on our side.