Should the subfunction merge block apply a mask on the adjoint evaluation? by JHopeCollins · Pull Request #4177 · firedrakeproject/firedrake

JHopeCollins · 2025-03-31T14:53:23Z

Background

Adjoining subfunctions is hard because it has to know that it doesn't hold its own data, it just views the full function. This means that when a subfunction is used, it also needs to add the full function onto the tape as a dependency so that the correct data is used.
The pyadjoint.FloatingType base class does with two custom blocks:

When a subfunction usub = u.subfunctions[i] for component i of a full function u is added as a dependency to block block0 on the tape, what actually happens is:
1. a SubfunctionBlock is added to the tape with u as a dependency and usub as an output. This block just filters the usub component.
2. usub is then added as a dependency of block0. Now block0 implicitly depends on the value of u after filtering out everything but usub.
When usub is added as an output to block block1 on the tape, what actually happens is:
1. usub is added as an output to block block1.
2. a FunctionMergeBlock is added to the tape with usub as a dependency and u as an output. This block combines the data in usub with all data in u except that of component i. block1 now implicitly has u as an output, but only for the data in component i.

This test just creates a function in the R space, and tapes 1) assigning to it from a control 2) multiplying by 2 and 3) calculating the square. The tapes with/without using subfunctions are shown below, where you can see the extra blocks in the subfunction case. (The odd numbering is just so the code matches the tape).

with stop_annotating():
    mesh = UnitIntervalMesh(1)
    R = FunctionSpace(mesh, "R", 0)

    w2 = Function(R).assign(2.0)
    w4 = Function(R)
    w7 = u.subfunctions[0]

use_subfunctions = True

continue_annotation()
with set_working_tape() as tape:
    w4.assign(w2)
    if use_subfunctions:
        w7 *= 2
    else:
        w4 *= 2
    J = assemble(inner(w4, w4) * dx)
    rf = ReducedFunctional(J, Control(w2), tape=tape)
pause_annotation()

assert taylor_test(rf, w2, Constant(0.1)) > 1.9

Without using subfunctions:

Using subfunctions:

Problem

This test fails if using subfunctions!
We want the adj_value of the i-th component to depend only on what happens along the subfunction branch, and the adj_value of all other components to be whatever they were before the subfunctions split/merge.

Instead, the adjoint of FunctionMergeBlock was (correctly) outputting the i-th component of the adj_value along the subfunction branch of the tape, but (incorrectly) outputting all components of the adj_value along the full-function branch of the tape.
When these two branches recombined the adj_value for the i-th component then had contributions from both branches.

Solution

The merge out[i] = usub; out[not i] = u is a mask. The adjoint of a mask is also a mask, so FunctionMergeBlock now applies a mask before returning the adjoint outputs: output0 = adj_input[i]; output1 = adj_input[not i]

firedrake/adjoint_utils/blocks/function.py

Changes added

* comment to explain how we evaluate the adjoint of the FunctionMergeBlock

JHopeCollins added 2 commits March 31, 2025 15:48

Should subfunction merge block apply a mask on the adjoint evaluation?

4c87115

test for adjoining subfunction writing

292fff1

dham previously requested changes Apr 2, 2025

View reviewed changes

firedrake/adjoint_utils/blocks/function.py Show resolved Hide resolved

JHopeCollins marked this pull request as ready for review April 2, 2025 15:53

JHopeCollins added 3 commits April 8, 2025 09:30

Merge branch 'master' into JHopeCollins/subfunction_merge_eval_adj

4c7b74c

clearer name for subfunction index in subfunction adjoint blocks

00f5b6d

comment to explain how we evaluate the adjoint of the FunctionMergeBlock

9c431dc

JHopeCollins requested a review from rckirby April 8, 2025 08:47

rckirby approved these changes Apr 8, 2025

View reviewed changes

Merge branch 'master' into JHopeCollins/subfunction_merge_eval_adj

04aa602

connorjward merged commit abd9e35 into master Apr 9, 2025
13 checks passed

connorjward deleted the JHopeCollins/subfunction_merge_eval_adj branch April 9, 2025 15:41

pbrubeck pushed a commit that referenced this pull request Apr 10, 2025

Fix subfunction merge block adjoint (#4177)

bc82efd

* comment to explain how we evaluate the adjoint of the FunctionMergeBlock

This was referenced Nov 4, 2025

Fix subfunction hessian #4703

Closed

Fix subfunction hessian #4705

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Should the subfunction merge block apply a mask on the adjoint evaluation?#4177

Should the subfunction merge block apply a mask on the adjoint evaluation?#4177
connorjward merged 6 commits intomasterfrom
JHopeCollins/subfunction_merge_eval_adj

JHopeCollins commented Mar 31, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

JHopeCollins commented Mar 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Background

Problem

Solution

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

JHopeCollins commented Mar 31, 2025 •

edited

Loading