perf: VQE ablation tests #1416


Open · wants to merge 15 commits into main from ss/ablation-tests/vqe

Conversation

@mofeing (Collaborator) commented Jun 23, 2025

No description provided.

@mofeing mofeing force-pushed the ss/ablation-tests/vqe branch from 66efa2c to 216e3bf Compare June 24, 2025 11:48
@mofeing (Collaborator, Author) commented Jun 24, 2025

[Screenshot: benchmark results, 2025-06-24 22:42]

@wsmoses could there be a pass in :all mode that undoes the performance gains seen with the :before_enzyme and :after_enzyme pass sets?

This benchmark consists almost entirely of dot_general ops and the transposes around them.
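As a rough NumPy analogue (purely illustrative, not the actual StableHLO program): dot_general is a generalized tensor contraction, and the surrounding transposes only reorder axes before the contraction, so an optimizer can often fold them into the dimension numbers of the dot itself.

```python
import numpy as np

# Hypothetical small shapes, just to illustrate the transpose + dot_general pattern
# that dominates this benchmark.
a = np.arange(12.0).reshape(4, 3)
b = np.arange(15.0).reshape(3, 5)

at = a.T  # stablehlo.transpose analogue

# dot_general analogue: contract axis 0 of `at` with axis 0 of `b`.
out = np.einsum("ki,kj->ij", at, b)

# The transpose round-trip is equivalent to a plain matmul, which is why
# folding transposes into the contraction is a meaningful optimization.
assert np.allclose(out, a @ b)
```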

@wsmoses (Member) commented Jun 24, 2025

Yeah potentially, definitely merits investigation.

cc @avik-pal, especially since the FFT-related issue is similar.

@avik-pal (Collaborator) commented Jun 24, 2025 via email

@mofeing (Collaborator, Author) commented Jun 24, 2025

Here is the MLIR generated for each case: vqe-mlir.zip

I'm running into problems with xprof. I'm doing the following:

Reactant.with_profiler(@__DIR__; trace_host=true, trace_device=true) do
    # compile once (synchronous execution), then run 100 iterations under the profiler
    ∇f_xla = @compile compile_options = Reactant.DefaultXLACompileOptions(; sync=true) ∇expectation(
        params_re, observable_re, coef_re
    )
    for _ in 1:100
        ∇f_xla(params_re, observable_re, coef_re)
    end
end

and I get the following message and no data in my traces:

[Screenshot: error message, 2025-06-25 01:56]

@avik-pal (Collaborator) commented

The gradients look quite strange. Did some loop or vector op get scalarized?

@mofeing (Collaborator, Author) commented Jun 25, 2025

The gradients look quite strange. Did some loop or vector op get scalarized?

Ahh, that's because I was zero-initializing the parameters and using just one Hamiltonian term; the ket and the bra are then effectively perpendicular, so the primal value and the gradients are zero.

Random initialization shows better, though still small, gradients. I would probably need to add more Hamiltonian terms, but right now we just run the gradient function sequentially over all the Hamiltonian terms (with some parallelization using MPI) and then sum the results. Batching over the Hamiltonian terms requires more work, but this benchmark (which you can think of as running 1 epoch with 1 sample) is a good reflection of what we do.

tl;dr: the zero gradients are / were a numerical issue, not a bug introduced by a pass.
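The effect can be sketched with a toy single-qubit example (Python/NumPy, purely illustrative; the actual benchmark uses a multi-qubit EfficientSU2 ansatz): with an RZ-only rotation acting on |0⟩ and a single Pauli-X term, X|0⟩ is orthogonal to |0⟩, so the expectation value is identically zero and so is its gradient at the zero initialization.

```python
import numpy as np

X = np.array([[0, 1], [1, 0]], dtype=complex)  # single Pauli-X "Hamiltonian term"

def rz(theta):
    # RZ rotation; acting on |0> it only contributes a global phase
    return np.diag([np.exp(-1j * theta / 2), np.exp(1j * theta / 2)])

def expectation(theta):
    psi = rz(theta) @ np.array([1.0, 0.0], dtype=complex)
    return float(np.real(psi.conj() @ X @ psi))

# X|0> = |1> is orthogonal to |0>, so <psi|X|psi> = 0 for every theta,
# and the (finite-difference) gradient at the zero initialization vanishes too.
primal = expectation(0.0)
grad = (expectation(1e-6) - expectation(-1e-6)) / 2e-6
```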

@mofeing mofeing force-pushed the ss/ablation-tests/vqe branch from 2a1adc3 to aa11882 Compare July 22, 2025 10:58
@mofeing (Collaborator, Author) commented Jul 22, 2025

A bug introduced in EnzymeAD/Enzyme-JAX#1121 breaks the reverse-diff rule of stablehlo.dynamic_update_slice, so this benchmark is currently pinned to Reactant v0.2.138.
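For context, the expected reverse rule for dynamic_update_slice can be sketched in NumPy (an illustration of the mathematical VJP, not Enzyme-JAX's implementation): for y = dynamic_update_slice(x, u, start), the cotangent of u is the matching slice of the incoming gradient, and the cotangent of x is the incoming gradient with that window zeroed, since the forward op overwrote those entries.

```python
import numpy as np

def dynamic_update_slice(x, u, start):
    # forward: write update `u` into `x` at offset `start` (1-D for brevity)
    y = x.copy()
    y[start:start + u.shape[0]] = u
    return y

def dynamic_update_slice_vjp(g, u_shape, start):
    # reverse: the update receives the slice of the incoming gradient;
    # the operand receives the gradient with the overwritten window zeroed.
    du = g[start:start + u_shape[0]].copy()
    dx = g.copy()
    dx[start:start + u_shape[0]] = 0.0
    return dx, du

x, u = np.zeros(6), np.ones(2)
g = np.arange(6.0)  # incoming cotangent of y
dx, du = dynamic_update_slice_vjp(g, u.shape, start=2)
# du picks out g[2:4]; dx keeps g outside the window and zeros inside it.
```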


For 128 Hamiltonian terms, 40 qubits, and 6 layers of the EfficientSU2 ansatz, I'm getting the following results:

[Image: benchmark results, 40 qubits]

The same run for 50 qubits seems to have problems:

[Image: benchmark results, 50 qubits]

It could also be noise: I've tested on my laptop and on CPU, and since the benchmarks take longer, fewer samples are taken.

@mofeing mofeing marked this pull request as ready for review July 22, 2025 11:47
@mofeing mofeing requested a review from wsmoses July 22, 2025 11:47