
Reduce allocations in stepsize.jl #390

Merged
yebai merged 8 commits into main from dw/adaptation_stepsize on Mar 27, 2025

Conversation

@devmotion
Member

No description provided.

devmotion and others added 4 commits February 18, 2025 23:15
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
@yebai
Member

yebai commented Mar 17, 2025

@devmotion can you fix the merge conflict before I review this PR?

@yebai yebai self-requested a review March 17, 2025 12:07
devmotion and others added 2 commits March 17, 2025 14:23
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Member

@yebai yebai left a comment

Thanks @devmotion -- I left a few questions below.

I'll have to take another closer look later this week.

 end

-computeμ(ϵ::AbstractScalarOrVec{<:AbstractFloat}) = log.(10 * ϵ)
+computeμ(ϵ::AbstractFloat) = log(10 * ϵ)
Member

Caution is required here: these support the vectorised version of HMC. Do you know how map would differ from broadcasting here?

Member Author

The results of the calculations won't be affected by this change, but using the non-broadcasted formulation for scalars and map for vectors of floats removes the broadcasting overhead and reduces stress on the compiler, i.e., it generally reduces compilation time. Sometimes it also helps type inference (though this case is probably too simple for that effect).

In my experience, broadcasting is useful when one is actually broadcasting values of different sizes and dimensions, but otherwise it is often a suboptimal choice.
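To make the comparison concrete, here is a small standalone sketch; the `computeμ` definition matches the scalar method in the diff above, while the step-size values are made up for illustration:

```julia
# Scalar vs. vectorised step-size initialisation, as discussed above.
computeμ(ϵ::AbstractFloat) = log(10 * ϵ)

ϵ_scalar = 0.1
ϵ_vector = [0.1, 0.2, 0.4]

# Scalar path: a plain function call, no broadcasting machinery involved.
μ_scalar = computeμ(ϵ_scalar)

# Vector path: `map` yields exactly the same values as broadcasting,
# but asks less of the compiler.
μ_map = map(computeμ, ϵ_vector)
μ_broadcast = computeμ.(ϵ_vector)
```

Both vector formulations produce identical results; the difference is purely in compilation overhead.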

 function finalize!(da::NesterovDualAveraging)
-    da.state.ϵ = exp.(da.state.x_bar)
-    return nothing
+    finalize!(da.state)
Member

Nice improvement!


 η_H = one(T) / (m + t_0)
-H_bar = (one(T) - η_H) * H_bar .+ η_H * (δ .- α)
+H_bar = (one(T) - η_H) .* H_bar .+ η_H .* (δ .- min.(one(T), α))
Member

@yebai yebai Mar 17, 2025

HG: I'll have to review these more carefully later this week.

EDIT: This looks good. I am surprised the previous code didn't break any tests, as it didn't properly support vectorised adaptation.
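As a sanity check on the corrected line, here is a standalone sketch of the vectorised update; the names `η_H`, `H_bar`, `δ`, `α`, `m`, and `t_0` mirror the diff above, but all values are made up for illustration:

```julia
# Vectorised dual-averaging update with the per-chain clamp on α.
T = Float64
m, t_0 = 5, 10.0           # iteration counter and stabilisation offset (illustrative)
δ = 0.8                    # target acceptance rate
α = [0.6, 1.3, 0.9]        # per-chain acceptance statistics (may exceed one)
H_bar = zeros(T, 3)        # running average of the acceptance error

η_H = one(T) / (m + t_0)
# `min.(one(T), α)` clamps each chain's statistic at one; the old line applied
# no clamp, so it did not properly support vectorised adaptation.
H_bar = (one(T) - η_H) .* H_bar .+ η_H .* (δ .- min.(one(T), α))
```

With `H_bar` starting at zero, each entry reduces to `η_H * (δ - min(1, αᵢ))`, so the chain with `α = 1.3` is treated as having acceptance statistic one.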

Member

@yebai yebai left a comment

Thanks, @devmotion. Nice improvements. I left a few comments below, mostly about whether we should refactor the vectorised HMC implementation separately, in a concerted effort, to avoid inconsistency.

 function DAState(ϵ::AbstractVector{T}) where {T}
     n = length(ϵ)
-    μ = computeμ(ϵ)
+    μ = map(computeμ, ϵ)
Member

Suggested change
-μ = map(computeμ, ϵ)
+μ = computeμ(ϵ)

-das.μ .= computeμ(das.ϵ)
+map!(computeμ, das.μ, das.ϵ)
 das.x_bar .= zero(T)
 return das.H_bar .= zero(T)
Member

Let's keep this as-is for now. We could refactor the vectorised HMC interface, but it's better to do that separately in a concerted effort:

Suggested change
-map!(computeμ, das.μ, das.ϵ)
+das.μ .= computeμ(das.ϵ)

Member Author

This suggestion would go against the main intention of the PR, reducing unnecessary allocations: with map! (or das.μ .= computeμ.(das.ϵ), though the broadcasting is more stressful for the compiler) no intermediate array is created in this line, whereas with the suggested change a new array is allocated and then copied into das.μ (as a side remark, copyto! should be simpler for the compiler than broadcasting).
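The allocation difference is easy to verify directly; here is a hedged sketch, where the two-method `computeμ` and the array size are hypothetical, chosen only to make the temporary visible:

```julia
computeμ(ϵ::AbstractFloat) = log(10 * ϵ)
computeμ(ϵ::AbstractVector{<:AbstractFloat}) = log.(10 .* ϵ)  # returns a fresh array

inplace!(μ, ϵ) = map!(computeμ, μ, ϵ)  # writes results straight into μ
copying!(μ, ϵ) = (μ .= computeμ(ϵ))    # materialises a temporary, then copies into μ

function measure()
    μ, ϵ = zeros(1000), rand(1000)
    inplace!(μ, ϵ); copying!(μ, ϵ)     # warm up so compilation is not counted
    a = @allocated inplace!(μ, ϵ)
    b = @allocated copying!(μ, ϵ)
    return a, b
end

a, b = measure()  # a should be zero; b pays for the temporary array
```

Both variants leave the same values in μ; only the copying variant allocates per call.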

Member

@yebai yebai Mar 26, 2025

Fair point!

I recently discovered the AcceleratedKernels package, which provides a unified interface for parallelisation on CPUs, clusters, and GPUs. We could consider switching to AcceleratedKernels.map! for the vectorised HMC implementation, hence the suggestion above.

EDIT: I opened an issue for this suggestion. #412

yebai previously approved these changes Mar 26, 2025
penelopeysm previously approved these changes Mar 26, 2025
@yebai yebai dismissed stale reviews from penelopeysm and themself via 1dc6ccc March 26, 2025 14:29
@yebai
Member

yebai commented Mar 26, 2025

Feel free to merge once CI passes!

@yebai yebai merged commit a96ab41 into main Mar 27, 2025
17 checks passed
@yebai yebai deleted the dw/adaptation_stepsize branch March 27, 2025 15:31

3 participants