Conversation
Profiling mrregister on macOS showed that the hottest symbol was `MR::Interp::LinearInterp<...>::row(unsigned long)` called from `Registration::Warp::DisplacementThreadKernel::invert_displacement()`. That generic interpolation path builds a dynamic Eigen matrix for each sample, which is often unnecessary for vector fields using 3 component vectors. This change the implementation of `LinearInterp::row()` to avoid constructing an 3x8 matrix and instead does the weighted summation inside the for loop. Additionally, a new vec3() wrapper is added to avoid dynamic allocation for the common case of interpolating 3-volume vector fields. This helps reducing temporary allocations while preserving the existing interpolation behavior.
|
clang-tidy review says "All clean, LGTM! 👍" |
|
clang-tidy review says "All clean, LGTM! 👍" |
Lestropie
reviewed
Mar 16, 2026
Lestropie
reviewed
Mar 16, 2026
Member
|
Left a couple of comments. I wonder if there might be other places in the code base where |
|
clang-tidy review says "All clean, LGTM! 👍" |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Profiling mrregister on macOS showed that the hottest symbol was
MR::Interp::LinearInterp<...>::row(unsigned long)called fromRegistration::Warp::DisplacementThreadKernel::invert_displacement(). That generic interpolation path builds a dynamic Eigen matrix for each sample, which is often unnecessary for vector fields using 3 component vectors.This change the implementation of
LinearInterp::row()to avoid constructing an 3x8 matrix and instead does the weighted summation inside the for loop. Additionally, a newvec3()wrapper is added to avoid dynamic allocation for the common case of interpolating 3-volume vector fields. This helps reducing temporary allocations while preserving the existing interpolation behavior.In my tests, this doesn't yield any noticeable performance gains on Linux, but on macOS there seems to be a consistent 15-20% performance improvement when running scalar registration of 3D volumes.