Avoid spread intrinsic #1281

foxtran · 2025-05-12T10:59:14Z

With gfortran, spread intrinsic is not optimized (more precisely this is libgfortran call) and therefore it significantly affects on performance, especially for 3-body terms.

This patch speeds up gradient code in 1.5-2x times with gfortran as well other parts which I did not measure. By some reason, it also affects ifx but it gives only 10% speed up for deriv_atm_triple subroutine.

I have used unrolled cycles since they provide an extra optimizations for ifx: time was improved from 680 to 625 ms for my input. Usual cycles does not give this improvement: see the assembler difference here: https://godbolt.org/z/zs6e7vzT4. The first source code is the original, the second is presented in this PR and the third one uses cycles.

I did not touched initializations/io parts where spread are used.

src/coulomb/gaussian.f90

thfroitzheim · 2025-05-13T21:44:36Z

I did not touch spread hell in gfnff/gfnff.f90. Also, I did not touched initializations/io parts where spread also used.

I know this is the annoying part, but GFN-FF would likely benefit the most from improved parallelization (i.e. in #1240). Likely much more than just a few ms here and there

foxtran · 2025-05-14T06:12:17Z

Likely much more than just a few ms here and there

It is not a few ms for here and there for GCC, unfortunately.

foxtran · 2025-05-16T07:57:02Z

For GFN-FF and bench.xyz/1666_0064xGluAla.xyz, for GCC I got a 20% speed-up with this patch: from 0.78 sec/iter to 0.63 sec/iter in single-threaded run.

Signed-off-by: Igor S. Gerasimov <[email protected]>

foxtran · 2025-05-20T07:36:07Z

@thfroitzheim, ping :-)

thfroitzheim

LGTM

thfroitzheim reviewed May 13, 2025

View reviewed changes

src/coulomb/gaussian.f90 Outdated Show resolved Hide resolved

foxtran force-pushed the fix/spread branch from 1193476 to 83aceb1 Compare May 14, 2025 06:11

foxtran force-pushed the fix/spread branch 2 times, most recently from 8c6dfe0 to c64a94f Compare May 14, 2025 08:50

foxtran requested a review from thfroitzheim May 16, 2025 08:49

foxtran added 16 commits May 18, 2025 11:27

Avoid spread intrinsic in coulomb/

06dc4be

Signed-off-by: Igor S. Gerasimov <[email protected]>

Avoid spread intrinsic in solv/

fc9baf2

Signed-off-by: Igor S. Gerasimov <[email protected]>

Avoid spread intrinsic in peeq_module

7f1c923

Signed-off-by: Igor S. Gerasimov <[email protected]>

Avoid spread intrinsic in freq/prooject.f90

56f4b35

Signed-off-by: Igor S. Gerasimov <[email protected]>

Avoid spread intrinsic in type/coulomb

e4fae42

Signed-off-by: Igor S. Gerasimov <[email protected]>

Avoid spread intrinsic in xtb/hamiltonian, xtb/repulsion

1e8c9a6

Signed-off-by: Igor S. Gerasimov <[email protected]>

Avoid spread intrinsic in disp/

b538eba

Signed-off-by: Igor S. Gerasimov <[email protected]>

Avoid spread intrinsic in gfnff/gdisp0

ee95929

Signed-off-by: Igor S. Gerasimov <[email protected]>

Add note about GCC perf

13013e5

Signed-off-by: Igor S. Gerasimov <[email protected]>

Avoid spread intrinsic in abhgfnff_eg3

d35394c

Signed-off-by: Igor S. Gerasimov <[email protected]>

Avoid spread intrinsic in abhgfnff_eg2_rnr

b55f500

Signed-off-by: Igor S. Gerasimov <[email protected]>

Simplify expression

51477d2

Signed-off-by: Igor S. Gerasimov <[email protected]>

Avoid spread intrinsic in abhgfnff_eg2new

2d9043c

Signed-off-by: Igor S. Gerasimov <[email protected]>

Avoid spread intrinsic in abhgfnff_eg1

c3c615b

Signed-off-by: Igor S. Gerasimov <[email protected]>

Avoid spread intrinsic in rbxgfnff_eg

9284bb0

Signed-off-by: Igor S. Gerasimov <[email protected]>

Avoid rest spread intrinsic in gfnff/gfnff_eg

26fbabf

Signed-off-by: Igor S. Gerasimov <[email protected]>

foxtran force-pushed the fix/spread branch from 944474d to 26fbabf Compare May 18, 2025 09:27

thfroitzheim approved these changes May 21, 2025

View reviewed changes

thfroitzheim merged commit fe1c8ce into grimme-lab:main May 21, 2025
23 checks passed

foxtran deleted the fix/spread branch May 21, 2025 13:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid spread intrinsic #1281

Avoid spread intrinsic #1281

Uh oh!

foxtran commented May 12, 2025 •

edited

Loading

Uh oh!

Uh oh!

thfroitzheim commented May 13, 2025

Uh oh!

foxtran commented May 14, 2025 •

edited

Loading

Uh oh!

foxtran commented May 16, 2025 •

edited

Loading

Uh oh!

foxtran commented May 20, 2025

Uh oh!

thfroitzheim left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Avoid spread intrinsic #1281

Avoid spread intrinsic #1281

Uh oh!

Conversation

foxtran commented May 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

thfroitzheim commented May 13, 2025

Uh oh!

foxtran commented May 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

foxtran commented May 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

foxtran commented May 20, 2025

Uh oh!

thfroitzheim left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

foxtran commented May 12, 2025 •

edited

Loading

foxtran commented May 14, 2025 •

edited

Loading

foxtran commented May 16, 2025 •

edited

Loading