Skip to content

Conversation

@ErjieWu
Copy link
Collaborator

@ErjieWu ErjieWu commented Aug 26, 2025

Linked Issue

Partially accleration for #6416 request.

What's changed?

  • Avoid some duplicate calculations in cal_overlap for acceleration.
  • Change some functions to inline function.

Performance

1a927cb5-f8b8-4509-b973-bbaaf470470c

Test is for 500 C atoms. Notice that we did not add omp parallel, since in this function the communication cost will significantly larger than calculation cost when using omp. Currently, the bottleneck is the call for std::map (NOT I/O PROCESS), this cannot be easily solved since it effect many other modules (requires refactor for core module).

@ErjieWu ErjieWu requested a review from mohanchen August 27, 2025 01:51
@ErjieWu ErjieWu added the Performance Issues related to fail running ABACUS label Aug 27, 2025
@mohanchen
Copy link
Collaborator

Good job, I will take a look at the map operation later.

@mohanchen mohanchen added the Refactor Refactor ABACUS codes label Aug 27, 2025
@mohanchen mohanchen merged commit 41b5045 into deepmodeling:develop Aug 27, 2025
14 checks passed
@ErjieWu ErjieWu deleted the perf branch August 27, 2025 06:40
@JTaozhang
Copy link

thanks for test. cheers!!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Performance Issues related to fail running ABACUS Refactor Refactor ABACUS codes

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants