Skip to content

Conversation

@Critsium-xy
Copy link
Collaborator

--ntasks=4 Result
image

--ntasks=1 Result
image

Not fully optimized yet. Memory management is still in construction. It may be 2 times faster after it is finished.

@mohanchen mohanchen added the GPU & DCU & HPC GPU and DCU and HPC related any issues label Oct 26, 2024
@mohanchen mohanchen merged commit 72d9d1d into deepmodeling:develop Oct 26, 2024
14 checks passed
@Critsium-xy Critsium-xy deleted the mtblas_parallel branch October 28, 2024 02:37
Fisherd99 pushed a commit to Fisherd99/abacus-BSE that referenced this pull request Mar 31, 2025
* Fix parallel function

* Fix parallel usage

* Temporarily remove memory_op porting
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

GPU & DCU & HPC GPU and DCU and HPC related any issues

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants