Add two LCAO base group GPU version compilation options in toolchain #6014

tang070205 · 2025-03-16T08:02:46Z

Firstly, it's cusolverMp merhod

Change the link libraries for the cal and cusolverMp sections in the CMakeLists.txt file to manually specified, and then manually pass the parameters in cmake -D CAL_CUSOLVERMP_PATH=/path/to/lib

Add the following options after build_abacus_gnu.sh
-DUSE_CUDA=ON \
-DENABLE_CUSOLVERMP=ON \
-D CAL_CUSOLVERMP_PATH=/opt/nvidia/hpc_sdk/Linux_x86_64/24.11/math_libs/12.6/targets/x86_64-linux/lib
CAL_CUSOLVERMP-PATH needs to be set according to different environments

next it is necessary to set up the environment for cal and hpcx, which can be added to~/. bashrc or manually set up an env.sh
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/opt/nvidia/hpc_sdk/Linux_x86_64/24.11/comm_libs/12.6/hpcx/hpcx-2.20/ucc/lib
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/opt/nvidia/hpc_sdk/Linux_x86_64/24.11/comm_libs/12.6/hpcx/hpcx-2.20/ucx/lib
export CPATH=$CPATH:/opt/nvidia/hpc_sdk/Linux_x86_64/24.11/math_libs/12.6/targets/x86_64-linux/include

The second is the ELPA method

Add the following options after toolchain_gnu.sh
export CUDA_PATH=/usr/local/cuda
--enable-cuda \
--gpu-ver=89 \
The 40 Series here is newly added in the install_abacus_toolchain.sh file, corresponding to sm_89

The above two methods can be compiled successfully by using the build_abacus_gnu.sh file

QuantumMisaka · 2025-03-19T07:11:22Z

I'll submit a modification. After that, the ELPA-GPU compilation can be smoothly done in any cuda>11.6 environment. the installation and usage of cusolvermp needs more development

- ELPA compiler flags modification - GPU_VER setting modification: user should specify the GPU compability number, but not the GPU name - Modify toolchain_[gnu,intel].sh and build_abacus_[gnu,intel].sh to use the above modification

LCAO-GPU modification

QuantumMisaka · 2025-03-19T10:45:56Z

We will try to add cusolvermp installation & compilation inside toolchain

QuantumMisaka · 2025-03-20T10:45:54Z

We will try to add cusolvermp installation & compilation inside toolchain

Update: The method in deploying cusovermp and the related dependencies （UCC/UCX/libcal）can be multiple (nvhpc-sdk/individual-package) , and different version of these package may have different path, making it very difficult to link them automatcally. Besides, if one wants to use the multiple-GPU calculation, simplily install HPC_SDK automatically is not enough, and the UCC/UCX/cusolvermp need to be complied/deployed according to the server setting. So, we will not add automatically download/link in toolchain now. However, we will add a simple deployment method via HPC-SDK in README for user as reference to install and use the cusolvermp themselves.

update README and cusolvermp

QuantumMisaka

All tests passed. cusolvermp is too complicated to be incorporate

toolchain/README.md

QuantumMisaka · 2025-03-21T05:16:48Z

@mohanchen @dzzz2001 @goodchong We think this version of toolchain is enough now for a simple GPU-LCAO installation with CUDA and ELPA

mohanchen · 2025-03-22T10:05:07Z

LGTM

…eepmodeling#6014) * Add optional LCAO base GPU versions supported by cusolvermp * Add optional LCAO base GPU versions supported by elpa * Add optional LCAO base GPU versions supported by elpa * Add L40S as GPUVER value for sm_89 architecture * Delete a few lines of content to enable Nvidia to compile * Add a specified Fortran mpi compiler for elpa to use * Add CUDA path for use by ELPA-GPU * Add optional LCAO base GPU versions supported by elpa * Modify a small issue * Change to manually specifying the link libraries for CAL and cusolverMp * Add the use of 'cusolvermp' or 'elpa' methods to compile ABACUS GPU-LCAO * Add the use of 'cusolvermp' or 'elpa' methods to compile ABACUS GPU-LCAO * Add the use of 'cusolvermp' or 'elpa' methods to compile ABACUS GPU-LCAO * Add modification - ELPA compiler flags modification - GPU_VER setting modification: user should specify the GPU compability number, but not the GPU name - Modify toolchain_[gnu,intel].sh and build_abacus_[gnu,intel].sh to use the above modification * minor adjustment * update README * give back cmake default option * update README and cusolvermp * Update README.md --------- Co-authored-by: JamesMisaka <[email protected]>

…6014) * Add optional LCAO base GPU versions supported by cusolvermp * Add optional LCAO base GPU versions supported by elpa * Add optional LCAO base GPU versions supported by elpa * Add L40S as GPUVER value for sm_89 architecture * Delete a few lines of content to enable Nvidia to compile * Add a specified Fortran mpi compiler for elpa to use * Add CUDA path for use by ELPA-GPU * Add optional LCAO base GPU versions supported by elpa * Modify a small issue * Change to manually specifying the link libraries for CAL and cusolverMp * Add the use of 'cusolvermp' or 'elpa' methods to compile ABACUS GPU-LCAO * Add the use of 'cusolvermp' or 'elpa' methods to compile ABACUS GPU-LCAO * Add the use of 'cusolvermp' or 'elpa' methods to compile ABACUS GPU-LCAO * Add modification - ELPA compiler flags modification - GPU_VER setting modification: user should specify the GPU compability number, but not the GPU name - Modify toolchain_[gnu,intel].sh and build_abacus_[gnu,intel].sh to use the above modification * minor adjustment * update README * give back cmake default option * update README and cusolvermp * Update README.md --------- Co-authored-by: JamesMisaka <[email protected]>

tang070205 added 8 commits March 16, 2025 13:23

Add optional LCAO base GPU versions supported by cusolvermp

7929675

Add optional LCAO base GPU versions supported by elpa

9b8ed70

Add optional LCAO base GPU versions supported by elpa

0a6a097

Add L40S as GPUVER value for sm_89 architecture

471d6f3

Delete a few lines of content to enable Nvidia to compile

0bbf6f2

Add a specified Fortran mpi compiler for elpa to use

b3defd5

Add CUDA path for use by ELPA-GPU

dad1705

Add optional LCAO base GPU versions supported by elpa

6807886

QuantumMisaka self-assigned this Mar 16, 2025

tang070205 added 2 commits March 16, 2025 22:36

Modify a small issue

a145e53

Change to manually specifying the link libraries for CAL and cusolverMp

5318a96

mohanchen requested a review from QuantumMisaka March 17, 2025 06:33

mohanchen added the Compile & CICD & Docs & Dependencies Issues related to compiling ABACUS label Mar 17, 2025

mohanchen requested a review from dzzz2001 March 17, 2025 06:35

tang070205 added 4 commits March 18, 2025 13:23

Merge branch 'develop' into develop

7f8ddce

Add the use of 'cusolvermp' or 'elpa' methods to compile ABACUS GPU-LCAO

25a2239

Add the use of 'cusolvermp' or 'elpa' methods to compile ABACUS GPU-LCAO

9bd3ffd

Add the use of 'cusolvermp' or 'elpa' methods to compile ABACUS GPU-LCAO

c18551c

QuantumMisaka mentioned this pull request Mar 19, 2025

elpa-gpu compilation error #5872

Closed

10 tasks

QuantumMisaka and others added 5 commits March 19, 2025 17:16

Add modification

c1f832c

- ELPA compiler flags modification - GPU_VER setting modification: user should specify the GPU compability number, but not the GPU name - Modify toolchain_[gnu,intel].sh and build_abacus_[gnu,intel].sh to use the above modification

minor adjustment

2aea32b

update README

539f593

give back cmake default option

eb0ab10

Merge pull request #1 from QuantumMisaka/lcao-gpu-modify

1865435

LCAO-GPU modification

QuantumMisaka and others added 2 commits March 20, 2025 22:13

update README and cusolvermp

13735bc

Merge pull request #2 from QuantumMisaka/lcao-gpu-modify

be89bc7

update README and cusolvermp

QuantumMisaka approved these changes Mar 20, 2025

View reviewed changes

QuantumMisaka reviewed Mar 20, 2025

View reviewed changes

toolchain/README.md Outdated Show resolved Hide resolved

Update README.md

654f8ba

mohanchen approved these changes Mar 22, 2025

View reviewed changes

mohanchen added the GPU & DCU & HPC GPU and DCU and HPC related any issues label Mar 22, 2025

mohanchen merged commit 76af832 into deepmodeling:develop Mar 22, 2025
14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add two LCAO base group GPU version compilation options in toolchain #6014

Add two LCAO base group GPU version compilation options in toolchain #6014

Uh oh!

tang070205 commented Mar 16, 2025 •

edited

Loading

Uh oh!

QuantumMisaka commented Mar 19, 2025 •

edited

Loading

Uh oh!

QuantumMisaka commented Mar 19, 2025

Uh oh!

QuantumMisaka commented Mar 20, 2025

Uh oh!

QuantumMisaka left a comment

Uh oh!

Uh oh!

QuantumMisaka commented Mar 21, 2025

Uh oh!

mohanchen commented Mar 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add two LCAO base group GPU version compilation options in toolchain #6014

Add two LCAO base group GPU version compilation options in toolchain #6014

Uh oh!

Conversation

tang070205 commented Mar 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

QuantumMisaka commented Mar 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

QuantumMisaka commented Mar 19, 2025

Uh oh!

QuantumMisaka commented Mar 20, 2025

Uh oh!

QuantumMisaka left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

QuantumMisaka commented Mar 21, 2025

Uh oh!

mohanchen commented Mar 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

tang070205 commented Mar 16, 2025 •

edited

Loading

QuantumMisaka commented Mar 19, 2025 •

edited

Loading