This sample demonstrates how to use the cuSPARSELt library and its cusparseLtMatmul API to perform structured matrix - dense matrix multiplication on NVIDIA Sparse Tensor Cores, where the structured matrix is compressed with a 50% sparsity ratio. The operation computed is:
C = alpha * A * B + beta * C
where B and C are dense matrices and A is the structured matrix, stored in dense format before compression.
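
To make the operation concrete, here is a minimal CPU-only reference sketch in plain C++. It is not the cuSPARSELt implementation and calls no library API. It assumes the 50% structured pattern is 2:4 (two nonzeros in every group of four consecutive values along a row of A), that pruning keeps the two largest magnitudes in each group, and that all matrices are row-major. The function names `prune_2_of_4` and `gemm_reference` are hypothetical and chosen for illustration only.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Assumption: 50% structured sparsity means a 2:4 pattern. In every group of
// four consecutive values in a row of A (row-major, m x k), keep the two
// largest magnitudes and zero the other two. On ties, the earlier index wins.
void prune_2_of_4(std::vector<float>& A, std::size_t m, std::size_t k) {
    assert(k % 4 == 0 && A.size() == m * k);
    for (std::size_t i = 0; i < m; ++i) {
        for (std::size_t g = 0; g < k; g += 4) {
            float* v = &A[i * k + g];
            // Index of the largest magnitude in the group.
            std::size_t first = 0;
            for (std::size_t j = 1; j < 4; ++j)
                if (std::fabs(v[j]) > std::fabs(v[first])) first = j;
            // Index of the second-largest magnitude in the group.
            std::size_t second = (first == 0) ? 1 : 0;
            for (std::size_t j = 0; j < 4; ++j)
                if (j != first && std::fabs(v[j]) > std::fabs(v[second])) second = j;
            // Zero everything except the two kept entries.
            for (std::size_t j = 0; j < 4; ++j)
                if (j != first && j != second) v[j] = 0.0f;
        }
    }
}

// Reference for C = alpha * A * B + beta * C with row-major storage:
// A is m x k, B is k x n, C is m x n.
void gemm_reference(float alpha, const std::vector<float>& A,
                    const std::vector<float>& B, float beta,
                    std::vector<float>& C,
                    std::size_t m, std::size_t n, std::size_t k) {
    assert(A.size() == m * k && B.size() == k * n && C.size() == m * n);
    for (std::size_t i = 0; i < m; ++i) {
        for (std::size_t j = 0; j < n; ++j) {
            float acc = 0.0f;
            for (std::size_t p = 0; p < k; ++p)
                acc += A[i * k + p] * B[p * n + j];
            C[i * n + j] = alpha * acc + beta * C[i * n + j];
        }
    }
}
```

Calling `prune_2_of_4` on A and then `gemm_reference` gives a host-side result that a GPU result could be compared against, under the assumptions stated above. The zeros introduced by pruning are what allow the structured matrix to be stored in compressed form.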
- Linux:

      make

- Alternatively, using CMake:

      mkdir build
      cd build
      cmake ..
      make

- Supported SM Architectures: SM 8.0, SM 8.6
- Supported OSes: Linux, Windows
- Supported CPU Architectures: x86_64, arm64
- Supported Compilers: gcc, clang, Intel icc, IBM xlc, Microsoft msvc, NVIDIA HPC SDK nvc
- Language: C++14
- CUDA 11.2 toolkit (or above) and compatible driver (see CUDA Driver Release Notes).
- cuSPARSELt library
- CMake 3.9 or above