    Repositories list

    • TypeScript
      0001 Updated Sep 11, 2025
    • Uniconn

      Public
      Uniconn is a unified, portable high-level C++ communication library that supports both point-to-point and collective operations across GPU clusters. Uniconn enables seamless switching between backends and APIs (host or device) with minimal or no changes to application code.
      Cuda
      0000 Updated Aug 31, 2025
    • aCG

      Public
      GPU-accelerated linear solvers based on the conjugate gradient (CG) method, supporting NVIDIA and AMD GPUs with GPU-aware MPI, NCCL, RCCL or NVSHMEM
      C
      0100 Updated Aug 25, 2025
    • C++
      1200 Updated Mar 27, 2025
    • Modified ucx library to track communications
      C
      483000 Updated Mar 10, 2025
    • Cuda
      1410 Updated Jun 13, 2024
    • Snoopie

      Public
      Multi-GPU communication profiler and visualizer
      C
      33230 Updated Jun 10, 2024
    • GPU fusion code and algorithm
      Cuda
      0100 Updated May 24, 2024
    • barnes

      Public
      C
      0000 Updated May 15, 2024
    • 0020 Updated May 10, 2024
    • Source code for the CPU-Free model - a fully autonomous execution model for multi-GPU applications that completely excludes the involvement of the CPU beyond the initial kernel launch.
      Cuda
      32000 Updated Apr 25, 2024
    • C
      0200 Updated Apr 25, 2024
    • BeyondMoore has an ambitious goal to develop a software framework that performs static and dynamic optimizations, issues accelerator-initiated data transfers, and reasons about parallel execution strategies that exploit both processor and memory heterogeneity.
      0200 Updated Apr 25, 2024
    • .github

      Public
      Homepage README.
      0000 Updated Apr 4, 2024
    • C
      0000 Updated Mar 22, 2024
    • DaCe - Data Centric Parallel Programming
      Python
      141000 Updated Feb 2, 2024
    • splash2

      Public
      Splash 2 Benchmarks
      C
      12000 Updated Nov 28, 2023
    • ComScribe

      Public
      ComScribe is a tool to identify communication among all GPU-GPU and CPU-GPU pairs in a single-node multi-GPU system.
      C++
      42512 Updated Jul 6, 2023
    • C++
      0000 Updated Jun 13, 2023
    • HPCToolkit performance tools: measurement and analysis components
      C++
      60001 Updated Mar 17, 2023
    • The microbenchmarks that are used to verify the accuracy of ComDetective.
      Makefile
      2000 Updated Mar 17, 2023
    • Mixed and Multi-Precision SpMV for GPUs with Row-wise Precision Selection.
      Cuda
      1510 Updated Mar 12, 2023
    • A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs.
      Shell
      32010 Updated Feb 3, 2023
    • HPCToolkit performance tools: essential third party libraries for hpctoolkit
      Shell
      6000 Updated Oct 9, 2022
    • AMD Research Instruction Based Sampling Toolkit
      C
      17000 Updated Aug 6, 2022
    • pardnn

      Public
      C++
      1100 Updated May 20, 2022
    • C
      1000 Updated Apr 16, 2022
    • The split execution framework can automatically determine the suitability of an SpTRSV for split-execution, find the appropriate split point, and execute SpTRSV in a split fashion using two SpTRSV algorithms while automatically managing any required inter-platform communication. The model is implemented as a C++/CUDA library supporting multiple …
      C++
      0400 Updated Sep 7, 2021
    • The SpTRSV prediction framework is an automated prediction framework for the fastest sparse triangular solve (SpTRSV) algorithm for a given input sparse matrix on a CPU-GPU platform.
      C++
      2600 Updated Aug 17, 2020