Build Triton-distributed (RocSHMEM)

The best practice to use Trition-distributed in AMD GPU

ROCm 6.3.3
torch-2.6/2.8 (torch-2.6 has major improvement, compatible with SGLang, vLLM)
python3.12.8
MI300X/MI325X

Dependencies with other versions may also work well, but this is not guaranteed. If you find any problem in installing, please tell us in Issues.

Setup without docker

make sure torch-rocm is installed for ROCm SDK 6.3.3

pip3 install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/rocm6.3

install ompi for ROCm SDK

sudo apt-get update -y && \
sudo apt install -y libopenmpi-dev

install other dependencies

python3 -m pip install -i https://test.pypi.org/simple hip-python~=6.3.3 # or whatever Rocm version you have
pip3 install pybind11

Warnning of install inside existing dockder

Make sure following repositories granted permission to clone submodules

export TRITON_DIST_HOME=$(readlink -f `pwd`)

git config --global --add safe.directory $TRITON_DIST_HOME/Triton-distributed/3rdparty/rocshmem
git config --global --add safe.directory $TRITON_DIST_HOME/Triton-distributed/3rdparty/triton
git config --global --add safe.directory $TRITON_DIST_HOME/Triton-distributed

git submodule update --init --recursive

Build

python3 python/setup.py build_ext

Test

Currently only single node build supported, multi-node build will be supported soon.

Single node test

bash ./scripts/launch_amd.sh python/triton_dist/test/amd/test_ag_gemm_intra_node.py 8192 8192 29568

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Build Triton-distributed (RocSHMEM)

The best practice to use Trition-distributed in AMD GPU

Setup without docker

Warnning of install inside existing dockder

Build

Test

FilesExpand file tree

build_amd.md

Latest commit

History

build_amd.md

File metadata and controls

Build Triton-distributed (RocSHMEM)

The best practice to use Trition-distributed in AMD GPU

Setup without docker

Warnning of install inside existing dockder

Build

Test