-
Notifications
You must be signed in to change notification settings - Fork 65
Open
Description
Similar to https://github.com/JuliaGPU/NCCL.jl/, it would be great to have wrappers for RCCL for training DL models on multiple AMDGPUs.
(I know MPI.jl has ROCm support but we don't ship JLLs that are rocm aware, so it might be good to ship RCCL_jll that automatically allows rocm aware communication without copying to CPU)
Metadata
Metadata
Assignees
Labels
No labels