You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[AICOMRCCL-697] Add --enable-mpi-tests and --cmake-options to install.sh (#3862)
## Motivation
- Add --enable-mpi-tests flag (requires --debug; MPI tests reference
internal RCCL symbols hidden in release builds)
- Add --cmake-options pass-through for arbitrary CMake -D options
- Update docs/install/installation.rst with new options and environment
variable documentation (ONLY_FUNCS)
## Technical Details
<!-- Explain the changes along with any relevant GitHub links. -->
## JIRA ID
AICOMRCCL-697
## Test Plan
Verified that --enable-mpi-tests correctly gates on --debug and passes
-DENABLE_MPI_TESTS=ON to CMake. Verified --cmake-options appends options
to the CMake invocation.
## Test Result
<!-- Briefly summarize test outcomes. -->
## Submission Checklist
- [ ] Look over the contributing guidelines at
https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.
--enable_backtrace Build with custom backtrace support
58
60
--disable-colltrace Build without collective trace
59
-
--disable-msccl-kernel Build without MSCCL kernels
61
+
--enable-msccl-kernel Build with MSCCL kernels
62
+
--dump-asm Disassemble code and dump assembly with inline code
60
63
--enable-mscclpp Build with MSCCL++ support
64
+
--enable-mscclpp-clip Build MSCCL++ with clip wrapper on bfloat16 and half addition routines
65
+
--disable-roctx Build without ROCTX logging
61
66
-f|--fast Quick-build RCCL (local gpu arch only, no backtrace, and collective trace support)
62
67
-h|--help Prints this help message
63
68
-i|--install Install RCCL library (see --prefix argument below)
64
-
-j|--jobs Specify how many parallel compilation jobs to run ($nproc by default)
69
+
-j|--jobs Specify how many parallel compilation jobs to run (128 by default)
70
+
--kernel-resource-use Dump GPU kernel resource usage (e.g., VGPRs, scratch, spill) at link stage
65
71
-l|--local_gpu_only Only compile forlocal GPU architecture
66
72
--amdgpu_targets Only compile for specified GPU architecture(s). For multiple targets, separate by ';' (builds for all supported GPU architectures by default)
67
73
--no_clean Don't delete files if they already exist
68
74
--npkit-enable Compile with npkit enabled
75
+
--log-trace Build with log trace enabled (i.e. NCCL_DEBUG=TRACE)
76
+
--enable-mpi-tests Enable MPI-based tests (requires --debug and MPI installation; set MPI_PATH if not in /opt/ompi)
69
77
--openmp-test-enable Enable OpenMP in rccl unit tests
70
-
--roctx-enable Compile with roctx enabled (example usage: rocprof --roctx-trace ./rccl-program)
71
78
-p|--package_build Build RCCL package
72
79
--prefix Specify custom directory to install RCCL to (default: `/opt/rocm`)
73
-
--rm-legacy-include-dir Remove legacy include dir Packaging added for file/folder reorg backward compatibility
74
80
--run_tests_all Run all rccl unit tests (must be built already)
75
81
-r|--run_tests_quick Run small subset of rccl unit tests (must be built already)
76
82
--static Build RCCL as a static library instead of shared library
77
83
-t|--tests_build Build rccl unit tests, but do not run
78
84
--time-trace Plot the build time of RCCL (requires `ninja-build` package installed on the system)
79
85
--verbose Show compile commands
86
+
--force-reduce-pipeline Force reduce_copy sw pipeline to be used for every reduce-based collectives and datatypes
0 commit comments