Handle argmin/argmax tuple accumulators in CUB #1096
Merged
Summary
Testing
-- Finding CCCL...
-- CPM: Adding package [email protected] (0320434)
-- Finding CCCL components: Thrust;CUB;libcudacxx
-- cccl_cmake_dir: /tmp/MatX/build/_deps/cccl-src/lib/cmake/cccl
-- Found Thrust: /tmp/MatX/build/_deps/cccl-src/lib/cmake/thrust/thrust-config.cmake (found suitable exact version "3.2.0.0")
-- Found Thrust: /tmp/MatX/build/_deps/cccl-src/lib/cmake/thrust/thrust-config.cmake (found version "3.2.0.0")
-- Cannot find nvtiff library. Disabling MatX nvtiff features.
-- Enabling pybind11 support
-- CPM: Using local package [email protected]
-- checking python import module numpy
-- checking python import module cupy
-- The optional python package cupy package is not installed. Some unit tests and functionality may not work
-- CPM: Adding package [email protected] (v1.17.0)
-- Configuring done (0.4s)
-- Generating done (0.2s)
-- Build files have been written to: /tmp/MatX/build
[100%] Building CUDA object test/CMakeFiles/test_00_operators_ReductionTests.dir/main.cu.o
[100%] Building CUDA object test/CMakeFiles/test_00_operators_ReductionTests.dir/00_operators/ReductionTests.cu.o
(The build fails early on this machine because GCC 11's libstdc++ lacks a required header, but it proceeds far enough to confirm that the templates instantiate as expected.)
Fixes #1095.