This is a request to add documentation for CUDA devices. For example, it would be great to see:
- Instructions for setting up the Conda environment and a
.yaml file for CUDA devices
- Instructions for building DPBench (with support for the SYCL implementations)
- Run instructions to target CUDA devices with the benchmarks
- Which implementations of which benchmarks can we run on CUDA devices (CuPy and numba_cuda are expected, but it would be great to be able to run the SYCL implementations too)
Thanks!