-
Notifications
You must be signed in to change notification settings - Fork 173
[BLAS][SPARSE][DPC++] Add CUDA_TARGETS option to pass for AoT compila… #710
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
| if (NOT "${ONEMATH_SYCL_IMPLEMENTATION}" STREQUAL "adaptivecpp") | ||
| target_compile_options(ONEMATH::SYCL::SYCL INTERFACE | ||
| -fsycl-targets=nvptx64-nvidia-cuda -fsycl-unnamed-lambda | ||
| ) | ||
| target_link_options(ONEMATH::SYCL::SYCL INTERFACE | ||
| -fsycl-targets=nvptx64-nvidia-cuda | ||
| ) | ||
|
|
||
| if (DEFINED CUDA_TARGETS AND NOT "${CUDA_TARGETS}" STREQUAL "") | ||
| target_compile_options(ONEMATH::SYCL::SYCL INTERFACE | ||
| -Xsycl-target-backend --cuda-gpu-arch=${CUDA_TARGETS} | ||
| ) | ||
| target_link_options(ONEMATH::SYCL::SYCL INTERFACE | ||
| -Xsycl-target-backend --cuda-gpu-arch=${CUDA_TARGETS} | ||
| ) | ||
| endif() | ||
| endif() | ||
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is fine, it is same update as for BLAS, for CUDA + SYCL linking, so no reason to object
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Correct, was just adding it for consistency between the domains. Having a way to add AoT support too and to avoid issues with newer CUDA versions
Context: CUDA 13 dropped support for several older NVIDIA GPU architectures, specifically Maxwell (compute capability 5.x), Pascal (compute capability 6.x), and Volta (compute capability 7.0, 7.2) architectures.
spencerpatty
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
approved from Sparse BLAS perspective
…tion
CUDA_TARGETSsimilar to existing option forHIP_TARGETSfor AoT builds of DPC++ compilergenericblasalready has stubs to use arch flags but was not set