cuda: Add explicit check for the use of Cuda Toolkit's >= 13 with ccs < 7.5#486
Merged
Treece-Burgess merged 1 commit intoicl-utk-edu:masterfrom Oct 12, 2025
Conversation
|
I am testing this PR. |
tokey-tahmid
approved these changes
Oct 8, 2025
There was a problem hiding this comment.
I tested the PR on a machine with 1 * V100 and 1 * H100 with CUDA Toolkit 13.0. With export CUDA_VISIBLE_DEVICES, PAPI Utilities, and CUDA component tests function as expected. And the updated disable message is seen when CUDA_VISIBLE_DEVICES variable is not set.
da127a5 to
e4dbac6
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Pull Request Description
In Cuda Toolkit 13, support for offline compilation for architectures prior to cc's < 7.5 have been removed.
Currently the
cudacomponent is able to be compiled using Cuda Toolkit 13 on a machine with cc's < 7.5. However, if a user ran./papi_component_availthey would be met with thecudacomponent being disabled with a message of:This PR introduces a conditional check to make sure that a Cuda Toolkit version >= 13 is not being used on a machine with devices that have cc's < 7.5. If this conditional check is met then a more apt error message is now provided:
Testing
Using Cuda Toolkit 13 on a machine with mixed cc's (1 * V100 and 1 * H100) and using
export CUDA_VISIBLE_DEVICES=0which corresponds to the H100 only being seen results in:cudacomponent being activepapi_component_avail,papi_native_avail, andpapi_command_lineall successfully workingcudacomponent tests all passingUsing Cuda Toolkit 12.6 on a machine with mixed cc's (1 * V100 and 1 * H100) results in:
cudacomponent being activepapi_component_avail,papi_native_avail, andpapi_command_lineall successfully workingcudacomponent tests all passingAuthor Checklist
Why this PR exists. Reference all relevant information, including background, issues, test failures, etc
Commits are self contained and only do one thing
Commits have a header of the form:
module: short descriptionCommits have a body (whenever relevant) containing a detailed description of the addressed problem and its solution
The PR needs to pass all the tests