Refactor cub::BlockLoad and cub::BlockStore #8120

Merged
bernhardmgruber merged 3 commits into NVIDIA:main from bernhardmgruber:ref_blockload
Mar 21, 2026

@bernhardmgruber bernhardmgruber commented Mar 20, 2026

Replaces the LoadInternal and StoreInternal partial template specializations with if constexpr. Common code paths are de-duplicated, which makes the code considerably more concise.

  • No SASS changes on cub.test.block.load.it_11 for SM120
  • No SASS changes on cub.test.block.store.it_11 for SM120
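
For readers unfamiliar with the pattern, here is a minimal host-side sketch of the kind of change described above. It is not the actual CUB code: `LoadAlgorithm`, `LoadInternalOld`, and `load_new` are hypothetical names made up for illustration, and the indexing is simplified to plain arrays. It only shows how one partial specialization per algorithm can collapse into a single function whose shared loop is written once.

```cpp
// Hypothetical algorithm tag (illustration only, not CUB's enum).
enum class LoadAlgorithm { Direct, Striped };

// Before: one partial specialization per algorithm, each repeating the loop.
template <LoadAlgorithm Alg, int ItemsPerThread>
struct LoadInternalOld;

template <int ItemsPerThread>
struct LoadInternalOld<LoadAlgorithm::Direct, ItemsPerThread> {
  static void load(int tid, int /*threads*/, const int* in, int (&items)[ItemsPerThread]) {
    // Blocked arrangement: each thread reads a contiguous chunk.
    for (int i = 0; i < ItemsPerThread; ++i) {
      items[i] = in[tid * ItemsPerThread + i];
    }
  }
};

template <int ItemsPerThread>
struct LoadInternalOld<LoadAlgorithm::Striped, ItemsPerThread> {
  static void load(int tid, int threads, const int* in, int (&items)[ItemsPerThread]) {
    // Striped arrangement: consecutive threads read consecutive elements.
    for (int i = 0; i < ItemsPerThread; ++i) {
      items[i] = in[i * threads + tid];
    }
  }
};

// After: a single function; the common loop appears once and only the
// algorithm-specific index computation is selected at compile time.
template <LoadAlgorithm Alg, int ItemsPerThread>
void load_new(int tid, int threads, const int* in, int (&items)[ItemsPerThread]) {
  for (int i = 0; i < ItemsPerThread; ++i) {
    if constexpr (Alg == LoadAlgorithm::Direct) {
      items[i] = in[tid * ItemsPerThread + i];
    } else {
      static_assert(Alg == LoadAlgorithm::Striped, "unhandled algorithm");
      items[i] = in[i * threads + tid];
    }
  }
}
```

Because the discarded `if constexpr` branch is never instantiated, both forms should compile to the same code for a given algorithm, which is consistent with the "no SASS changes" observation above.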

@bernhardmgruber bernhardmgruber requested a review from a team as a code owner March 20, 2026 16:22
@github-project-automation github-project-automation bot moved this to Todo in CCCL Mar 20, 2026
@cccl-authenticator-app cccl-authenticator-app bot moved this from Todo to In Review in CCCL Mar 20, 2026

fbusato commented Mar 20, 2026

The PR is pretty big. Could you please add a brief summary of the changes to the description?

@github-actions

🥳 CI Workflow Results

🟩 Finished in 5h 03m: Pass: 100%/249 | Total: 8d 17h | Max: 1h 46m | Hits: 59%/159565

See results here.

@bernhardmgruber bernhardmgruber merged commit d36cdc5 into NVIDIA:main Mar 21, 2026
527 of 534 checks passed
@bernhardmgruber bernhardmgruber deleted the ref_blockload branch March 21, 2026 20:57
@github-project-automation github-project-automation bot moved this from In Review to Done in CCCL Mar 21, 2026