Skip to content

Refactor cub::BlockLoadand cub::BlockStore#8120

Open
bernhardmgruber wants to merge 3 commits intoNVIDIA:mainfrom
bernhardmgruber:ref_blockload
Open

Refactor cub::BlockLoadand cub::BlockStore#8120
bernhardmgruber wants to merge 3 commits intoNVIDIA:mainfrom
bernhardmgruber:ref_blockload

Conversation

@bernhardmgruber
Copy link
Contributor

@bernhardmgruber bernhardmgruber commented Mar 20, 2026

Replaces the LoadInternal and StoreInternal partial template specializations by if constexpr. Common code paths are de-duplicated. This leads to a lot more concise code.

  • No SASS changes on cub.test.block.load.it_11 for SM120
  • No SASS changes on cub.test.block.store.it_11 for SM120

@bernhardmgruber bernhardmgruber requested a review from a team as a code owner March 20, 2026 16:22
@github-project-automation github-project-automation bot moved this to Todo in CCCL Mar 20, 2026
@cccl-authenticator-app cccl-authenticator-app bot moved this from Todo to In Review in CCCL Mar 20, 2026
@fbusato
Copy link
Contributor

fbusato commented Mar 20, 2026

the PR is pretty big. Could you please add a brief summary of the changes in the description?

@github-actions

This comment has been minimized.

@github-actions
Copy link
Contributor

🥳 CI Workflow Results

🟩 Finished in 5h 03m: Pass: 100%/249 | Total: 8d 17h | Max: 1h 46m | Hits: 59%/159565

See results here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: In Review

Development

Successfully merging this pull request may close these issues.

3 participants