Add TileGym executor for SDPA and RMSNorm #2800

Draft
crcrpar wants to merge 1 commit into main from crpa/tilegym-cuda-tile

crcrpar (Collaborator) commented Dec 13, 2025

What does this PR do?

This adds an optional `tilegym` executor that can dispatch SDPA (prefill/decode) and RMSNorm to TileGym kernels under conservative checkers.
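To illustrate what "dispatch under conservative checkers" can look like, here is a minimal, self-contained sketch in plain Python. It is not the code in this PR and does not use the Thunder or TileGym APIs: `TensorMeta`, `OptionalExecutor`, `rms_norm_checker`, and the specific dtype/device conditions are all hypothetical names and assumptions chosen for illustration. The idea it shows is that an optional executor claims an op only when its checker accepts the inputs, and otherwise the default path runs.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional, Tuple


@dataclass(frozen=True)
class TensorMeta:
    """Hypothetical stand-in for tensor metadata (no real tensors needed)."""
    shape: Tuple[int, ...]
    dtype: str
    device: str


class OptionalExecutor:
    """Hypothetical executor: each op has a checker and an implementation."""

    def __init__(self, name: str, available: bool) -> None:
        self.name = name
        self.available = available  # e.g. False if the backend is not installed
        self._impls: Dict[str, Tuple[Callable[..., bool], Callable[..., str]]] = {}

    def register(self, op: str, checker: Callable[..., bool], fn: Callable[..., str]) -> None:
        self._impls[op] = (checker, fn)

    def try_dispatch(self, op: str, *args) -> Optional[str]:
        # Decline (return None) unless the backend is present, the op is
        # registered, and the checker explicitly accepts the inputs.
        if not self.available or op not in self._impls:
            return None
        checker, fn = self._impls[op]
        return fn(*args) if checker(*args) else None


def rms_norm_checker(x: TensorMeta) -> bool:
    # Assumed conditions, for illustration only: accept a narrow, known-good
    # set of inputs and reject everything else.
    return x.device == "cuda" and x.dtype in ("float16", "bfloat16") and len(x.shape) >= 2


def dispatch(executor: OptionalExecutor, op: str, *args) -> str:
    # Fall back to the default path whenever the optional executor declines.
    result = executor.try_dispatch(op, *args)
    return result if result is not None else "default"


tilegym_like = OptionalExecutor("tilegym", available=True)
tilegym_like.register("rms_norm", rms_norm_checker, lambda x: "tilegym")

on_gpu = TensorMeta(shape=(2, 8), dtype="bfloat16", device="cuda")
on_cpu = TensorMeta(shape=(2, 8), dtype="float32", device="cpu")
print(dispatch(tilegym_like, "rms_norm", on_gpu))
print(dispatch(tilegym_like, "rms_norm", on_cpu))
```

The checker is "conservative" in the sense that it rejects anything it does not explicitly recognize, so unsupported inputs silently take the default path rather than failing inside the specialized kernel.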

Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com>