CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 #12315

IMbackK · 2025-03-10T18:36:31Z

When fattn-wmma was ported over to warp64 various bit that also touch fattn-vec where converted to selectable warp size, however the fattn-vec kernels dont work with 64 wide warps for now, so we need to avoid launching them with parameters for warp64

This is a temporary fix for #12238 until we adjust the fattn_vec kernels to work when head size == warp size

When fattn-wmma was ported over to warp64 various bit that also touch fattn-vec where converted to selectable warp size, however the fattn-vec kernels dont work with 64 wide warps for now, so we need to avoid launching them with parameters for warp64

…2315) When fattn-wmma was ported over to warp64 various bits that also touch fattn-vec where converted to selectable warp size, however the fattn-vec kernels dont work with 64 wide warps for now, so we need to avoid launching them with parameters for warp64

IMbackK requested a review from JohannesGaessler as a code owner March 10, 2025 18:36

github-actions bot added Nvidia GPU Issues specific to Nvidia GPUs ggml changes relating to the ggml tensor library for machine learning labels Mar 10, 2025

IMbackK force-pushed the fattn_fix branch from fb4cd9e to 758126a Compare March 10, 2025 18:39

JohannesGaessler approved these changes Mar 12, 2025

View reviewed changes

IMbackK merged commit 34c961b into ggml-org:master Mar 12, 2025
47 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 #12315

CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 #12315

Uh oh!

IMbackK commented Mar 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 #12315

CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 #12315

Uh oh!

Conversation

IMbackK commented Mar 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants