
@lhl lhl commented Aug 11, 2025

I ran into issues compiling w/ -DGGML_HIP_ROCWMMA_FATTN=ON with the latest TheRock/ROCm (7.0) nightly releases. This appears to be due to a warp mask width incompatibility recently introduced between ROCm's rocWMMA library and CUDA-style sync code.

ROCm's rocWMMA library recently added its own __shfl_sync and __shfl_xor_sync functions, which require 64-bit masks, while the existing code uses hardcoded 32-bit masks (0xFFFFFFFF). This causes type conflicts and compilation failures when building with rocWMMA support enabled.

  • Added an #ifndef guard to hip.h for the sync functions
  • Added GGML_CUDA_WARP_MASK macro in ggml-cuda/common.cuh and ggml-cuda/vendors/hip.h
  • Replaced all hardcoded warp masks with the new macro across CUDA files
  • Tested builds on both CUDA and HIP

I tried confining the fix to hip.h and leaving the CUDA files alone, but I don't think there is a clean alternative to replacing the hard-coded masks.

@lhl lhl requested a review from JohannesGaessler as a code owner August 11, 2025 12:56
@github-actions github-actions bot added the labels "Nvidia GPU" (issues specific to Nvidia GPUs) and "ggml" (changes relating to the ggml tensor library for machine learning) Aug 11, 2025
@JohannesGaessler
Collaborator

When I grep rocWMMA for "shfl" I am not finding anything.

@IMbackK
Collaborator

IMbackK commented Aug 11, 2025

Duplicate of #15241; please see the discussion in that PR.
