Disable GGML_HIP_ROCWMMA_FATTN #42

slojosic-amd · 2025-12-26T19:40:55Z

This PR disables GGML_HIP_ROCWMMA_FATTN so that performance can be tested at larger context sizes.
It should help fix issues like this one: #36
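
For readers unfamiliar with the option, here is a minimal sketch of how it might be switched off when configuring a llama.cpp HIP build. The helper name `hip_cmake_args`, the `build` directory, and the `gfx1151` target are assumptions for illustration. They are not taken from this PR's diff, and the repository's actual build workflow may pass these options differently.

```python
def hip_cmake_args(build_dir="build", gpu_target="gfx1151", rocwmma_fattn=False):
    """Return a cmake configure command as a list of arguments.

    Hypothetical helper: it only assembles the command line and does not
    run cmake. The defaults are illustrative assumptions.
    """
    flag = "ON" if rocwmma_fattn else "OFF"
    return [
        "cmake", "-S", ".", "-B", build_dir,
        "-DGGML_HIP=ON",                     # enable the HIP (ROCm) backend
        f"-DAMDGPU_TARGETS={gpu_target}",    # assumed GPU architecture
        f"-DGGML_HIP_ROCWMMA_FATTN={flag}",  # the option this PR turns off
    ]


if __name__ == "__main__":
    # Print the configure command with rocWMMA flash attention disabled.
    print(" ".join(hip_cmake_args()))
```

Building both variants (`rocwmma_fattn=True` and `rocwmma_fattn=False`) and benchmarking each at several context sizes is one way to produce the before/after comparison.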


Goldenkoron · 2026-01-31T10:01:02Z

Please merge this. ROCm has been close to unusable for a while now on Strix Halo, and I want to use newer models that are not compatible with the older working versions.

slojosic-amd · 2026-02-02T16:44:01Z

@Goldenkoron did you confirm with https://github.com/lemonade-sdk/llamacpp-rocm/actions/runs/20528450015/artifacts/4972554015 that disabling GGML_HIP_ROCWMMA_FATTN gives you better perf numbers for #36?

Goldenkoron · 2026-02-02T16:50:33Z

> @Goldenkoron did you confirm with https://github.com/lemonade-sdk/llamacpp-rocm/actions/runs/20528450015/artifacts/4972554015 that disabling GGML_HIP_ROCWMMA_FATTN gives you better perf numbers for #36

Sorry, I didn't see that a Windows release had been posted. I'll test this later today when I'm off work and report back.

danielholanda · 2026-02-02T16:55:03Z

Thanks @slojosic-amd and @Goldenkoron. Please provide some numbers showing that this indeed solves the problem, and I will be happy to merge this.

danielholanda · 2026-02-03T16:39:54Z

Confirmed as discussed in #36

slojosic-amd requested a review from danielholanda December 26, 2025 19:40

Slobodan Josic added 2 commits December 26, 2025 20:51

Disable GGML_HIP_ROCWMMA_FATTN due to negative effect on PP numbers for bigger context sizes

fac923e

Add missing gfx1150 logo

f0ba3e1

slojosic-amd force-pushed the disable_rocwmma_fattn branch from c1686b2 to f0ba3e1 December 26, 2025 19:52

slojosic-amd mentioned this pull request Dec 26, 2025

Massive slowdown in token generation speed at higher context sizes from some recent version after b1130 with gfx1151 #36

Closed

Merge branch 'main' into disable_rocwmma_fattn

08aecfa

danielholanda assigned slojosic-amd Feb 2, 2026

danielholanda approved these changes Feb 3, 2026


danielholanda merged commit f40a8fa into main Feb 3, 2026
25 of 26 checks passed

slojosic-amd deleted the disable_rocwmma_fattn branch February 3, 2026 22:32
