vulkan: handle mat_mul with A matrix > 4GB #16176

jeffbolznv · 2025-09-22T16:06:50Z

This change splits mat_mul operations with huge A matrix into chunks in the M dimension. This works well for stable-diffusion use cases where the im2col matrix has very large M.

Fix the order of setting the stride in mul_mm_cm2 - setting the dimension clobbers the stride, so stride should be set after.

This, along with #16135, should be enough to get stable-diffusion wan support working. This isn't meant to be a general direction for handling huge buffers, just a relatively small change that enables these interesting models.

0cc4m

Same issue with the allocation size, but fine otherwise.

This change splits mat_mul operations with huge A matrix into chunks in the M dimension. This works well for stable-diffusion use cases where the im2col matrix has very large M. Fix the order of setting the stride in mul_mm_cm2 - setting the dimension clobbers the stride, so stride should be set after.

* vulkan: handle mat_mul with A matrix > 4GB This change splits mat_mul operations with huge A matrix into chunks in the M dimension. This works well for stable-diffusion use cases where the im2col matrix has very large M. Fix the order of setting the stride in mul_mm_cm2 - setting the dimension clobbers the stride, so stride should be set after. * build fixes

jeffbolznv requested review from 0cc4m and slaren as code owners September 22, 2025 16:06

github-actions bot added testing Everything test related Vulkan Issues specific to the Vulkan backend ggml changes relating to the ggml tensor library for machine learning labels Sep 22, 2025

0cc4m approved these changes Sep 27, 2025

View reviewed changes

jeffbolznv force-pushed the split_m branch from b65b8f6 to b80d864 Compare September 27, 2025 15:33

slaren approved these changes Sep 27, 2025

View reviewed changes

build fixes

c948908

jeffbolznv merged commit 1384abf into ggml-org:master Sep 28, 2025
64 of 67 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

vulkan: handle mat_mul with A matrix > 4GB #16176

vulkan: handle mat_mul with A matrix > 4GB #16176

Uh oh!

jeffbolznv commented Sep 22, 2025

Uh oh!

0cc4m left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

vulkan: handle mat_mul with A matrix > 4GB #16176

vulkan: handle mat_mul with A matrix > 4GB #16176

Uh oh!

Conversation

jeffbolznv commented Sep 22, 2025

Uh oh!

0cc4m left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants