Skip to content

Commit 3fa428a

Browse files
0cc4mqnixsynapse
authored andcommitted
vulkan: apply MUL_MAT_ID subgroup optimization to non-coopmat devices (ggml-org#15524)
* vulkan: use subgroup function for mul_mat_id shader even without coopmat * vulkan: fix compile warnings * vulkan: properly check for subgroup size control and require full subgroups for subgroup mul_mat_id * vulkan: disable subgroup mul_mat_id on devices with subgroups < 16
1 parent 0289568 commit 3fa428a

File tree

3 files changed

+293
-206
lines changed

3 files changed

+293
-206
lines changed

0 commit comments

Comments
 (0)