
Conversation

@jeffbolznv
Collaborator

Change the code to do 16-bit loads when possible and extract the appropriate component late, so the code is effectively decoding a pair of elements and then selecting one. This can allow more commoning to happen in the compiler when neighboring elements are loaded.
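
For illustration, here is a minimal GLSL sketch of the pattern (not the actual shader: the buffer name `QData`/`qs`, the function `dequant_pair_select`, and the `scale`/`minv` math are simplified placeholders; real Q5_K decoding assembles 5-bit values from low nibbles plus a separate high-bit array with per-sub-block scales and mins, which this omits). The point is the shape of the access: one 16-bit load covers two 8-bit quants, both are decoded, and the requested one is selected at the end, so neighboring-element accesses share the load and the pair decode.

```glsl
#version 450
#extension GL_EXT_shader_16bit_storage : require
#extension GL_EXT_shader_explicit_arithmetic_types_int16 : require

// Hypothetical layout: quantized data viewed as packed 16-bit words.
layout (binding = 0) readonly buffer QData { uint16_t qs[]; };

float dequant_pair_select(uint base, uint idx, float scale, float minv) {
    // One 16-bit load covers two consecutive 8-bit quant values.
    uint pair = uint(qs[base + (idx >> 1)]);

    // Decode both elements of the pair. For neighboring idx values that
    // share a word, the load and this arithmetic are identical, so the
    // compiler can common them across loop iterations.
    vec2 v = vec2(float(pair & 0xFFu), float((pair >> 8) & 0xFFu)) * scale - minv;

    // Select the requested component late.
    return v[idx & 1u];
}
```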

before

| model | size | params | backend | ngl | fa | test | t/s |
| ----- | ---: | -----: | ------- | --: | -: | ---: | ------------: |
| llama 3B Q5_K - Medium | 2.16 GiB | 3.21 B | Vulkan | 1000 | 1 | pp512 | 5131.92 ± 192.48 |

after

| model | size | params | backend | ngl | fa | test | t/s |
| ----- | ---: | -----: | ------- | --: | -: | ---: | ------------: |
| llama 3B Q5_K - Medium | 2.16 GiB | 3.21 B | Vulkan | 1000 | 1 | pp512 | 5400.29 ± 205.68 |
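
The rows above are llama-bench output: t/s is tokens per second ± standard deviation for prompt processing of 512 tokens (pp512), with all layers offloaded (ngl 1000) and flash attention enabled (fa 1). A representative invocation, with a hypothetical model path, would be `llama-bench -m models/llama-3b-q5_k_m.gguf -ngl 1000 -fa 1 -p 512`. The change yields roughly a 5% pp512 speedup on this model.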

@jeffbolznv requested a review from @0cc4m on December 16, 2024.
The github-actions bot added the Vulkan (issues specific to the Vulkan backend) and ggml (changes relating to the ggml tensor library for machine learning) labels on Dec 16, 2024.
@0cc4m (Collaborator) left a comment:

I can't test this without a driver change, but the code looks fine.

@0cc4m merged commit a91a413 into ggml-org:master on Dec 21, 2024.
2 checks passed
tinglou pushed a commit to tinglou/llama.cpp that referenced this pull request on Feb 13, 2025.
arthw pushed a commit to arthw/llama.cpp that referenced this pull request on Feb 26, 2025.
mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request on Mar 8, 2025.