Question about v1.29's no CLBLAST for K-quants #219
Replies: 3 comments 5 replies
-
That is correct. Until the dequantize and mat_mul kernels are implemented, K-quants will not work with CLBlast. You can also follow this issue ggml-org#1725 Someone will have to come up with the implementation as I am not familiar enough with CL kernels to do them correctly. I'm not sure why the person who added k-quants didn't add a CL implementation as well. |
Beta Was this translation helpful? Give feedback.
-
If anyone is able to write CL kernels, contributions (either here or preferably upstream) are greatly welcome |
Beta Was this translation helpful? Give feedback.
-
Implemented in newest version. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
So I understand that the new K-quantization is only supported by openblas for now. Without CLblast does that also mean no GPU layers offloading?
Beta Was this translation helpful? Give feedback.
All reactions