*Major T/s improvement* Use the Metal qmatmul MM kernels (#2615)
* Add GGUF BF16 support (#17)
* Add GGUF bf16 type support
* Add non avx impl for vec_dot_bf16
* Fix from_u32
* Fix loading
* Fix dequant of bf16
* Update kernels for metal bf16 (#19)
* Update kernels for metal bf16
* Fix typo
* Check whether bfloat is available
* Sync ggml metal kernels (#33)
* Metal qmatmul mat-mat product (#39)
* Test passes
* All tests pass
* Now all the tests really pass
* Try out always using mm
* Mirror llama.cpp metric
* Mirror llama.cpp metric
* Update test
* Update test
* Fix merge error
---------
Co-authored-by: keighbee <[email protected]>