forked from ggml-org/llama.cpp
-
Notifications
You must be signed in to change notification settings - Fork 588
Closed
Description
There is a bug in Vulkan on AMD cards and it will probably stay until AMD fixes the drivers - maybe someday, maybe never.
I propose to put notes that Q8 does not work properly with AMD with Vulkan. Other Quants with L (Q8_0 for embed and output weights) work without any problems.
I haven't noticed any problems with other quants, new fixes improved performance by 2-3 t/s, nice!
Radeon 6900xt
koboldcpp-1.87
Windows 10 with new AMD drivers
Metadata
Metadata
Assignees
Labels
No labels