Slow compilation #846
Answered
by
ikawrakow
wallentri88
asked this question in
Q&A
-
Default compile options compile ALL quants:
etc etc Is there's a way to limit it to quants I actually use? I only use IQ4_K and IQ5_K ones. I feel unused quants are just wasting compilation time :( |
Beta Was this translation helpful? Give feedback.
Answered by
ikawrakow
Oct 20, 2025
Replies: 1 comment
-
I know, it is painful when the CUDA mmq and/or mmvq kernels need to be rebuild. There is currently no way to select a subset of the quantization types. You can add a feature request if you like. |
Beta Was this translation helpful? Give feedback.
0 replies
Answer selected by
wallentri88
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I know, it is painful when the CUDA mmq and/or mmvq kernels need to be rebuild. There is currently no way to select a subset of the quantization types. You can add a feature request if you like.