Commit c75a37e

Update quantization documentation
1 parent 2f26c4b commit c75a37e

1 file changed: +1 -2 lines changed

README.md

Lines changed: 1 addition & 2 deletions
@@ -105,10 +105,9 @@ IMPORTANCE_MATRIX_DIRECTORY=.\imatrix
 # Hint: A very good quantization with minimal quality loss is
 # Q5_K_M. Quantization below 4-bit causes measurable quality
 # loss, try to avoid going too low and use IQ4_XS as a minimum.
-#
 # @see https://github.com/ggerganov/llama.cpp/tree/master/examples/perplexity
 #
-QUANTIZATION_TYPES=Q5_K_M,Q4_1,Q3_K_S
+QUANTIZATION_TYPES=Q5_K_M,IQ4_XS
 ```
 
 > [!NOTE]
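
The change above drops `Q3_K_S` from the default list, in line with the hint that going below 4-bit costs measurable quality. As a rough illustration only, the following Python sketch checks a `QUANTIZATION_TYPES` value against that guidance. The helper names (`parse_quantization_types`, `nominal_bits`, `below_minimum`) are hypothetical and not part of this repository, and reading the bit width from the digit in the type name is a naming heuristic assumed here, not something llama.cpp exposes.

```python
import re
from typing import List, Optional


def parse_quantization_types(value: str) -> List[str]:
    """Split a comma-separated QUANTIZATION_TYPES value into type names."""
    return [name.strip() for name in value.split(",") if name.strip()]


def nominal_bits(quant_type: str) -> Optional[int]:
    """Guess the nominal bit width from a name such as Q5_K_M or IQ4_XS.

    Assumption: the digits right after the Q/IQ prefix are the bit width.
    Names that do not follow this pattern (for example F16) return None.
    """
    match = re.match(r"I?Q(\d+)", quant_type)
    return int(match.group(1)) if match else None


def below_minimum(types: List[str], minimum_bits: int = 4) -> List[str]:
    """Return the types whose nominal bit width is below minimum_bits."""
    flagged = []
    for quant_type in types:
        bits = nominal_bits(quant_type)
        if bits is not None and bits < minimum_bits:
            flagged.append(quant_type)
    return flagged


# Values taken from the removed and added lines of the diff.
old_types = parse_quantization_types("Q5_K_M,Q4_1,Q3_K_S")
new_types = parse_quantization_types("Q5_K_M,IQ4_XS")

print(below_minimum(old_types))  # ['Q3_K_S']
print(below_minimum(new_types))  # []
```

Note that this check only explains the removal of `Q3_K_S`; `Q4_1` is nominally 4-bit and passes it, so its removal in this commit is a separate choice the diff does not explain.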
