Commit 6fb3dfe

Update llama-quant.cpp
1 parent 4739423 commit 6fb3dfe

1 file changed: +1 -0 lines changed

src/llama-quant.cpp

Lines changed: 1 addition & 0 deletions
@@ -212,6 +212,7 @@ static ggml_type llama_tensor_get_type(quantize_state_impl & qs, ggml_type new_t
     // Layers 0, 1, 2 are Dense so Q4_K
     // 3, 4, 5 left as Q2_K
     if (is_one_bit) {
+        // 3, 4, 5, 6, 7, 8 left as 2.06 bpw
         if (i_layer < 9) new_type = GGML_TYPE_IQ2_XXS; // 2.06 bpw
     }
     else {
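
The condition visible in this hunk can be isolated as a small standalone sketch. It is not the actual llama.cpp code: `QuantType` is a hypothetical stand-in for `ggml_type`, `pick_type` is an illustrative name, and because the hunk does not show the `else` branch or any surrounding conditions, the sketch simply passes the incoming type through whenever the override does not apply.

```cpp
#include <cassert>

// Hypothetical stand-in for ggml_type; only the type named in the hunk is mirrored.
enum class QuantType { Unchanged, IQ2_XXS };

// Mirrors only the visible condition: in the one-bit branch, layers with
// index below 9 are overridden to IQ2_XXS (2.06 bpw per the diff comment).
// Everything else is an assumption: the incoming type is returned untouched.
constexpr QuantType pick_type(bool is_one_bit, int i_layer, QuantType new_type) {
    return (is_one_bit && i_layer < 9) ? QuantType::IQ2_XXS : new_type;
}

// Compile-time checks of the boundary at layer 9.
static_assert(pick_type(true, 8, QuantType::Unchanged) == QuantType::IQ2_XXS,
              "layer 8 is overridden in the one-bit branch");
static_assert(pick_type(true, 9, QuantType::Unchanged) == QuantType::Unchanged,
              "layer 9 keeps its incoming type");
static_assert(pick_type(false, 3, QuantType::Unchanged) == QuantType::Unchanged,
              "outside the one-bit branch nothing is overridden here");
```

Note that `i_layer < 9` on its own also matches layers 0, 1 and 2. Whether those layers are handled elsewhere in the real function cannot be determined from this hunk.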
