Skip to content

Commit ecf111a

Browse files
ikawrakowIwan Kawrakow
andauthored
Deepseek-Lite (#184)
* Quantization mixes tweaks * Make iq4_nl_r4 work with row size that are not a multiple of 128 ... on Zen4 * Make iq4_nl_r4 work with row size that are not a multiple of 128 ... on AVX2 * Make iq4_nl_r4 work with row size that are not a multiple of 128 ... on AVX2 * Make q6_0_w4 work with row size that are not a multiple of 128 ... on Zen4 * Make q6_0_w4 work with row size that are not a multiple of 128 ... on Zen4 * Make q5_0_r4 work with row size that are not a multiple of 128 ... on Zen4 and AVX2 * Make q5,6_0_r4, iq4_nl_e4 work with row size that are not a multiple of 128 also on NEON. --------- Co-authored-by: Iwan Kawrakow <[email protected]>
1 parent 2e6b523 commit ecf111a

File tree

2 files changed

+315
-170
lines changed

2 files changed

+315
-170
lines changed

0 commit comments

Comments
 (0)