Skip to content

b6121

Latest

Choose a tag to compare

@github-actions github-actions released this 09 Aug 04:24
e54d41b
gguf-py : add Numpy MXFP4 de/quantization support (#15111)

* gguf-py : add MXFP4 de/quantization support

* ggml-quants : handle zero amax for MXFP4