]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
gguf-py : add Numpy MXFP4 de/quantization support (llama/15111)
authorcompilade <redacted>
Fri, 8 Aug 2025 21:48:26 +0000 (17:48 -0400)
committerGeorgi Gerganov <redacted>
Mon, 18 Aug 2025 17:30:45 +0000 (20:30 +0300)
commit62566a54365795cd509d8075cd2ea706d491d72f
treece7409cd2e767c6984c6e4a2ddb02faf49173643
parent573bf9d12804dc1352fd59d15b65399b26df6375
gguf-py : add Numpy MXFP4 de/quantization support (llama/15111)

* gguf-py : add MXFP4 de/quantization support

* ggml-quants : handle zero amax for MXFP4
ggml/src/ggml-quants.c