git.djapps.eu Git - pkg/ggml/sources/ggml/commit
gguf-py : add Numpy MXFP4 de/quantization support (llama/15111)
author compilade <redacted>
Fri, 8 Aug 2025 21:48:26 +0000 (17:48 -0400)
committer Georgi Gerganov <redacted>
Thu, 14 Aug 2025 11:17:28 +0000 (14:17 +0300)
commit a38c4bc4109aae4a203f062f3a4cb36317de1b57
tree 76f1e68d1eed7bf70f2dfb1f02ef64ed12d39035
parent bd3d3fd04429f0453ab3ee1b321b1d34c2133798

* gguf-py : add MXFP4 de/quantization support

* ggml-quants : handle zero amax for MXFP4
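For readers unfamiliar with the format, the following is a minimal NumPy sketch of MXFP4-style block quantization, including the all-zero (zero amax) case that the second item refers to. It is an illustration under stated assumptions, not the code from this commit: it follows the general OCP Microscaling idea (32 elements sharing one power-of-two scale, each element a 4-bit E2M1 value), and the names `quantize_block`, `dequantize_block`, `E2M1_VALUES` and `BLOCK_SIZE` are hypothetical. It returns an unbiased integer exponent and one code per element, and does not reproduce the packed byte layout, exponent bias, or exact rounding rules used by ggml or gguf-py.

```python
import numpy as np

# Magnitudes representable by a 4-bit E2M1 element (the sign is a separate bit).
E2M1_VALUES = np.array([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0], dtype=np.float32)
BLOCK_SIZE = 32  # elements sharing one scale (assumption: OCP MX block size)


def quantize_block(x):
    """Return (exponent, codes) for one block; hypothetical helper."""
    x = np.asarray(x, dtype=np.float32)
    assert x.shape == (BLOCK_SIZE,)
    amax = float(np.abs(x).max())
    if amax == 0.0:
        # log2(0) is -inf, so an all-zero block needs an explicit branch.
        # The exponent chosen here (0) is an arbitrary assumption.
        return 0, np.zeros(BLOCK_SIZE, dtype=np.uint8)
    # The largest E2M1 magnitude is 6 = 1.5 * 2**2, hence the offset of 2.
    exp = int(np.floor(np.log2(amax))) - 2
    scaled = np.abs(x) / (2.0 ** exp)
    # Nearest representable magnitude; values above 6 saturate to 6.
    idx = np.abs(scaled[:, None] - E2M1_VALUES[None, :]).argmin(axis=1)
    sign = (x < 0).astype(np.uint8) << 3  # bit 3 holds the sign
    return exp, sign | idx.astype(np.uint8)


def dequantize_block(exp, codes):
    """Inverse of quantize_block, up to rounding error."""
    mag = E2M1_VALUES[codes & 0x7]
    sign = np.where(codes & 0x8, -1.0, 1.0).astype(np.float32)
    return sign * mag * np.float32(2.0 ** exp)
```

Without the zero-amax branch, `np.log2(0.0)` evaluates to `-inf` and the conversion to an integer exponent fails, which is the kind of edge case an all-zero block triggers.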
src/ggml-quants.c