git.djapps.eu Git - pkg/ggml/sources/ggml/commit
ggml : importance matrix support for legacy quants (llama/4969)
author: Kawrakow <redacted>
Tue, 16 Jan 2024 17:51:26 +0000 (19:51 +0200)
committer: Georgi Gerganov <redacted>
Wed, 17 Jan 2024 18:44:10 +0000 (20:44 +0200)
commit: 006c363a1a151be7cedec108ef4e17425385ad42
tree: 3f586bb07ab585ba5e159531afdc36b01bd425bb
parent: fd8267ab59b06ee924676fb571fe7048bc58069e
ggml : importance matrix support for legacy quants (llama/4969)

* imatrix: adding support for legacy quants

* imatrix: guard Q4_0/Q5_0 against ffn_down craziness

---------

Co-authored-by: Iwan Kawrakow <redacted>
src/ggml-quants.c
src/ggml-quants.h
src/ggml.c