Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantization range (#5721)
author     Kawrakow <redacted>
           Mon, 26 Feb 2024 16:28:38 +0000 (18:28 +0200)
committer  GitHub <redacted>
           Mon, 26 Feb 2024 16:28:38 +0000 (18:28 +0200)
commit     a33e6a0d2a66104ea9a906bdbf8a94d050189d91
tree       30478b4a0b1792d1af66c5d64e2c3c4fa1af74ab
parent     47bb7b48c7cec9d8f57d56812ce811ec130b89a3
Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantization range (#5721)

* Adding IQ2_S and IQ2_M as a single cumulative commit

* Update examples/quantize/quantize.cpp

Co-authored-by: Georgi Gerganov <redacted>
---------

Co-authored-by: Iwan Kawrakow <redacted>
Co-authored-by: Georgi Gerganov <redacted>
12 files changed:
examples/quantize/quantize.cpp
ggml-cuda.cu
ggml-metal.m
ggml-metal.metal
ggml-quants.c
ggml-quants.h
ggml.c
ggml.h
llama.cpp
llama.h
tests/test-backend-ops.cpp
tests/test-quantize-fns.cpp
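
For context, a minimal sketch (not part of this commit's diff) of how the two new file types might be selected through the public C API in llama.h, assuming the LLAMA_FTYPE_MOSTLY_IQ2_S / LLAMA_FTYPE_MOSTLY_IQ2_M enum values introduced here and the stock llama_model_quantize entry point; the file paths and thread count are placeholders, and the exact field set of llama_model_quantize_params can differ between llama.cpp revisions.

    // Sketch: quantize a GGUF model to the new IQ2_S type via the C API.
    #include "llama.h"
    #include <stdio.h>

    int main(void) {
        llama_backend_init();  // note: older revisions take a bool `numa` argument

        struct llama_model_quantize_params params = llama_model_quantize_default_params();
        params.ftype   = LLAMA_FTYPE_MOSTLY_IQ2_S;  // or LLAMA_FTYPE_MOSTLY_IQ2_M
        params.nthread = 4;                         // placeholder thread count

        // llama_model_quantize returns 0 on success.
        if (llama_model_quantize("model-f16.gguf", "model-iq2_s.gguf", &params) != 0) {
            fprintf(stderr, "quantization to IQ2_S failed\n");
            llama_backend_free();
            return 1;
        }

        llama_backend_free();
        return 0;
    }

The same selection is also exposed on the command line by examples/quantize, where the commit adds IQ2_S and IQ2_M to the accepted quantization type names.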