]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantization range (llama...
authorKawrakow <redacted>
Mon, 26 Feb 2024 16:28:38 +0000 (18:28 +0200)
committerGeorgi Gerganov <redacted>
Wed, 28 Feb 2024 09:18:31 +0000 (11:18 +0200)
commit55593c014f9ba448d287bc0e07a387b07d88edeb
tree9f968d8dc3b8a17e2da018271dca04282f35abfa
parentb4a811aca8921e6b40b66905caa0a90250c4408a
Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantization range (llama/5721)

* Adding IQ2_S and IQ2_M as a single cumulative commit

* Update examples/quantize/quantize.cpp

Co-authored-by: Georgi Gerganov <redacted>
---------

Co-authored-by: Iwan Kawrakow <redacted>
Co-authored-by: Georgi Gerganov <redacted>
include/ggml/ggml.h
src/ggml-cuda.cu
src/ggml-metal.m
src/ggml-metal.metal
src/ggml-quants.c
src/ggml-quants.h
src/ggml.c
tests/test-backend-ops.cpp
tests/test-quantize-fns.cpp