]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantization range (llama...
authorKawrakow <redacted>
Mon, 26 Feb 2024 16:28:38 +0000 (18:28 +0200)
committerGeorgi Gerganov <redacted>
Wed, 28 Feb 2024 11:00:29 +0000 (13:00 +0200)
commit7b1d8ea7e0ae7a68fbea86201a0d6a026a1fcf59
tree90bb1da2eb7c3be87d717d1bcc6b2871321510d3
parentb1f7223a0a306093d188bd14b42d380fa31d41d2
Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantization range (llama/5721)

* Adding IQ2_S and IQ2_M as a single cumulative commit

* Update examples/quantize/quantize.cpp

Co-authored-by: Georgi Gerganov <redacted>
---------

Co-authored-by: Iwan Kawrakow <redacted>
Co-authored-by: Georgi Gerganov <redacted>
ggml-cuda.cu
ggml-metal.m
ggml-metal.metal
ggml-quants.c
ggml-quants.h
ggml.c
ggml.h