]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
ggml : fix quant dot product with odd number of blocks (#8549)
authorslaren <redacted>
Fri, 19 Jul 2024 15:17:27 +0000 (17:17 +0200)
committerGitHub <redacted>
Fri, 19 Jul 2024 15:17:27 +0000 (17:17 +0200)
commit87e397d00bdcedd5cbf6dfda06a7b0f302462728
tree2c702009fc5cc31332781d145c2aa4b2ed7219c0
parent57b1d4f9eb6f2c139b31ea79626d954b261e1051
ggml : fix quant dot product with odd number of blocks (#8549)

* ggml : fix iq4_nl dot product with odd number of blocks

* ggml : fix odd blocks for ARM_NEON (#8556)

* ggml : fix iq4_nl dot product with odd number of blocks

* ggml : fix q4_1

* ggml : fix q5_0

* ggml : fix q5_1

* ggml : fix iq4_nl metal

ggml-ci

* ggml : fix q4_0

* ggml : fix q8_0

ggml-ci

* ggml : remove special Q4_0 code for first 2 blocks

* ggml : fix sumf redefinition

---------

Co-authored-by: slaren <redacted>
---------

Co-authored-by: Georgi Gerganov <redacted>
ggml/src/ggml-metal.m
ggml/src/ggml-metal.metal
ggml/src/ggml-quants.c
tests/test-backend-ops.cpp