]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
ggml : fix quant dot product with odd number of blocks (llama/8549)
authorslaren <redacted>
Fri, 19 Jul 2024 15:17:27 +0000 (17:17 +0200)
committerGeorgi Gerganov <redacted>
Sat, 27 Jul 2024 15:26:12 +0000 (18:26 +0300)
commit635545fcc661cc4bf7bccd666c8dee12230e49bc
treef4321c2ae54809c272f00e28701232ba5170b7f2
parent27969b3bd39d4335ea86eb4c4d735558b14053b1
ggml : fix quant dot product with odd number of blocks (llama/8549)

* ggml : fix iq4_nl dot product with odd number of blocks

* ggml : fix odd blocks for ARM_NEON (llama/8556)

* ggml : fix iq4_nl dot product with odd number of blocks

* ggml : fix q4_1

* ggml : fix q5_0

* ggml : fix q5_1

* ggml : fix iq4_nl metal

ggml-ci

* ggml : fix q4_0

* ggml : fix q8_0

ggml-ci

* ggml : remove special Q4_0 code for first 2 blocks

* ggml : fix sumf redefinition

---------

Co-authored-by: slaren <redacted>
---------

Co-authored-by: Georgi Gerganov <redacted>
src/ggml-metal.m
src/ggml-metal.metal
src/ggml-quants.c
tests/test-backend-ops.cpp