]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
ggml-cpu: Support Q5_0 and Q5_1 on s390x (llama/15486)
authorAaron Teo <redacted>
Fri, 22 Aug 2025 08:11:04 +0000 (16:11 +0800)
committerGeorgi Gerganov <redacted>
Fri, 5 Sep 2025 09:54:01 +0000 (12:54 +0300)
commit135ff302bbb16fecbdecb94ff9d66f28aa7da0e3
tree6ca9449c103a1b9c8c0c3c0fea26c2a21640b55c
parent216254c95de9ae72b678505dc780a39d8b7b1eb6
ggml-cpu: Support Q5_0 and Q5_1 on s390x (llama/15486)

* ggml-cpu: initial q5_0 impl for s390x

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: updated q5_0 code for better performance

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: use optimised hsum for better performance

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: introduce q5_1 simd + refactor q5_0

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: fix incorrect return type vec_hsum

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: q5_0 incomplete refactor + table_b2b_0 activation

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: refactor q5_1

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: q5_1 update loop unroll to 4

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: update q5_0 unroll to 4

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: update build-s390x docs

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: update unused variables q5_0

Signed-off-by: Aaron Teo <redacted>
* docs: update the last update date

Signed-off-by: Aaron Teo <redacted>
---------

Signed-off-by: Aaron Teo <redacted>
src/ggml-cpu/arch-fallback.h
src/ggml-cpu/arch/s390/quants.c
src/ggml-cpu/ggml-cpu-impl.h