]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
ggml-cpu: Support Q5_0 and Q5_1 on s390x (llama/15486)
authorAaron Teo <redacted>
Fri, 22 Aug 2025 08:11:04 +0000 (16:11 +0800)
committerGeorgi Gerganov <redacted>
Sat, 20 Sep 2025 10:42:39 +0000 (13:42 +0300)
commit380d3db21638a6bb07bf6c52f5218b1a66b2cde4
tree040f47118f6f27181218c5f36ebea51f63624450
parentbe841c3f6e72fb520ff2ce9c32832bdb6ab4290e
ggml-cpu: Support Q5_0 and Q5_1 on s390x (llama/15486)

* ggml-cpu: initial q5_0 impl for s390x

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: updated q5_0 code for better performance

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: use optimised hsum for better performance

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: introduce q5_1 simd + refactor q5_0

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: fix incorrect return type vec_hsum

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: q5_0 incomplete refactor + table_b2b_0 activation

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: refactor q5_1

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: q5_1 update loop unroll to 4

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: update q5_0 unroll to 4

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: update build-s390x docs

Signed-off-by: Aaron Teo <redacted>
* ggml-cpu: update unused variables q5_0

Signed-off-by: Aaron Teo <redacted>
* docs: update the last update date

Signed-off-by: Aaron Teo <redacted>
---------

Signed-off-by: Aaron Teo <redacted>
ggml/src/ggml-cpu/arch-fallback.h
ggml/src/ggml-cpu/arch/s390/quants.c
ggml/src/ggml-cpu/ggml-cpu-impl.h