]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
ggml-cpu: aarm64: q4_K repack gemm and gemv implementations (dotprod only) (llama...
authorAlberto Cabrera Pérez <redacted>
Thu, 27 Nov 2025 11:25:14 +0000 (11:25 +0000)
committerGeorgi Gerganov <redacted>
Fri, 12 Dec 2025 15:53:10 +0000 (17:53 +0200)
commit93f6cdb9c060a52d2ebb91e6a14e3a97bac98086
treec313020d2fb2649f7cb7bff6d76d954c777a3def
parentac92424b5903dd3f7697d1717c1352cc11d37861
ggml-cpu: aarm64: q4_K repack gemm and gemv implementations (dotprod only) (llama/17494)

* Enabled q4_K_4x8 path

* Fixed generic Q4_K 8x4 implementation

* wip: dotprod gemm

* Working arm q4_K dotprod gemm

Signed-off-by: Alberto Cabrera <redacted>
* Undo acc rename

Signed-off-by: Alberto Cabrera <redacted>
* Q4_K arm dotprod gemm

Signed-off-by: Alberto Cabrera <redacted>
* Fix: q4_qs reinterpret from uint to int

Signed-off-by: Alberto Cabrera <redacted>
* Removed comments

* Fixed macro guards

* Fixed unused vars in generic implementation

* Fixed unused vars in 8x4 repack

* Fixed unused vars in generic implementation, unneeded comment

* Missing arch fallback for x86

* minor : style

---------

Signed-off-by: Alberto Cabrera <redacted>
Co-authored-by: Georgi Gerganov <redacted>
ggml/src/ggml-cpu/arch-fallback.h
ggml/src/ggml-cpu/arch/arm/repack.cpp
ggml/src/ggml-cpu/repack.cpp
ggml/src/ggml-cpu/repack.h