]>
git.djapps.eu Git - pkg/ggml/sources/ggml/commit
ggml-cpu: aarm64: q4_K repack gemm and gemv implementations (dotprod only) (llama/17494)
* Enabled q4_K_4x8 path
* Fixed generic Q4_K 8x4 implementation
* wip: dotprod gemm
* Working arm q4_K dotprod gemm
Signed-off-by: Alberto Cabrera <redacted>
* Undo acc rename
Signed-off-by: Alberto Cabrera <redacted>
* Q4_K arm dotprod gemm
Signed-off-by: Alberto Cabrera <redacted>
* Fix: q4_qs reinterpret from uint to int
Signed-off-by: Alberto Cabrera <redacted>
* Removed comments
* Fixed macro guards
* Fixed unused vars in generic implementation
* Fixed unused vars in 8x4 repack
* Fixed unused vars in generic implementation, unneeded comment
* Missing arch fallback for x86
* minor : style
---------
Signed-off-by: Alberto Cabrera <redacted>
Co-authored-by: Georgi Gerganov <redacted>