]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
ggml : Q2k interleaving implementation - x86/x64 SIMD (llama/14373)
authorSrihari-mcw <redacted>
Fri, 1 Aug 2025 06:20:33 +0000 (11:50 +0530)
committerGeorgi Gerganov <redacted>
Mon, 18 Aug 2025 17:30:45 +0000 (20:30 +0300)
commit1c6cb7df475f72db1c5574d193cccd105948d76b
treea504078e75f0bd6e06f54815b376a566f2dae885
parent78668cb8d1c79e296b85b06caa53cd78c2d8f4bb
ggml : Q2k interleaving implementation - x86/x64 SIMD (llama/14373)

* Initial Q2_K Block Interleaving Implementation

* Addressed review comments and clean up of the code

* Post rebase fixes

* Initial CI/CD fixes

* Update declarations in arch-fallback.h

* Changes for GEMV Q2_K in arch-fallback.h

* Enable repacking only on AVX-512 machines

* Update comments in repack.cpp

* Address q2k comments

---------

Co-authored-by: Manogna-Sree <redacted>
ggml/src/ggml-cpu/arch-fallback.h
ggml/src/ggml-cpu/arch/x86/repack.cpp
ggml/src/ggml-cpu/repack.cpp
ggml/src/ggml-cpu/repack.h