]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
cpu: introduce chunking for repack matmuls and enable matmul-id chunking on ARM64...
authorMax Krasnyansky <redacted>
Thu, 30 Oct 2025 16:06:13 +0000 (09:06 -0700)
committerGeorgi Gerganov <redacted>
Sun, 9 Nov 2025 21:38:03 +0000 (23:38 +0200)
commitffe1c832bd6a4b576d04aab2b5a1b042e621a2eb
treeda257a8db6427f3a3dbc430140b61aedd1eb7b52
parente1780b209ddeb194b14a7ba7d1e0d77a31caf494
cpu: introduce chunking for repack matmuls and enable matmul-id chunking on ARM64 (llama/16833)

Very similar implementation to the flash-attention chunking, with similar benefits.
ggml/src/ggml-cpu/ggml-cpu.c
ggml/src/ggml-cpu/repack.cpp