]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
ggml-cpu: FA split across kv for faster TG (llama/19209)
authorAman Gupta <redacted>
Mon, 2 Feb 2026 17:19:55 +0000 (01:19 +0800)
committerGeorgi Gerganov <redacted>
Sun, 8 Feb 2026 07:29:10 +0000 (09:29 +0200)
commit871063016d1f72a74a55a0cc5e0db485aba8f74e
treeea87cbd00580307d53dbe3e334dbf9522c5ef21b
parentc4003da2b838a923eeb7879f91dc6ddd1f413af2
ggml-cpu: FA split across kv for faster TG (llama/19209)

* ggml-cpu: split across kv for faster TG

* simplify sinks application

* add ref impl
ggml/include/ggml-cpu.h
ggml/src/ggml-cpu/ggml-cpu-impl.h
ggml/src/ggml-cpu/ggml-cpu.c
ggml/src/ggml-cpu/ggml-cpu.cpp
ggml/src/ggml-cpu/ops.cpp