]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: re-use MLA K data for V in MMA FA (llama/19057)
authorJohannes Gäßler <redacted>
Sat, 24 Jan 2026 09:09:36 +0000 (10:09 +0100)
committerGeorgi Gerganov <redacted>
Fri, 30 Jan 2026 13:56:40 +0000 (15:56 +0200)
commitf53eafd74557792e68719a75cd2cd1b205862f88
tree0b80163b154c7ee7b8074f1f80bfc1fc3c735625
parent13577a6ce4496aa3857dc6c878a4029c05ed7e69
CUDA: re-use MLA K data for V in MMA FA (llama/19057)
ggml/src/ggml-cuda/fattn-common.cuh
ggml/src/ggml-cuda/fattn-mma-f16.cuh
ggml/src/ggml-cuda/fattn.cu