]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: only allocate FA tmp buffer if needed (llama/18564)
authorJohannes Gäßler <redacted>
Sat, 3 Jan 2026 12:55:53 +0000 (13:55 +0100)
committerGeorgi Gerganov <redacted>
Wed, 14 Jan 2026 07:11:59 +0000 (09:11 +0200)
commit60d178cee95d935922813efeab86bcde6d914472
tree5d4c62d31a7acc5b03dad78ded3d7f934ef42099
parent304e780e5f508b702ae5b6b0bdc4f3d7d892f075
CUDA: only allocate FA tmp buffer if needed (llama/18564)
ggml/src/ggml-cuda/fattn-common.cuh