git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: fix FlashAttention on Turing (llama/13415)
author Johannes Gäßler <redacted>
Sat, 10 May 2025 07:16:52 +0000 (09:16 +0200)
committer Georgi Gerganov <redacted>
Tue, 13 May 2025 10:59:21 +0000 (13:59 +0300)
commit 16f3546f38cf4d58cd915c1066b86edac5fc4fef
tree 0cd00ac708c1ec0ef05ad57b471e5a25c8397806
parent a04b329ad172fa3c24d10cb106aa9b7fffb7e511
CUDA: fix FlashAttention on Turing (llama/13415)
ggml/src/ggml-cuda/fattn-mma-f16.cuh