git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: fix Volta FlashAttention logic (#11615)
author Johannes Gäßler <redacted>
Mon, 3 Feb 2025 12:25:56 +0000 (13:25 +0100)
committer GitHub <redacted>
Mon, 3 Feb 2025 12:25:56 +0000 (14:25 +0200)
commit 21c84b5d2dc04050714567501bf78762bfa17846
tree a244136aed1bdb8e2bad850f0832358258bc597d
parent d92cb67e37abc23b1c6f7b0ef27a9889da8537e3
ggml/src/ggml-cuda/fattn-wmma-f16.cu
ggml/src/ggml-cuda/fattn.cu