]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: fix race conditions FlashAttention kernels (#13438)
authorJohannes Gäßler <redacted>
Sat, 10 May 2025 20:22:48 +0000 (22:22 +0200)
committerGitHub <redacted>
Sat, 10 May 2025 20:22:48 +0000 (22:22 +0200)
commit0208355f42bdab88a08507ead4a6302790a08323
tree9ab2592eb02973016e66a67f3d68022f29ec5a0a
parentd2a4ef05c60506ee48e7375eb36f2257de7ab0d2
CUDA: fix race conditions FlashAttention kernels (#13438)
ggml/src/ggml-cuda/fattn-mma-f16.cuh
ggml/src/ggml-cuda/fattn-vec-f16.cuh