]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: fix race conditions FlashAttention kernels (llama/13438)
authorJohannes Gäßler <redacted>
Sat, 10 May 2025 20:22:48 +0000 (22:22 +0200)
committerGeorgi Gerganov <redacted>
Tue, 13 May 2025 10:02:19 +0000 (13:02 +0300)
commit38648430fce1422694f2f349a5fe60d5969d6f49
tree174126d040f1fda95211f5699c12fe207d275590
parent637981b2afd5c0d23eddc14799b314692c839453
CUDA: fix race conditions FlashAttention kernels (llama/13438)
src/ggml-cuda/fattn-mma-f16.cuh
src/ggml-cuda/fattn-vec-f16.cuh