]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
Fix FlashAttention debug test, FP32 assert (llama/7684)
authorJohannes Gäßler <redacted>
Sat, 1 Jun 2024 21:26:10 +0000 (23:26 +0200)
committerGeorgi Gerganov <redacted>
Sat, 15 Jun 2024 19:05:47 +0000 (22:05 +0300)
commit224096fb1bdb37e175de5a33d959e1826bbddcc9
tree962d9c7647f3462697a39510bc8228e04042b0e3
parent655a5b4dddacb33a04adc978163f1799ba2b6e9c
Fix FlashAttention debug test, FP32 assert (llama/7684)
src/ggml-cuda/fattn-vec-f32.cuh
tests/test-backend-ops.cpp