]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: fix overflow in FA, tune performance (llama/14840)
authorJohannes Gäßler <redacted>
Wed, 23 Jul 2025 19:43:25 +0000 (21:43 +0200)
committerGeorgi Gerganov <redacted>
Mon, 28 Jul 2025 10:02:32 +0000 (13:02 +0300)
commit95efcf011d298032c12f70c99b1f232d0b3696fb
treed97a0ddb43dfb8baa0d92c07dd5628bee9840ec4
parent8272aa9f14c4bcd9762d9c202f6e3eb21d128bcd
CUDA: fix overflow in FA, tune performance (llama/14840)
ggml/src/ggml-cuda/fattn-common.cuh
ggml/src/ggml-cuda/fattn-mma-f16.cuh
ggml/src/ggml-cuda/fattn-tile-f16.cu
ggml/src/ggml-cuda/fattn-tile-f32.cu
ggml/src/ggml-cuda/fattn-vec-f16.cuh
ggml/src/ggml-cuda/fattn-vec-f32.cuh
ggml/src/ggml-cuda/fattn-wmma-f16.cu
ggml/src/ggml-cuda/fattn.cu