]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: fix overflow in FA, tune performance (llama/14840)
authorJohannes Gäßler <redacted>
Wed, 23 Jul 2025 19:43:25 +0000 (21:43 +0200)
committerGeorgi Gerganov <redacted>
Thu, 24 Jul 2025 17:57:40 +0000 (20:57 +0300)
commitd589d2d203a93f70302a044edcf11fde2eb93547
tree15702a6d85336777d1e1729f7d9edfbad50bacbd
parente64ed53d574ba802fcc6e0976852bf8c4f8c1495
CUDA: fix overflow in FA, tune performance (llama/14840)
src/ggml-cuda/fattn-common.cuh
src/ggml-cuda/fattn-mma-f16.cuh
src/ggml-cuda/fattn-tile-f16.cu
src/ggml-cuda/fattn-tile-f32.cu
src/ggml-cuda/fattn-vec-f16.cuh
src/ggml-cuda/fattn-vec-f32.cuh
src/ggml-cuda/fattn-wmma-f16.cu
src/ggml-cuda/fattn.cu