]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: skip masked KV slices for all FA kernels (llama/14924)
authorJohannes Gäßler <redacted>
Wed, 30 Jul 2025 13:46:13 +0000 (15:46 +0200)
committerGeorgi Gerganov <redacted>
Mon, 18 Aug 2025 17:30:45 +0000 (20:30 +0300)
commit113d88686b7fcd56a506bd601686c43168cbb61f
tree6b4218c0e298f59be4aef24a67abd0d260f0508f
parent4e624e42faf73a7ea840abbf16f3c9b2234d3af4
CUDA: skip masked KV slices for all FA kernels (llama/14924)
ggml/src/ggml-cuda/common.cuh
ggml/src/ggml-cuda/fattn-common.cuh
ggml/src/ggml-cuda/fattn-mma-f16.cuh
ggml/src/ggml-cuda/fattn-tile-f16.cu
ggml/src/ggml-cuda/fattn-tile-f32.cu
ggml/src/ggml-cuda/fattn-vec-f16.cuh
ggml/src/ggml-cuda/fattn-vec-f32.cuh
ggml/src/ggml-cuda/fattn-wmma-f16.cu
ggml/src/ggml-cuda/fattn.cu