git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: skip masked KV slices for all FA kernels (#14924)
author Johannes Gäßler <redacted>
Wed, 30 Jul 2025 13:46:13 +0000 (15:46 +0200)
committer GitHub <redacted>
Wed, 30 Jul 2025 13:46:13 +0000 (15:46 +0200)
commit 92b8810ec7aa6d778bc287cc918443cf67b962e2
tree 43fd8bc15d0b9fdc823fb1357611124fec4d9f4a
parent 00131d6eaf4df029e1ec84de868c2c5957503007
ggml/src/ggml-cuda/common.cuh
ggml/src/ggml-cuda/fattn-common.cuh
ggml/src/ggml-cuda/fattn-mma-f16.cuh
ggml/src/ggml-cuda/fattn-tile-f16.cu
ggml/src/ggml-cuda/fattn-tile-f32.cu
ggml/src/ggml-cuda/fattn-vec-f16.cuh
ggml/src/ggml-cuda/fattn-vec-f32.cuh
ggml/src/ggml-cuda/fattn-wmma-f16.cu
ggml/src/ggml-cuda/fattn.cu