git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: skip fully masked-out KV in FA vec kernel (#13584)
author Johannes Gäßler <redacted>
Tue, 20 May 2025 12:45:07 +0000 (14:45 +0200)
committer GitHub <redacted>
Tue, 20 May 2025 12:45:07 +0000 (14:45 +0200)
commit b69f1647f9953cb3773266d2c83a92fd0e7e6d66
tree bafba1bed6b426ea42e4d11c952af999ca28840a
parent 759e37b0d89bc4bd1bce860dc5f3c3052e08575c
CUDA: skip fully masked-out KV in FA vec kernel (#13584)

* CUDA: skip fully masked-out KV in FA vec kernel
ggml/src/ggml-cuda/fattn-vec-f16.cuh
ggml/src/ggml-cuda/fattn-vec-f32.cuh