git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: limit number of FA stream-k CUDA blocks (#20586)
author Johannes Gäßler <redacted>
Sun, 15 Mar 2026 17:30:47 +0000 (18:30 +0100)
committer GitHub <redacted>
Sun, 15 Mar 2026 17:30:47 +0000 (18:30 +0100)
commit ae40cd27c85aa30b9cd56033da1d6a954290f7ea
tree 30d47911701111b54010262b94d7a194bee3e67c
parent ceef6b5233c3b31f454632c48fb42af16944bc5b
ggml/src/ggml-cuda/fattn-common.cuh