]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: limit number of FA stream-k CUDA blocks (llama/20586)
authorJohannes Gäßler <redacted>
Sun, 15 Mar 2026 17:30:47 +0000 (18:30 +0100)
committerGeorgi Gerganov <redacted>
Mon, 16 Mar 2026 11:10:15 +0000 (13:10 +0200)
commitd7926e62d40e9ac9e5e9610421564f2506f10d1a
tree69703be6e3c393ce5b70bde0fb36e918786ba933
parent2fb6aea8ad4453dba649937e914b28d194c55c53
CUDA: limit number of FA stream-k CUDA blocks (llama/20586)
ggml/src/ggml-cuda/fattn-common.cuh