git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: tune GLM 4.7 Flash FA kernel selection logic (llama/19097)
author Johannes Gäßler <redacted>
Tue, 27 Jan 2026 13:28:56 +0000 (14:28 +0100)
committer Georgi Gerganov <redacted>
Fri, 30 Jan 2026 11:49:29 +0000 (13:49 +0200)
commit 0dff0d5a9cd6ffca1b54ad17a0194e21e1e676a0
tree 4f65000da9d747a9c8e9cdf796675bda1ba32127
parent 8dd386b0d91905fad71a142a4f078cb67f87d668
src/ggml-cuda/fattn.cu