]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: enable FA for FP32 KV cache (#16546)
authorJohannes Gäßler <redacted>
Tue, 14 Oct 2025 12:22:47 +0000 (14:22 +0200)
committerGitHub <redacted>
Tue, 14 Oct 2025 12:22:47 +0000 (14:22 +0200)
commit9c7185dd28416cf67f5e3b268381f311b5e3da56
tree9b17d81011db6b30734ab854d4d1f5e7b61f49b5
parent1ee9d0b415cdf5240418c110a18b419f4002b154
CUDA: enable FA for FP32 KV cache (#16546)
ggml/src/ggml-cuda/fattn-vec.cuh
ggml/src/ggml-cuda/fattn.cu