git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: generalized (mma) FA, add Volta support (#17505)
author Johannes Gäßler <redacted>
Wed, 3 Dec 2025 15:57:05 +0000 (16:57 +0100)
committer GitHub <redacted>
Wed, 3 Dec 2025 15:57:05 +0000 (16:57 +0100)
commit 2e1c9cd814227c576da56379d79b15d7dfd199b2
tree 4f52d881a6cb2aeb441e87eecaa3115f2895fa7e
parent 190c4838bd8bfff218a32c73dad7c9b4d5c444f1
* CUDA: generalized (mma) FA, add Volta support

* use struct for MMA FA kernel config

---------

Co-authored-by: Aman Gupta <aman>
ggml/include/ggml.h
ggml/src/ggml-cuda/fattn-common.cuh
ggml/src/ggml-cuda/fattn-mma-f16.cuh
ggml/src/ggml-cuda/fattn-tile.cuh
ggml/src/ggml-cuda/fattn-vec.cuh
ggml/src/ggml-cuda/fattn-wmma-f16.cu
ggml/src/ggml-cuda/fattn-wmma-f16.cuh
ggml/src/ggml-cuda/fattn.cu
ggml/src/ggml-cuda/mma.cuh
ggml/src/ggml-cuda/mmf.cuh