]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: generalized (mma) FA, add Volta support (llama/17505)
authorJohannes Gäßler <redacted>
Wed, 3 Dec 2025 15:57:05 +0000 (16:57 +0100)
committerGeorgi Gerganov <redacted>
Thu, 11 Dec 2025 13:32:53 +0000 (15:32 +0200)
commit0371845c5902ebf97aacd65958e6785d078facf7
treed3b3a6d255f457e869ebf4d6fc4a537d50664d4d
parent46604ca1f33c48cc62c4aa52e46f808c96c29e65
CUDA: generalized (mma) FA, add Volta support (llama/17505)

* CUDA: generalized (mma) FA, add Volta support

* use struct for MMA FA kernel config

---------

Co-authored-by: Aman Gupta <aman>
include/ggml.h
src/ggml-cuda/fattn-common.cuh
src/ggml-cuda/fattn-mma-f16.cuh
src/ggml-cuda/fattn-tile.cuh
src/ggml-cuda/fattn-vec.cuh
src/ggml-cuda/fattn-wmma-f16.cu
src/ggml-cuda/fattn-wmma-f16.cuh
src/ggml-cuda/fattn.cu
src/ggml-cuda/mma.cuh
src/ggml-cuda/mmf.cuh