]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16 (#15131)
authorJohannes Gäßler <redacted>
Thu, 7 Aug 2025 08:53:21 +0000 (10:53 +0200)
committerGitHub <redacted>
Thu, 7 Aug 2025 08:53:21 +0000 (10:53 +0200)
commit1d72c841888b9450916bdd5a9b3274da380f5b36
treea3d99579153dd832a8979b1604150983c1f2431f
parent20638e4f16fcc21f836c7556e83bbf532bb5a0f0
CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16 (#15131)

* CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16
15 files changed:
ggml/src/ggml-cuda/common.cuh
ggml/src/ggml-cuda/fattn-mma-f16.cuh
ggml/src/ggml-cuda/fattn.cu
ggml/src/ggml-cuda/ggml-cuda.cu
ggml/src/ggml-cuda/mma.cuh
ggml/src/ggml-cuda/mmf.cu [new file with mode: 0644]
ggml/src/ggml-cuda/mmf.cuh [new file with mode: 0644]
ggml/src/ggml-cuda/mmq.cu
ggml/src/ggml-cuda/mmq.cuh
ggml/src/ggml-cuda/mmv.cu [deleted file]
ggml/src/ggml-cuda/mmv.cuh [deleted file]
ggml/src/ggml-cuda/mmvf.cu [new file with mode: 0644]
ggml/src/ggml-cuda/mmvf.cuh [new file with mode: 0644]
ggml/src/ggml-cuda/vendors/hip.h
ggml/src/ggml-cuda/vendors/musa.h