]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: no FP16 arithmetic for vector FA kernel (#17558)
authorJohannes Gäßler <redacted>
Fri, 28 Nov 2025 09:29:09 +0000 (10:29 +0100)
committerGitHub <redacted>
Fri, 28 Nov 2025 09:29:09 +0000 (10:29 +0100)
commit73955f7d2a3ce1f36d7ecc14495e08957b51d113
tree82891a54e661aa730a310758e80e5d2302fcdc6d
parent35cf8887e119eb9b9f090349129e1e71a9eb608b
CUDA: no FP16 arithmetic for vector FA kernel (#17558)
ggml/src/ggml-cuda/common.cuh
ggml/src/ggml-cuda/fattn-common.cuh
ggml/src/ggml-cuda/fattn-vec.cuh