]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: use fastdiv + ggml_cuda_mad for mmvf (llama/16557)
authorAman Gupta <redacted>
Tue, 14 Oct 2025 11:16:21 +0000 (19:16 +0800)
committerGeorgi Gerganov <redacted>
Tue, 14 Oct 2025 19:07:44 +0000 (22:07 +0300)
commitb8774a9fc2926adf15b37b37b05cb146f2acbf6c
tree71c5b6907ddf095e24a964feef7b59c39f03d10e
parentde71a099b784f9a3761c088b3491faeb0a6321b2
CUDA: use fastdiv + ggml_cuda_mad for mmvf (llama/16557)

* CUDA: use fastdiv + ggml_cuda_mad for mmvf

* use bf16 directly + fix formatting

* Add exception for HIP code
src/ggml-cuda/mmvf.cu