]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA/HIP: refractor mmqv to unify the calculation of nwarps and rows per block betwee...
authoruvos <redacted>
Tue, 11 Mar 2025 19:16:03 +0000 (20:16 +0100)
committerGeorgi Gerganov <redacted>
Thu, 27 Mar 2025 09:06:03 +0000 (11:06 +0200)
commit394fae57c33a5d8489dca93fe71df73d4754a8ac
tree2addde5ef672e0fcdeef5e7f42ebc223ff76ba80
parent07088353013e1d4f63e681307cdfd5555b8fd03f
CUDA/HIP: refractor mmqv to unify the calculation of nwarps and rows per block between host and device code. (llama/12177)

refactor mmqv to unify the calculation of nwarps and rows per block between host and device code.

---------

Co-authored-by: Johannes Gäßler <redacted>
ggml/src/ggml-cuda/common.cuh
ggml/src/ggml-cuda/mmvq.cu