git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA/HIP: refactor mmvq to unify the calculation of nwarps and rows per block betwee...
author uvos <redacted>
Tue, 11 Mar 2025 19:16:03 +0000 (20:16 +0100)
committer GitHub <redacted>
Tue, 11 Mar 2025 19:16:03 +0000 (20:16 +0100)
commit 10f2e81809bbb69ecfe64fc8b4686285f84b0c07
tree c5db03e3fe146c5b32b3d752654f508483ad809e
parent ba7654380a3c7c1b5ae154bea19134a3a9417a1e
CUDA/HIP: refactor mmvq to unify the calculation of nwarps and rows per block between host and device code. (#12177)

Refactor mmvq to unify the calculation of nwarps and rows per block between host and device code.

---------

Co-authored-by: Johannes Gäßler <redacted>
ggml/src/ggml-cuda/common.cuh
ggml/src/ggml-cuda/mmvq.cu
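
The idea in the commit message, computing nwarps and rows per block in one place that both the host launcher and the device kernel call, can be sketched roughly as follows. This is a minimal illustration, not the code from mmvq.cu: every name prefixed with `sketch_` or `SKETCH_` is hypothetical, and the numeric values are made up for the example. The macro expands to `__host__ __device__` under nvcc or hipcc and to nothing otherwise, so the same file also compiles as plain C++.

```cpp
#include <cassert>

// Expands to the CUDA/HIP execution-space qualifiers when compiled by
// nvcc/hipcc, and to nothing for a plain C++ compiler.
#if defined(__CUDACC__) || defined(__HIPCC__)
#define SKETCH_HOST_DEVICE __host__ __device__
#else
#define SKETCH_HOST_DEVICE
#endif

// Hypothetical architecture classes (assumption, not identifiers from mmvq.cu).
enum sketch_arch { SKETCH_ARCH_GENERIC, SKETCH_ARCH_SMALL_WAVE };

// Single source of truth for the number of warps per block.
// The thresholds and results are illustrative values only.
constexpr SKETCH_HOST_DEVICE int sketch_nwarps(int ncols_y, sketch_arch arch) {
    return arch == SKETCH_ARCH_GENERIC ? (ncols_y <= 4 ? 4 : 2)
                                       : (ncols_y <= 4 ? 2 : 1);
}

// Single source of truth for the number of matrix rows handled per block.
constexpr SKETCH_HOST_DEVICE int sketch_rows_per_block(int ncols_y, sketch_arch arch) {
    return arch == SKETCH_ARCH_GENERIC ? (ncols_y == 1 ? 1 : 2) : 1;
}

// Launch geometry derived on the host from the shared helpers.
struct sketch_launch_config {
    int nblocks;        // grid size: ceil(nrows_x / rows_per_block)
    int nwarps;         // warps per block
    int rows_per_block; // rows each block is responsible for
};

// Host side: builds the launch configuration. A kernel templated on
// (ncols_y, arch) would call the same two helpers to size its loops,
// so the host and device values cannot drift apart.
inline sketch_launch_config sketch_make_launch_config(int nrows_x, int ncols_y, sketch_arch arch) {
    assert(nrows_x > 0 && ncols_y > 0);
    const int rpb = sketch_rows_per_block(ncols_y, arch);
    return { (nrows_x + rpb - 1) / rpb, sketch_nwarps(ncols_y, arch), rpb };
}

// Because the helpers are constexpr, they can also be checked at compile time.
static_assert(sketch_nwarps(1, SKETCH_ARCH_GENERIC) == 4, "illustrative table value");
```

Keeping the helpers `constexpr` lets device code use the results as compile-time constants (for example as template arguments or array sizes), while the host evaluates the same functions at run time when choosing the grid and block dimensions.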