]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA/HIP: refractor mmqv to unify the calculation of nwarps and rows per block betwee...
authoruvos <redacted>
Tue, 11 Mar 2025 19:16:03 +0000 (20:16 +0100)
committerGeorgi Gerganov <redacted>
Thu, 27 Mar 2025 07:35:24 +0000 (09:35 +0200)
commitc03f1a5ef0b4c57fcc3bbb0205349de38e35f2d4
treed0335d7fb0606def23386965136e27decc6bf112
parentee038fa8004863a6a63fa09eb01c65da81cc106a
CUDA/HIP: refractor mmqv to unify the calculation of nwarps and rows per block between host and device code. (llama/12177)

refactor mmqv to unify the calculation of nwarps and rows per block between host and device code.

---------

Co-authored-by: Johannes Gäßler <redacted>
src/ggml-cuda/common.cuh
src/ggml-cuda/mmvq.cu