]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: mul_mat_v support for batch sizes > 1 (llama/14262)
authorJohannes Gäßler <redacted>
Mon, 23 Jun 2025 11:11:31 +0000 (13:11 +0200)
committerGeorgi Gerganov <redacted>
Tue, 1 Jul 2025 08:52:14 +0000 (11:52 +0300)
commit472b63351a45eeb1213c44a57f6bf4e31f185a85
treedffe13db14b4d1af93c651111a6f4fb0c703a8b6
parentfd949ca2d82a970d0024fceb5df76ec21839b059
CUDA: mul_mat_v support for batch sizes > 1 (llama/14262)

* CUDA: mul_mat_v support for batch sizes > 1

* use 64 bit math for initial offset calculation
src/ggml-cuda/common.cuh
src/ggml-cuda/ggml-cuda.cu
src/ggml-cuda/mmv.cu
src/ggml-cuda/mmv.cuh