]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: mul_mat_v support for batch sizes > 1 (#14262)
authorJohannes Gäßler <redacted>
Mon, 23 Jun 2025 11:11:31 +0000 (13:11 +0200)
committerGitHub <redacted>
Mon, 23 Jun 2025 11:11:31 +0000 (13:11 +0200)
commitdefe2158dd5e250b4ef53994057a4ec03131a263
tree5b32d69d5a785794e9bc78db7c28031dc08d16fe
parent7b50d589a863c7631135c1226f6eab65cb406212
CUDA: mul_mat_v support for batch sizes > 1 (#14262)

* CUDA: mul_mat_v support for batch sizes > 1

* use 64 bit math for initial offset calculation
ggml/src/ggml-cuda/common.cuh
ggml/src/ggml-cuda/ggml-cuda.cu
ggml/src/ggml-cuda/mmv.cu
ggml/src/ggml-cuda/mmv.cuh