git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: mul_mat_vec_q max. batch size 8 -> 4 (llama/5370)
author Johannes Gäßler <redacted>
Tue, 6 Feb 2024 17:43:06 +0000 (18:43 +0100)
committer Georgi Gerganov <redacted>
Sat, 10 Feb 2024 07:30:58 +0000 (09:30 +0200)
commit 14541b44824336eb7867731a5d6ae3ccd235ca3b
tree a4f5bd0bff249b6dc6b54f7b21e638c989d052f5
parent 82389b52e3a676ae0a6e7801a5fc048d76010607
src/ggml-cuda.cu