]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: mul_mat_vec_q tiling, refactor mul mat logic (#5434)
authorJohannes Gäßler <redacted>
Sun, 11 Feb 2024 18:08:39 +0000 (19:08 +0100)
committerGitHub <redacted>
Sun, 11 Feb 2024 18:08:39 +0000 (19:08 +0100)
commit3bdc4cd0f595a6096cca4a64aa75ffa8a3503465
tree0f8301f3e190119cf2be81f73e42bb3f6435dc5b
parent2891c8aa9af17f4ff636ff3868bc34ff72b56e25
CUDA: mul_mat_vec_q tiling, refactor mul mat logic (#5434)

* CUDA: mul_mat_vec_q tiling, refactor mul mat logic

Co-authored-by: slaren <redacted>
---------

Co-authored-by: slaren <redacted>
ggml-cuda.cu