]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: mul_mat_vec_q tiling, refactor mul mat logic (llama/5434)
authorJohannes Gäßler <redacted>
Sun, 11 Feb 2024 18:08:39 +0000 (19:08 +0100)
committerGeorgi Gerganov <redacted>
Mon, 12 Feb 2024 07:25:26 +0000 (09:25 +0200)
commit5f60268a839a7352757edb0cd21f7c1e2b6c8f46
tree817ee676ca90bded63ceae779396a2148fddf9a2
parentadc3925de2d3f015c07e82d0f77097e3e3795489
CUDA: mul_mat_vec_q tiling, refactor mul mat logic (llama/5434)

* CUDA: mul_mat_vec_q tiling, refactor mul mat logic

Co-authored-by: slaren <redacted>
---------

Co-authored-by: slaren <redacted>
src/ggml-cuda.cu