git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

summary | shortlog | log | commit | commitdiff | tree
(parent: 2075a66)

author	Johannes Gäßler <redacted>
	Thu, 20 Jun 2024 12:39:21 +0000 (14:39 +0200)
committer	GitHub <redacted>
	Thu, 20 Jun 2024 12:39:21 +0000 (14:39 +0200)
commit	d50f8897a797a5a03f31228d1b5a7b8130ee1bc2
tree	9ee91b29378e35ff8f7b5071308c12d429f316f0	tree
parent	2075a66a96cc1b04eabec7cf4b3051193d6f719e	commit \| diff

CUDA: stream-k decomposition for MMQ (#8018)

* CUDA: stream-k decomposition for MMQ

* fix undefined memory reads for small matrices

ggml-cuda.cu		diff \| blob \| history
ggml-cuda/common.cuh		diff \| blob \| history
ggml-cuda/mmq.cu		diff \| blob \| history
ggml-cuda/mmq.cuh		diff \| blob \| history

Packaging of ggml-org/llama.cpp