git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: faster q2_K, q3_K MMQ + int8 tensor cores (#7921)
author Johannes Gäßler <redacted>
Fri, 14 Jun 2024 16:41:49 +0000 (18:41 +0200)
committer GitHub <redacted>
Fri, 14 Jun 2024 16:41:49 +0000 (18:41 +0200)
commit 76d66ee0be91e2bec93206e821ee1db8d023cff5
tree 9bf121667539f91b90b54b237e54bdbd9a16161c
parent 66ef1ceedf983773c8ceb4d925285d41d4e50e2a
CUDA: faster q2_K, q3_K MMQ + int8 tensor cores (#7921)

* CUDA: faster q2_K, q3_K MMQ + int8 tensor cores

* try CI fix

* try CI fix

* try CI fix

* fix data race

* revert q2_K precision-related changes
ggml-cuda.cu
ggml-cuda/argsort.cu
ggml-cuda/common.cuh
ggml-cuda/mmq.cuh
ggml-cuda/softmax.cu
ggml-cuda/vecdotq.cuh
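
The files above include ggml-cuda/mmq.cuh and ggml-cuda/common.cuh, where the MMQ (quantized matrix multiplication) kernels gain an int8 tensor core path. The code below is not from this commit; it is a minimal standalone sketch of how an int8 tensor core MMA is issued from CUDA inline PTX, which is the kind of instruction the title refers to. The m16n8k16 shape, the sm_80 guard, and all names here are illustrative assumptions, not taken from the llama.cpp sources.

// Minimal sketch (not the commit's code): one int8 tensor core MMA via inline PTX.
// Assumes an Ampere-or-newer GPU; compile with e.g. `nvcc -arch=sm_80 mma_demo.cu`.
#include <cstdint>
#include <cstdio>

// One warp computes D(16x8, s32) += A(16x16, s8) * B(16x8, s8).
// Per-thread fragment sizes for this PTX shape: A = 2x b32, B = 1x b32, C/D = 4x s32.
__device__ void mma_s8_m16n8k16(int32_t d[4], const uint32_t a[2], const uint32_t b[1]) {
#if __CUDA_ARCH__ >= 800
    asm volatile(
        "mma.sync.aligned.m16n8k16.row.col.s32.s8.s8.s32 "
        "{%0, %1, %2, %3}, {%4, %5}, {%6}, {%0, %1, %2, %3};"
        : "+r"(d[0]), "+r"(d[1]), "+r"(d[2]), "+r"(d[3])
        : "r"(a[0]), "r"(a[1]), "r"(b[0]));
#endif
}

__global__ void mma_demo(int32_t * out) {
    // All-ones int8 fragments: every output element equals the inner dimension
    // K = 16, independent of the per-thread fragment-to-element mapping.
    const uint32_t a[2] = {0x01010101u, 0x01010101u};
    const uint32_t b[1] = {0x01010101u};
    int32_t d[4] = {0, 0, 0, 0};

    mma_s8_m16n8k16(d, a, b);

    for (int i = 0; i < 4; ++i) {
        out[4*threadIdx.x + i] = d[i];
    }
}

int main() {
    int32_t * out = nullptr;
    cudaMallocManaged(&out, 32*4*sizeof(int32_t));
    mma_demo<<<1, 32>>>(out);
    cudaDeviceSynchronize();
    printf("d[0] of lane 0 = %d (expected 16 on sm_80+)\n", out[0]);
    cudaFree(out);
    return 0;
}

In the real kernels the A and B fragments are filled from the quantized q2_K/q3_K blocks after converting them to int8, and the int32 accumulators are rescaled with the block scales afterwards; the sketch only shows the instruction itself.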