git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit

overview / pkg / ggml / sources / whisper.cpp / commit

author	Johannes Gäßler <redacted>
	Fri, 14 Jun 2024 16:41:49 +0000 (18:41 +0200)
committer	Georgi Gerganov <redacted>
	Sun, 16 Jun 2024 15:19:48 +0000 (18:19 +0300)
commit	b17ba2815b210dab8c610a20377e25f8254c5d41
tree	f54f326e4905ac0e678d6741b850ad738b7a8ff2	tree
parent	7a489af2f3c9eed983f6ba301db604f7dacee709	commit \| diff

CUDA: faster q2_K, q3_K MMQ + int8 tensor cores (llama/7921)

* CUDA: faster q2_K, q3_K MMQ + int8 tensor cores

* try CI fix

* try CI fix

* try CI fix

* fix data race

* rever q2_K precision related changes

ggml-cuda.cu		diff \| blob \| history
ggml-cuda/argsort.cu		diff \| blob \| history
ggml-cuda/common.cuh		diff \| blob \| history
ggml-cuda/mmq.cuh		diff \| blob \| history
ggml-cuda/softmax.cu		diff \| blob \| history
ggml-cuda/vecdotq.cuh		diff \| blob \| history

Packaging of ggerganov/whisper.cpp

RSS Atom