git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit

overview / pkg / ggml / sources / whisper.cpp / commit

author	Oliver Simons <redacted>
	Sat, 1 Nov 2025 05:13:26 +0000 (06:13 +0100)
committer	Georgi Gerganov <redacted>
	Sun, 9 Nov 2025 21:38:03 +0000 (23:38 +0200)
commit	7d55fba06f6c85ccafe09c75980c2f5ce23a3cad
tree	a160f38d30034bda9ed5c3d5826bdbc233605f29	tree
parent	52e1bbb5542f5dafca8b811cc857751b88d76064	commit \| diff

CUDA: Remove unneded bias/gate dims in fused mmvq (llama/16858)

* CUDA: Remove unneded bias/gate dims in fused mmvq

Pointed out
[here](https://github.com/ggml-org/llama.cpp/pull/16847#discussion_r2476798989)
that only a single value is needed per target col per thread

* Apply suggestions from code review

Co-authored-by: Johannes Gäßler <redacted>
* Fix "Error 991-D: extra braces are nonstandard" during compilation

---------

Co-authored-by: Johannes Gäßler <redacted>

ggml/src/ggml-cuda/mmvq.cu

diff | blob | history

Packaging of ggerganov/whisper.cpp

RSS Atom