git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit

overview / pkg / ggml / sources / whisper.cpp / commit

author	Jeff Bolz <redacted>
	Sun, 17 Aug 2025 16:08:57 +0000 (11:08 -0500)
committer	Georgi Gerganov <redacted>
	Mon, 18 Aug 2025 17:30:45 +0000 (20:30 +0300)
commit	6835e0cf77660905eed5d22744020cdbe5299e8f
tree	89a27a1eb6950ef9ec6de3e3708f155443c6b735	tree
parent	c225f25907ab35108eef748f61698f7d54f7228d	commit \| diff

vulkan: Use larger workgroups for mul_mat_vec when M is small (llama/15355)

* vulkan: Use larger workgroups for mul_mat_vec when M is small

Also use subgroup instructions for (part of) the reduction when supported.
Without this, the more expensive reductions would eat into the benefits of
the larger workgroups.

* update heuristic for amd/intel

Co-authored-by: 0cc4m <redacted>
---------

Co-authored-by: 0cc4m <redacted>

ggml/src/ggml-vulkan/ggml-vulkan.cpp		diff \| blob \| history
ggml/src/ggml-vulkan/vulkan-shaders/mul_mat_vec_base.comp		diff \| blob \| history
ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp		diff \| blob \| history

Packaging of ggerganov/whisper.cpp

RSS Atom