]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
vulkan: Use larger workgroups for mul_mat_vec when M is small (llama/15355)
authorJeff Bolz <redacted>
Sun, 17 Aug 2025 16:08:57 +0000 (11:08 -0500)
committerGeorgi Gerganov <redacted>
Mon, 18 Aug 2025 17:30:45 +0000 (20:30 +0300)
commit6835e0cf77660905eed5d22744020cdbe5299e8f
tree89a27a1eb6950ef9ec6de3e3708f155443c6b735
parentc225f25907ab35108eef748f61698f7d54f7228d
vulkan: Use larger workgroups for mul_mat_vec when M is small (llama/15355)

* vulkan: Use larger workgroups for mul_mat_vec when M is small

Also use subgroup instructions for (part of) the reduction when supported.
Without this, the more expensive reductions would eat into the benefits of
the larger workgroups.

* update heuristic for amd/intel

Co-authored-by: 0cc4m <redacted>
---------

Co-authored-by: 0cc4m <redacted>
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-vulkan/vulkan-shaders/mul_mat_vec_base.comp
ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp