]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
vulkan: preprocess mul_mat_id experts and discard workgroups more quickly (llama...
authorJeff Bolz <redacted>
Fri, 26 Dec 2025 22:12:58 +0000 (16:12 -0600)
committerGeorgi Gerganov <redacted>
Wed, 31 Dec 2025 15:52:09 +0000 (17:52 +0200)
commit015b618d966a862e217a46b939d794e9fd8bc549
treea6994de0d1b97f8b8e78b91db44da1989a0cb799
parente37c8ed94ed0ef583de68c41e8d7cca764867093
vulkan: preprocess mul_mat_id experts and discard workgroups more quickly (llama/18352)

Run a preprocess to count how many times each expert is used, and use this to
quickly discard workgroups that aren't needed.
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-vulkan/vulkan-shaders/count_experts.comp [new file with mode: 0644]
ggml/src/ggml-vulkan/vulkan-shaders/mul_mm.comp
ggml/src/ggml-vulkan/vulkan-shaders/mul_mm_cm2.comp
ggml/src/ggml-vulkan/vulkan-shaders/mul_mm_id_funcs.glsl
ggml/src/ggml-vulkan/vulkan-shaders/mul_mmq.comp
ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp