]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
vulkan: split mul_mat into multiple dispatches to avoid overflow (llama/19509)
authorJeff Bolz <redacted>
Wed, 18 Feb 2026 09:47:10 +0000 (01:47 -0800)
committerGeorgi Gerganov <redacted>
Fri, 27 Feb 2026 18:57:58 +0000 (20:57 +0200)
commitf1da0a26f5adefd33a5d5b88ebfc9350ec2afa67
tree93094533e0168ff5f7574900ee3662359ea3d26c
parent51ce7de94ca9508f2baf05c82f027ed393e019c5
vulkan: split mul_mat into multiple dispatches to avoid overflow (llama/19509)

* vulkan: split mul_mat into multiple dispatches to avoid overflow

The batch dimensions can be greater than the max workgroup count limit,
in which case we need to split into multiple dispatches and pass the base
index through a push constant.

Fall back for the less common p021 and nc variants.

* address feedback
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-vulkan/vulkan-shaders/mul_mat_vec_base.glsl
ggml/src/ggml-vulkan/vulkan-shaders/mul_mm.comp
ggml/src/ggml-vulkan/vulkan-shaders/mul_mm_cm2.comp