]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
vulkan: fix SSM_CONV PP scaling with large ubatch sizes (llama/20379)
authorProgenyAlpha <redacted>
Thu, 12 Mar 2026 09:03:18 +0000 (05:03 -0400)
committerGeorgi Gerganov <redacted>
Mon, 16 Mar 2026 11:10:15 +0000 (13:10 +0200)
commit44c12c642e298672e90a970f33e57360d0a2f0ab
tree3cefb8061b9c30572b13bcc6312e534165385d96
parent7e816a99d21d0c9f0ce58a7004afe2d5c332f25a
vulkan: fix SSM_CONV PP scaling with large ubatch sizes (llama/20379)

* vulkan: optimize SSM_CONV workgroup dispatch for large ubatch

Tile tokens into 2D workgroups (32x16) to reduce workgroup launch
overhead at large ubatch sizes. Add vec4 fast path for nc=4 (common
d_conv size). Fixes PP performance degradation with ubatch > 512.

Ref: ggml-org/llama.cpp#18725

Co-Authored-By: Claude Opus 4.6 <redacted>
* vulkan: remove unused shared memory declaration in SSM_CONV

Co-Authored-By: Claude Opus 4.6 <redacted>
---------

Co-authored-by: Progeny Alpha <redacted>
Co-authored-by: Claude Opus 4.6 <redacted>
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-vulkan/vulkan-shaders/ssm_conv.comp