]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
vulkan: Handle updated FA dim2/3 definition (llama/14518)
authorJeff Bolz <redacted>
Sat, 5 Jul 2025 07:26:04 +0000 (02:26 -0500)
committerGeorgi Gerganov <redacted>
Sat, 12 Jul 2025 16:23:56 +0000 (19:23 +0300)
commit3aebb8d5d3c7e67b21c31577d111213a1d116c99
tree67637fc4652f6160a7145ee296e1e8e98644ed7f
parentdf5af1dc75753644b63f7f22d5c230c23d8c0593
vulkan: Handle updated FA dim2/3 definition (llama/14518)

* vulkan: Handle updated FA dim2/3 definition

Pack mask boolean and n_head_log2 into a single dword to keep the push
constant block under the 128B limit.

* handle null mask for gqa

* allow gqa with dim3>1
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn.comp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_base.comp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_cm1.comp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_cm2.comp