]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
vulkan: Handle updated FA dim2/3 definition (#14518)
authorJeff Bolz <redacted>
Sat, 5 Jul 2025 07:26:04 +0000 (02:26 -0500)
committerGitHub <redacted>
Sat, 5 Jul 2025 07:26:04 +0000 (09:26 +0200)
commita0374a67e2924f2e845cdc59dd67d9a44065a89c
tree3b039a8464bc3130a6cfa01647af8339fdcdc9d4
parentddef99522d1ba74193b7394e803fab8db5c78bae
vulkan: Handle updated FA dim2/3 definition (#14518)

* vulkan: Handle updated FA dim2/3 definition

Pack mask boolean and n_head_log2 into a single dword to keep the push
constant block under the 128B limit.

* handle null mask for gqa

* allow gqa with dim3>1
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn.comp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_base.comp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_cm1.comp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_cm2.comp