git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
vulkan: support mixed/deepseekR1 FA head sizes (#14509)
author Jeff Bolz <redacted>
Thu, 3 Jul 2025 18:21:14 +0000 (13:21 -0500)
committer GitHub <redacted>
Thu, 3 Jul 2025 18:21:14 +0000 (20:21 +0200)
commit 2b72bedec198a90bb5b0cceaf1d0aff9e34ffbc2
tree 71fc6976911ec97eced27a96722763b49a81e694
parent c8c4495b8d3a8799e2d46778f993965b0ac1ae43

* vulkan: better parameterize FA by head sizes

* vulkan: support mixed/deepseekR1 FA head sizes
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn.comp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_base.comp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_cm1.comp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_cm2.comp