git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
vulkan: support softmax/FA batch and broadcast (#14449)
author Jeff Bolz <redacted>
Tue, 1 Jul 2025 08:32:56 +0000 (03:32 -0500)
committer Georgi Gerganov <redacted>
Wed, 2 Jul 2025 12:48:33 +0000 (15:48 +0300)
commit 8875523eb311cac832bfda0c581e852292185ae9
tree fdbcdb099bedf4e2620798bfa13eca567a29f566
parent ec68e84c32325a3417fbcd2e60d4bda6adb4e4bc
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn.comp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_base.comp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_cm1.comp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_cm2.comp
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_split_k_reduce.comp
ggml/src/ggml-vulkan/vulkan-shaders/soft_max.comp