git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
vulkan: optimize and reenable split_k (llama/10637)
author Jeff Bolz <redacted>
Tue, 3 Dec 2024 19:29:54 +0000 (13:29 -0600)
committer Georgi Gerganov <redacted>
Sun, 8 Dec 2024 18:14:35 +0000 (20:14 +0200)
commit 015ecd0001adff23b727c62bac60074228d57add
tree 2901542c35109a5263f2861f268b3595523f342c
parent b7c64a4352ee8cc26ad359cb2faad4d251d20049

Use vector loads when possible in mul_mat_split_k_reduce. Use split_k
when there aren't enough workgroups to fill the shaders.
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-vulkan/vulkan-shaders/mul_mat_split_k_reduce.comp
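
For readers unfamiliar with split_k, the idea in the commit message can be illustrated with a small CPU-side sketch. This is not the code from ggml-vulkan.cpp or mul_mat_split_k_reduce.comp; the function names `choose_split_k` and `matmul_split_k`, the tile parameters, and the thresholds (a minimum k of 2048 and the half/quarter occupancy cutoffs) are all hypothetical and chosen only for illustration. The sketch assumes row-major matrices: the K dimension is cut into slices, each slice writes its own partial result, and a final reduce pass sums the partials, handling four elements at a time where possible with a scalar tail, loosely mirroring the "vector loads when possible" wording.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical heuristic: split along K only when the output tiles alone
// would leave most shader cores idle. All thresholds are illustrative.
uint32_t choose_split_k(uint32_t m, uint32_t n, uint32_t k,
                        uint32_t tile_m, uint32_t tile_n, uint32_t core_count) {
    if (core_count == 0 || tile_m == 0 || tile_n == 0) {
        return 1; // no occupancy information: do not split
    }
    const uint32_t workgroups =
        ((m + tile_m - 1) / tile_m) * ((n + tile_n - 1) / tile_n);
    // Assumed cutoffs: K must be "large" and the tiles must fill
    // at most half of the cores before splitting is considered.
    if (k < 2048 || workgroups * 2 > core_count) {
        return 1;
    }
    return workgroups * 4 <= core_count ? 4 : 2;
}

// C = A * B with A (m x k) and B (k x n), both row-major.
// Each of the split_k slices computes a partial product over its own
// range of K; a separate reduce pass then sums the partial results.
std::vector<float> matmul_split_k(const std::vector<float>& a,
                                  const std::vector<float>& b,
                                  size_t m, size_t n, size_t k, size_t split_k) {
    split_k = std::max<size_t>(split_k, 1);
    const size_t mn = m * n;
    const size_t chunk = (k + split_k - 1) / split_k;
    std::vector<float> partial(split_k * mn, 0.0f);

    for (size_t s = 0; s < split_k; ++s) {
        const size_t k_begin = std::min(k, s * chunk);
        const size_t k_end = std::min(k, k_begin + chunk);
        float* out = partial.data() + s * mn;
        for (size_t i = 0; i < m; ++i) {
            for (size_t j = 0; j < n; ++j) {
                float acc = 0.0f;
                for (size_t kk = k_begin; kk < k_end; ++kk) {
                    acc += a[i * k + kk] * b[kk * n + j];
                }
                out[i * n + j] = acc;
            }
        }
    }

    // Reduce pass: groups of four elements first, then a scalar tail.
    std::vector<float> c(mn, 0.0f);
    const size_t vec_end = mn - mn % 4;
    for (size_t idx = 0; idx < vec_end; idx += 4) {
        float acc[4] = {0.0f, 0.0f, 0.0f, 0.0f};
        for (size_t s = 0; s < split_k; ++s) {
            for (size_t l = 0; l < 4; ++l) {
                acc[l] += partial[s * mn + idx + l];
            }
        }
        std::copy(acc, acc + 4, c.begin() + idx);
    }
    for (size_t idx = vec_end; idx < mn; ++idx) {
        for (size_t s = 0; s < split_k; ++s) {
            c[idx] += partial[s * mn + idx];
        }
    }
    return c;
}
```

In this sketch, a caller would first compute a split factor with `choose_split_k` and pass it to `matmul_split_k`; a factor of 1 degenerates to an ordinary matrix multiply followed by a trivial reduce. On a GPU the slices would run as separate workgroups, which is what lets a small output grid occupy more of the device.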