]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
vulkan: optimize mul_mat_id loading row ids into shared memory (llama/15427)
authorJeff Bolz <redacted>
Sat, 23 Aug 2025 06:31:54 +0000 (01:31 -0500)
committerGeorgi Gerganov <redacted>
Sat, 20 Sep 2025 10:42:40 +0000 (13:42 +0300)
commit485c5c3b3b18048f6dde5e3f6797dfbc4e53cd98
tree791aa62d83b77bdeee7ff92ae14890716e5678f6
parentbb5d7e2c3137c43e023e15b93d910ffd448b54e6
vulkan: optimize mul_mat_id loading row ids into shared memory (llama/15427)

- Spread the work across the whole workgroup. Using more threads seems to
far outweigh the synchronization overhead.
- Specialize the code for when the division is by a power of two.
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-vulkan/vulkan-shaders/mul_mm.comp
ggml/src/ggml-vulkan/vulkan-shaders/mul_mm_cm2.comp