]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
vulkan: use vec dot for matrix matrix multiplications (#16056)
authorRuben Ortlam <redacted>
Sat, 20 Sep 2025 08:42:56 +0000 (10:42 +0200)
committerGitHub <redacted>
Sat, 20 Sep 2025 08:42:56 +0000 (10:42 +0200)
commit803dac2e48ef3ba26a504eb27c4e77ec2d21f7d0
treef522f7d9a529f42a037928b9910ffd50ab21e315
parent459c0c2c1a400f960d7b8e8d94d31a8426f80986
vulkan: use vec dot for matrix matrix multiplications (#16056)

* vulkan: Change the mul_mm shared memory and register caching system to use vec2 instead of scalars, to enable using dot2 instructions

* use fma instead of dot to fix Nvidia and Apple performance issues
ggml/src/ggml-vulkan/vulkan-shaders/mul_mm.comp
ggml/src/ggml-vulkan/vulkan-shaders/mul_mm_funcs.comp
ggml/src/ggml-vulkan/vulkan-shaders/types.comp
ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp