]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
vulkan: Use larger loads in scalar/coopmat1 matmul (llama/15729)
authorJeff Bolz <redacted>
Sun, 7 Sep 2025 16:53:07 +0000 (11:53 -0500)
committerGeorgi Gerganov <redacted>
Sat, 20 Sep 2025 10:42:51 +0000 (13:42 +0300)
commit647e2d7de59df8c22623bef52093fa2e419c1197
tree24aa0dc568fe222399ce6d420471c00a7bb00192
parentcda7d4e5acfb1697edaf6e035baf9271f384c452
vulkan: Use larger loads in scalar/coopmat1 matmul (llama/15729)

I think glslang will translate an access like x[i][1].z to
OpAccessChain ... x, i, 1, 2
OpLoad float16_t ...

rather than loading all of x[i] in a single OpLoad. Change the
code to explicitly load the vector/matrix.
ggml/src/ggml-vulkan/vulkan-shaders/mul_mm.comp
ggml/src/ggml-vulkan/vulkan-shaders/types.comp
ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp