]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
vulkan: coopmat2 mul_mat optimizations (llama/14934)
authorJeff Bolz <redacted>
Sat, 2 Aug 2025 09:21:37 +0000 (04:21 -0500)
committerGeorgi Gerganov <redacted>
Sat, 2 Aug 2025 14:51:21 +0000 (17:51 +0300)
commitb3ff3e45e105bd103185c0eb4fa5d7e3160da108
treeb34738d5d1cb98a1515da243d43b6bd3fd3cc5ec
parent14e2ca624a72b569ac07a5858a1e98518ca164a7
vulkan: coopmat2 mul_mat optimizations (llama/14934)

- Increase tile size for k-quants, to match non-k-quants
- Choose more carefully between large and medium tiles, considering how it
  interacts with split_k
- Allow larger/non-power of two split_k, and make the splits a multiple of 256
- Use split_k==3 to when >1/2 and <=2/3 of the SMs would hae been used
src/ggml-vulkan/ggml-vulkan.cpp