]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
vulkan: support GET_ROWS for k-quants (#16235)
authorJeff Bolz <redacted>
Sat, 27 Sep 2025 10:36:11 +0000 (06:36 -0400)
committerGitHub <redacted>
Sat, 27 Sep 2025 10:36:11 +0000 (12:36 +0200)
commit3f81b4e91c1d5f098148af117e3f13cf4b077f52
tree988e2da42897c638628025d6edbc30f7c1c21ee9
parentace6a54565444b6377bee8e7ac693238e7766279
vulkan: support GET_ROWS for k-quants (#16235)

The dequantize functions are copy/pasted from mul_mm_funcs.comp with very few
changes - add a_offset and divide iqs by 2. It's probably possible to call
these functions from mul_mm_funcs and avoid the duplication, but I didn't go
that far in this change.
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-vulkan/vulkan-shaders/dequant_funcs.comp
ggml/src/ggml-vulkan/vulkan-shaders/types.comp
ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp