]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
vulkan: support GET_ROWS for k-quants (llama/16235)
authorJeff Bolz <redacted>
Sat, 27 Sep 2025 10:36:11 +0000 (06:36 -0400)
committerGeorgi Gerganov <redacted>
Mon, 29 Sep 2025 09:41:09 +0000 (12:41 +0300)
commit44dfaa8b3ffef7de000d7efcf75d60e42231e298
treeb48b79a0b3671e2b6a64fd4e10d73649c2f3a32b
parentf0a7623964480fcca5da78907f84c13df779824f
vulkan: support GET_ROWS for k-quants (llama/16235)

The dequantize functions are copy/pasted from mul_mm_funcs.comp with very few
changes - add a_offset and divide iqs by 2. It's probably possible to call
these functions from mul_mm_funcs and avoid the duplication, but I didn't go
that far in this change.
src/ggml-vulkan/ggml-vulkan.cpp
src/ggml-vulkan/vulkan-shaders/dequant_funcs.comp
src/ggml-vulkan/vulkan-shaders/types.comp
src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp