]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
vulkan: matmul dequantization improvements (#12015)
authorEve <redacted>
Fri, 28 Feb 2025 07:20:08 +0000 (07:20 +0000)
committerGitHub <redacted>
Fri, 28 Feb 2025 07:20:08 +0000 (08:20 +0100)
commitfbeda9002d4b8b79a4f9288a7ff0d34ef4fb23d5
tree1d01f4a47b41a544aa1be13fca86ec80d665f4f6
parent581650b7cacec2872982fde381bd3bcda0f78699
vulkan: matmul dequantization improvements (#12015)

* faster dequant for old quants

* dont use unpack for iq4_nl

* vec2 unpack for q8
ggml/src/ggml-vulkan/vulkan-shaders/dequant_funcs.comp
ggml/src/ggml-vulkan/vulkan-shaders/dequant_funcs_cm2.comp
ggml/src/ggml-vulkan/vulkan-shaders/mul_mm.comp
ggml/src/ggml-vulkan/vulkan-shaders/types.comp
ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp