git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
vulkan: initial support for IQ1_S and IQ1_M quantizations (#11528)
author Rémy O <redacted>
Sat, 15 Feb 2025 08:01:40 +0000 (09:01 +0100)
committer GitHub <redacted>
Sat, 15 Feb 2025 08:01:40 +0000 (09:01 +0100)
commit fc1b0d0936e4dfc52a81f38e7420c7d23f6caa88
tree e9132e5fd0656146489e9d46933515bc380ba8b9
parent 89daa2564f6eab33be53c6a1b39273af536d6bb3

* vulkan: initial support for IQ1_S and IQ1_M quantizations

* vulkan: define MMV kernels for IQ1 quantizations

* devops: increase timeout of Vulkan tests again

* vulkan: simplify ifdef for init_iq_shmem

17 files changed:
.github/workflows/build.yml
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-vulkan/vulkan-shaders/copy_from_quant.comp
ggml/src/ggml-vulkan/vulkan-shaders/copy_to_quant.comp
ggml/src/ggml-vulkan/vulkan-shaders/dequant_funcs.comp
ggml/src/ggml-vulkan/vulkan-shaders/dequant_funcs_cm2.comp
ggml/src/ggml-vulkan/vulkan-shaders/dequant_iq1_m.comp [new file with mode: 0644]
ggml/src/ggml-vulkan/vulkan-shaders/dequant_iq1_s.comp [new file with mode: 0644]
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_cm2.comp
ggml/src/ggml-vulkan/vulkan-shaders/get_rows_quant.comp
ggml/src/ggml-vulkan/vulkan-shaders/mul_mat_vec.comp
ggml/src/ggml-vulkan/vulkan-shaders/mul_mat_vec_iq1_m.comp [new file with mode: 0644]
ggml/src/ggml-vulkan/vulkan-shaders/mul_mat_vec_iq1_s.comp [new file with mode: 0644]
ggml/src/ggml-vulkan/vulkan-shaders/mul_mm.comp
ggml/src/ggml-vulkan/vulkan-shaders/mul_mm_cm2.comp
ggml/src/ggml-vulkan/vulkan-shaders/types.comp
ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp