]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
vulkan: initial support for IQ4_XS quantization (llama/11501)
authorRémy O <redacted>
Thu, 6 Feb 2025 06:09:59 +0000 (07:09 +0100)
committerGeorgi Gerganov <redacted>
Wed, 12 Feb 2025 20:00:20 +0000 (22:00 +0200)
commit161ed74e28b0b3105384708d42e760a5244a57ab
tree0b0624e0cead77dac805d77d19440a76f5292aca
parent24a030d3bab3f57916663d9daaa987f3a2f0bfcc
vulkan: initial support for IQ4_XS quantization (llama/11501)
13 files changed:
src/ggml-vulkan/ggml-vulkan.cpp
src/ggml-vulkan/vulkan-shaders/copy_from_quant.comp
src/ggml-vulkan/vulkan-shaders/copy_to_quant.comp
src/ggml-vulkan/vulkan-shaders/dequant_funcs.comp
src/ggml-vulkan/vulkan-shaders/dequant_funcs_cm2.comp
src/ggml-vulkan/vulkan-shaders/dequant_iq4_xs.comp [new file with mode: 0644]
src/ggml-vulkan/vulkan-shaders/flash_attn_cm2.comp
src/ggml-vulkan/vulkan-shaders/get_rows_quant.comp
src/ggml-vulkan/vulkan-shaders/mul_mat_vec.comp
src/ggml-vulkan/vulkan-shaders/mul_mm.comp
src/ggml-vulkan/vulkan-shaders/mul_mm_cm2.comp
src/ggml-vulkan/vulkan-shaders/types.comp
src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp