]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
vulkan: initial support for IQ4_XS quantization (#11501)
authorRémy O <redacted>
Thu, 6 Feb 2025 06:09:59 +0000 (07:09 +0100)
committerGitHub <redacted>
Thu, 6 Feb 2025 06:09:59 +0000 (07:09 +0100)
commit8a7e3bf17aa5a8412854787746c92a28623a8925
tree0095e3317722c3035a03441b91b1b1a72c756507
parent1b598b30581bad59e5af86c94362f9a30f261fac
vulkan: initial support for IQ4_XS quantization (#11501)
13 files changed:
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-vulkan/vulkan-shaders/copy_from_quant.comp
ggml/src/ggml-vulkan/vulkan-shaders/copy_to_quant.comp
ggml/src/ggml-vulkan/vulkan-shaders/dequant_funcs.comp
ggml/src/ggml-vulkan/vulkan-shaders/dequant_funcs_cm2.comp
ggml/src/ggml-vulkan/vulkan-shaders/dequant_iq4_xs.comp [new file with mode: 0644]
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_cm2.comp
ggml/src/ggml-vulkan/vulkan-shaders/get_rows_quant.comp
ggml/src/ggml-vulkan/vulkan-shaders/mul_mat_vec.comp
ggml/src/ggml-vulkan/vulkan-shaders/mul_mm.comp
ggml/src/ggml-vulkan/vulkan-shaders/mul_mm_cm2.comp
ggml/src/ggml-vulkan/vulkan-shaders/types.comp
ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp