]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
vulkan: support copy from f32 to q4_0/q4_1/q5_0/q5_1/q8_0/iq4_nl (#11166)
authorJeff Bolz <redacted>
Thu, 16 Jan 2025 21:47:10 +0000 (15:47 -0600)
committerGitHub <redacted>
Thu, 16 Jan 2025 21:47:10 +0000 (22:47 +0100)
commitbd38ddea0181bc717de7cae66fd4323975c85656
tree6659360cf92d88bd7b2ac00e83e3cb41d4243497
parent466300fe1416de2802b710215817db28d4496f41
vulkan: support copy from f32 to q4_0/q4_1/q5_0/q5_1/q8_0/iq4_nl (#11166)

* vulkan: support copy from f32 to q4_0/q4_1/q5_0/q5_1/q8_0/iq4_nl

Shaders are based on cpy.cu.

* vulkan: support copy from q4_0/q4_1/q5_0/q5_1/q8_0/iq4_nl to f32

* ggml: copy q->f32 assumes some contiguity in the destination
ggml/src/ggml-cpu/ggml-cpu.c
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-vulkan/vulkan-shaders/copy_from_quant.comp [new file with mode: 0644]
ggml/src/ggml-vulkan/vulkan-shaders/copy_to_quant.comp [new file with mode: 0644]
ggml/src/ggml-vulkan/vulkan-shaders/generic_unary_head.comp
ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp
tests/test-backend-ops.cpp