git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

author	Jeff Bolz <redacted>
	Thu, 16 Jan 2025 21:47:10 +0000 (15:47 -0600)
committer	GitHub <redacted>
	Thu, 16 Jan 2025 21:47:10 +0000 (22:47 +0100)
commit	bd38ddea0181bc717de7cae66fd4323975c85656
tree	6659360cf92d88bd7b2ac00e83e3cb41d4243497	tree
parent	466300fe1416de2802b710215817db28d4496f41	commit \| diff

vulkan: support copy from f32 to q4_0/q4_1/q5_0/q5_1/q8_0/iq4_nl (#11166)

* vulkan: support copy from f32 to q4_0/q4_1/q5_0/q5_1/q8_0/iq4_nl

Shaders are based on cpy.cu.

* vulkan: support copy from q4_0/q4_1/q5_0/q5_1/q8_0/iq4_nl to f32

* ggml: copy q->f32 assumes some contiguity in the destination

ggml/src/ggml-cpu/ggml-cpu.c		diff \| blob \| history
ggml/src/ggml-vulkan/ggml-vulkan.cpp		diff \| blob \| history
ggml/src/ggml-vulkan/vulkan-shaders/copy_from_quant.comp	[new file with mode: 0644]	blob
ggml/src/ggml-vulkan/vulkan-shaders/copy_to_quant.comp	[new file with mode: 0644]	blob
ggml/src/ggml-vulkan/vulkan-shaders/generic_unary_head.comp		diff \| blob \| history
ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp		diff \| blob \| history
tests/test-backend-ops.cpp		diff \| blob \| history

Packaging of ggml-org/llama.cpp

RSS Atom