git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	0cc4m <redacted>
	Mon, 31 Mar 2025 12:37:01 +0000 (14:37 +0200)
committer	GitHub <redacted>
	Mon, 31 Mar 2025 12:37:01 +0000 (14:37 +0200)
commit	a8a1f3356786cbf8bcc3422e3c8737fc33b453e7
tree	d91544605f4bcacf0bbdaa138760a7ee02ade25a	tree
parent	1790e7315726ceda256ea91eaa66fc36b63f6067	commit \| diff

Vulkan: Add DP4A MMQ and Q8_1 quantization shader (#12135)

* Vulkan: Add DP4A MMQ and Q8_1 quantization shader

* Add q4_0 x q8_1 matrix matrix multiplication support

* Vulkan: Add int8 coopmat MMQ support

* Vulkan: Add q4_1, q5_0 and q5_1 quants, improve integer dot code

* Add GL_EXT_integer_dot_product check

* Remove ggml changes, fix mmq pipeline picker

* Remove ggml changes, restore Intel coopmat behaviour

* Fix glsl compile attempt when integer vec dot is not supported

* Remove redundant code, use non-saturating integer dot, enable all matmul sizes for mmq

* Remove redundant comment

* Fix integer dot check

* Fix compile issue with unsupported int dot glslc

* Update Windows build Vulkan SDK version

.github/workflows/build.yml		diff \| blob \| history
ggml/src/ggml-vulkan/CMakeLists.txt		diff \| blob \| history
ggml/src/ggml-vulkan/ggml-vulkan.cpp		diff \| blob \| history
ggml/src/ggml-vulkan/vulkan-shaders/mul_mm.comp		diff \| blob \| history
ggml/src/ggml-vulkan/vulkan-shaders/mul_mmq.comp	[new file with mode: 0644]	blob
ggml/src/ggml-vulkan/vulkan-shaders/mul_mmq_funcs.comp	[new file with mode: 0644]	blob
ggml/src/ggml-vulkan/vulkan-shaders/quantize_q8_1.comp	[new file with mode: 0644]	blob
ggml/src/ggml-vulkan/vulkan-shaders/test_integer_dot_support.comp	[new file with mode: 0644]	blob
ggml/src/ggml-vulkan/vulkan-shaders/types.comp		diff \| blob \| history
ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp		diff \| blob \| history