git.djapps.eu Git - pkg/ggml/sources/ggml/commit

]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit

overview / pkg / ggml / sources / ggml / commit

author	Jeff Bolz <redacted>
	Sat, 18 Oct 2025 10:22:57 +0000 (05:22 -0500)
committer	Georgi Gerganov <redacted>
	Tue, 21 Oct 2025 15:14:33 +0000 (18:14 +0300)
commit	c48d0af1d915f980f03aef2fbb8e8ee569b50756
tree	5e3bfb5f4091f69b0ff7e5052a7bedbaeae1e993	tree
parent	1fe7675fa66ca8c6930f93b54c3b40c8d964276b	commit \| diff

vulkan: Implement topk_moe fused shader, ported from CUDA (llama/16641)

This is similar to the CUDA shader from #16130, but doesn't use shared memory
and handles different subgroup sizes.

src/ggml-impl.h		diff \| blob \| history
src/ggml-vulkan/ggml-vulkan.cpp		diff \| blob \| history
src/ggml-vulkan/vulkan-shaders/topk_moe.comp	[new file with mode: 0644]	blob
src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp		diff \| blob \| history

Packaging of ggml-org/ggml

RSS Atom