git.djapps.eu Git - pkg/ggml/sources/ggml/commit

]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit

overview / pkg / ggml / sources / ggml / commit

author	Aman Gupta <redacted>
	Fri, 31 Oct 2025 12:05:07 +0000 (20:05 +0800)
committer	Georgi Gerganov <redacted>
	Sat, 1 Nov 2025 07:41:35 +0000 (09:41 +0200)
commit	82b8e2697840fb3b483bcd47da22efaae79c226c
tree	bf7e0194367c9ccebe409f394fab835a595c5ca5	tree
parent	a4ec1c544de634cc6457cb77bcfabaff068a16d8	commit \| diff

CUDA: add expert reduce kernel (llama/16857)

* CUDA: add expert reduce kernel

* contigous checks, better formatting, use std::vector instead of array

* use vector empty instead of size

Co-authored-by: Johannes Gäßler <redacted>
---------

Co-authored-by: Johannes Gäßler <redacted>

src/ggml-cuda/ggml-cuda.cu		diff \| blob \| history
src/ggml-cuda/moe-expert-reduce.cu	[new file with mode: 0644]	blob
src/ggml-cuda/moe-expert-reduce.cuh	[new file with mode: 0644]	blob
tests/test-backend-ops.cpp		diff \| blob \| history

Packaging of ggml-org/ggml

RSS Atom