git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit

overview / pkg / ggml / sources / whisper.cpp / commit

author	Aman Gupta <redacted>
	Fri, 31 Oct 2025 12:05:07 +0000 (20:05 +0800)
committer	Georgi Gerganov <redacted>
	Sun, 9 Nov 2025 21:38:03 +0000 (23:38 +0200)
commit	7d60b431a58be9a5719444f0afd3bb5e0d1c9ab6
tree	214dd2c274fa0f39c907a3fedb915aefa30c4147	tree
parent	a9ba988e561d6bff85b29640400fa3986673f22d	commit \| diff

CUDA: add expert reduce kernel (llama/16857)

* CUDA: add expert reduce kernel

* contigous checks, better formatting, use std::vector instead of array

* use vector empty instead of size

Co-authored-by: Johannes Gäßler <redacted>
---------

Co-authored-by: Johannes Gäßler <redacted>

ggml/src/ggml-cuda/ggml-cuda.cu		diff \| blob \| history
ggml/src/ggml-cuda/moe-expert-reduce.cu	[new file with mode: 0644]	blob
ggml/src/ggml-cuda/moe-expert-reduce.cuh	[new file with mode: 0644]	blob

Packaging of ggerganov/whisper.cpp

RSS Atom