git.djapps.eu Git - pkg/ggml/sources/ggml/commit

]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit

overview / pkg / ggml / sources / ggml / commit

author	Aman Gupta <redacted>
	Tue, 14 Oct 2025 11:15:15 +0000 (19:15 +0800)
committer	Georgi Gerganov <redacted>
	Tue, 14 Oct 2025 19:07:44 +0000 (22:07 +0300)
commit	de71a099b784f9a3761c088b3491faeb0a6321b2
tree	9d7d153463ee63b29d9a34b7f834d2ee14277e24	tree
parent	f6a4d5889ed4e515e37a37a8c2de8c4e804675e6	commit \| diff

CUDA: add fp kernel for larger batch size MoE (llama/16512)

* CUDA: kernel for larger batch sizes for MoE

* WIP

* WIP

* WIP

* WIP

* WIP

* WIP

* fixup

* tests

* Move mmq_ids_helper to mmid

* cleanup

* Remove redundant checks

src/ggml-cuda/mmf.cu		diff \| blob \| history
src/ggml-cuda/mmf.cuh		diff \| blob \| history
src/ggml-cuda/mmid.cu	[new file with mode: 0644]	blob
src/ggml-cuda/mmid.cuh	[new file with mode: 0644]	blob
src/ggml-cuda/mmq.cu		diff \| blob \| history
tests/test-backend-ops.cpp		diff \| blob \| history

Packaging of ggml-org/ggml

RSS Atom