git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

(parent: 8138785)

author	Aman Gupta <redacted>
	Sat, 18 Oct 2025 09:52:53 +0000 (17:52 +0800)
committer	GitHub <redacted>
	Sat, 18 Oct 2025 09:52:53 +0000 (11:52 +0200)
commit	38355c6c8e43204e11a22daa7483082c0ff01e71
tree	36bb5b69df3a88d2ae32588dd5269379ee8d807f
parent	81387858f1fbcc1acedbd308486e1016618ca8f8

CUDA: use registers instead of smem in topk-moe (#16647)

Uses the same technique as the Vulkan PR #16641. Neat trick!

ggml/src/ggml-cuda/topk-moe.cu

Packaging of ggml-org/llama.cpp