git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: use registers instead of smem in topk-moe (llama/16647)
author Aman Gupta <redacted>
Sat, 18 Oct 2025 09:52:53 +0000 (17:52 +0800)
committer Georgi Gerganov <redacted>
Tue, 21 Oct 2025 15:14:33 +0000 (18:14 +0300)
commit 1fe7675fa66ca8c6930f93b54c3b40c8d964276b
tree c47b5016bccd8a586cc4d7160c8e6fe2b8465aa9
parent 90fac5e0309f3d88890175a1283b64f4042b852b

Uses the same technique as the Vulkan PR #16641. Neat trick!
src/ggml-cuda/topk-moe.cu