git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: refactor topk-moe to enable more models (GLM 4.7, Nemotron etc.) (llama/19126)
author Aman Gupta <redacted>
Thu, 29 Jan 2026 02:31:28 +0000 (10:31 +0800)
committer Georgi Gerganov <redacted>
Fri, 30 Jan 2026 13:56:40 +0000 (15:56 +0200)
commit 62ba8b537fef7b96727dbd3efa0d035ada52cb7d
tree 074de1e88123992d5a46f94f9e46999d14312437
parent f0e85bb142fdfdd3cb30d964385ddebea4f84c12
ggml/src/ggml-cuda/ggml-cuda.cu
ggml/src/ggml-cuda/topk-moe.cu
ggml/src/ggml-cuda/topk-moe.cuh