]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: support for weight clamp in top-k norm (llama/16702)
authorAman Gupta <redacted>
Mon, 27 Oct 2025 01:06:16 +0000 (09:06 +0800)
committerGeorgi Gerganov <redacted>
Sat, 1 Nov 2025 07:41:35 +0000 (09:41 +0200)
commit720d0fb1a5bd23c8225de3aad6f451eaeb771763
tree89601993bab00edbce781a9232969bdaa14d1ce8
parent058128a60d08fadb61bc058393328c1939062ff3
CUDA: support for weight clamp in top-k norm (llama/16702)
src/ggml-cuda/ggml-cuda.cu
src/ggml-cuda/topk-moe.cu
src/ggml-cuda/topk-moe.cuh
tests/test-backend-ops.cpp