git.djapps.eu Git - pkg/ggml/sources/ggml/commit
llama : add support for Nemotron 3 Super (llama/20411)
author Daniel Bevenius <redacted>
Wed, 11 Mar 2026 18:27:53 +0000 (19:27 +0100)
committer Georgi Gerganov <redacted>
Sun, 15 Mar 2026 19:50:13 +0000 (21:50 +0200)
commit d19f6992deb46f88a64f844105a92a6be5d70149
tree 1624e799db52615043d90d215f27b4f43b287948
parent 74938e5d31f51b780a676a384389c01947b82728
* llama : add support for Nemotron 3 Super

This commit adds support for the Nemotron 3 Super model (120B.A12B),
enabling it to be converted to GGUF format and run in llama.cpp.

Co-authored-by: Georgi Gerganov <redacted>
Co-authored-by: Matt Clayton <redacted>
src/ggml-metal/ggml-metal.metal