git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
llama : add support for Nemotron 3 Super (#20411)
author Daniel Bevenius <redacted>
Wed, 11 Mar 2026 18:27:53 +0000 (19:27 +0100)
committer GitHub <redacted>
Wed, 11 Mar 2026 18:27:53 +0000 (19:27 +0100)
commit eaf1d7930c6857f304f8617d588d1a979efccb40
tree 19818b8a239eae8e85b55e48b6905570a6ca2889
parent 76ea1c1c46be0e824ca76545b0d75a15e0e275d4

* llama : add support for Nemotron 3 Super

This commit adds support for the Nemotron 3 Super model (120B.A12B),
enabling it to be converted to GGUF format and run in llama.cpp.

Co-authored-by: Georgi Gerganov <redacted>
Co-authored-by: Matt Clayton <redacted>
convert_hf_to_gguf.py
ggml/src/ggml-metal/ggml-metal.metal
gguf-py/gguf/constants.py
gguf-py/gguf/gguf_writer.py
gguf-py/gguf/tensor_mapping.py
src/llama-arch.cpp
src/llama-arch.h
src/llama-hparams.h
src/llama-model.cpp
src/llama-model.h
src/models/nemotron-h.cpp