git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	Aman Gupta <redacted>
	Thu, 26 Feb 2026 13:01:08 +0000 (21:01 +0800)
committer	GitHub <redacted>
	Thu, 26 Feb 2026 13:01:08 +0000 (21:01 +0800)
commit	b68d75165ad37ba1256cc45a43ec4f51cf813c3e
tree	3c7541bf60e9579ba1af1aa60d6961c1091b9a1a	tree
parent	ffaafde16ffebd2853467f0dd833625a726ce08e	commit \| diff

llama: Add option to merge gate and exp weights (#19139)

* llama: Add option to merge gate and exp weights

* Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret <redacted>
* Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret <redacted>
* update constants.py

* add gate_up for the all MoE models

* convert: simplify merge tensor condition

* update constants.py

* reduce number of models, add create_tensor_gate_up helper

---------

Co-authored-by: Sigbjørn Skjæret <redacted>

convert_hf_to_gguf.py		diff \| blob \| history
gguf-py/gguf/constants.py		diff \| blob \| history
gguf-py/gguf/tensor_mapping.py		diff \| blob \| history
src/llama-arch.cpp		diff \| blob \| history
src/llama-arch.h		diff \| blob \| history
src/llama-graph.cpp		diff \| blob \| history
src/llama-graph.h		diff \| blob \| history
src/llama-model.cpp		diff \| blob \| history
src/llama-model.h		diff \| blob \| history
src/models/deepseek2.cpp		diff \| blob \| history
src/models/qwen35moe.cpp		diff \| blob \| history
src/models/qwen3next.cpp		diff \| blob \| history