]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
model: add Ernie 4.5 MoE support (#14658)
authorPiotr Wilkin (ilintar) <redacted>
Thu, 17 Jul 2025 21:15:32 +0000 (23:15 +0200)
committerGitHub <redacted>
Thu, 17 Jul 2025 21:15:32 +0000 (23:15 +0200)
commitcb887f1bc1001c92f7b4a595b9014f3a454a07ab
tree6f0bd41846d59dfc3a3f232ee692614811fe0d83
parentd6fb3f6b49b27ef1c0f4cf5128e041f7e7dc03af
model: add Ernie 4.5 MoE support (#14658)

* Add Ernie4.5 MoE

* Fix Flake errors.

* Properly encode/decode MoE layer step

* Correct tensor mappings (.weight)

* Pass and read n_ff_exp

* n_ff_shexp calculation and further minor changes

* Rope fixes.

* .gitignore fix

* Add unit32 cast for Linux builds

* Apply suggestions from code review

Co-authored-by: Sigbjørn Skjæret <redacted>
* Further fixes from code review

* Fix trailing whitespace

* Reenable missing experts error

* Code style from code review

Co-authored-by: Sigbjørn Skjæret <redacted>
* Fix non-MoE regression

Co-authored-by: Sigbjørn Skjæret <redacted>
---------

Co-authored-by: Sigbjørn Skjæret <redacted>
convert_hf_to_gguf.py
gguf-py/gguf/constants.py
gguf-py/gguf/tensor_mapping.py
src/llama-arch.cpp
src/llama-arch.h
src/llama-model.cpp
src/llama-model.h