git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	Piotr Wilkin (ilintar) <redacted>
	Thu, 17 Jul 2025 21:15:32 +0000 (23:15 +0200)
committer	GitHub <redacted>
	Thu, 17 Jul 2025 21:15:32 +0000 (23:15 +0200)
commit	cb887f1bc1001c92f7b4a595b9014f3a454a07ab
tree	6f0bd41846d59dfc3a3f232ee692614811fe0d83
parent	d6fb3f6b49b27ef1c0f4cf5128e041f7e7dc03af

model: add Ernie 4.5 MoE support (#14658)

* Add Ernie4.5 MoE

* Fix Flake errors.

* Properly encode/decode MoE layer step

* Correct tensor mappings (.weight)

* Pass and read n_ff_exp

* n_ff_shexp calculation and further minor changes

* Rope fixes.

* .gitignore fix

* Add uint32 cast for Linux builds

* Apply suggestions from code review

Co-authored-by: Sigbjørn Skjæret <redacted>

* Further fixes from code review

* Fix trailing whitespace

* Reenable missing experts error

* Code style from code review

Co-authored-by: Sigbjørn Skjæret <redacted>

* Fix non-MoE regression

Co-authored-by: Sigbjørn Skjæret <redacted>

---------

Co-authored-by: Sigbjørn Skjæret <redacted>

convert_hf_to_gguf.py
gguf-py/gguf/constants.py
gguf-py/gguf/tensor_mapping.py
src/llama-arch.cpp
src/llama-arch.h
src/llama-model.cpp
src/llama-model.h