model: support MiMo-V2-Flash (#18328)
author	Xuan-Son Nguyen <redacted>
Wed, 24 Dec 2025 22:07:08 +0000 (23:07 +0100)
committer	GitHub <redacted>
Wed, 24 Dec 2025 22:07:08 +0000 (23:07 +0100)
commit	4cbafad4f09c82f1b32c76b714302f3937be1df7
tree	ed3e1bf72992069f85204050bd09f74c02c6bd8b
parent	c18428423018ed214c004e6ecaedb0cbdda06805
model: support MiMo-V2-Flash (#18328)

* mimov2: convert ok

* rename mimov2 --> mimo2

* fix conversion

* runnable but not yet correct

* use sink
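
The "use sink" step presumably refers to attention-sink logits: a learnable per-head logit that joins the attention softmax but carries no value vector, so a head can effectively attend "nowhere". A minimal sketch of the idea (illustrative only, not llama.cpp's actual kernel):

```python
import math

def softmax_with_sink(scores, sink):
    """Softmax over attention scores plus one extra 'sink' logit.

    The sink absorbs probability mass but contributes no value
    vector, so the returned weights sum to less than 1.
    """
    m = max(max(scores), sink)          # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps) + math.exp(sink - m)  # sink joins the denominator only
    return [e / z for e in exps]

w = softmax_with_sink([1.0, 2.0, 3.0], sink=2.0)
```

A strongly negative sink recovers plain softmax; a large sink lets the head opt out of attending to any token.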

* add_sliding_window_pattern

* add swa and per-layer n_head_kv
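
These two steps suggest an interleaved attention layout: a repeating per-layer pattern marks which layers use sliding-window attention, and the KV head count can differ between SWA and full-attention layers. A hedged sketch of how such hparams might be expanded (function names and the 3:1 interleaving are hypothetical):

```python
def expand_swa_pattern(n_layer, pattern):
    # Tile a repeating boolean template over all layers:
    # True = sliding-window attention, False = full attention.
    return [pattern[i % len(pattern)] for i in range(n_layer)]

def expand_n_head_kv(n_layer, kv_swa, kv_full, pattern):
    # Per-layer KV head count, chosen by each layer's attention kind.
    return [kv_swa if swa else kv_full
            for swa in expand_swa_pattern(n_layer, pattern)]

# Hypothetical 3 local : 1 global interleaving, for illustration only.
layout = expand_swa_pattern(8, [True, True, True, False])
heads  = expand_n_head_kv(8, 4, 8, [True, True, True, False])
```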

* correct params

* somewhat working

* correct gating func
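
The gating function is how an MoE layer scores and selects experts per token. One common scheme (a sketch under assumptions, not necessarily MiMo-V2's exact choice) is sigmoid gating with a routing bias: experts are ranked by biased score, but the final mixture weights come from the unbiased gates, renormalized over the chosen experts:

```python
import math

def moe_topk_route(logits, bias, k):
    # Score each expert independently with a sigmoid gate.
    gates = [1.0 / (1.0 + math.exp(-x)) for x in logits]
    # Rank by gate + per-expert routing bias, keep the top-k.
    ranked = sorted(range(len(gates)),
                    key=lambda i: gates[i] + bias[i], reverse=True)
    chosen = ranked[:k]
    # Mixture weights use the *unbiased* gates, normalized to sum to 1.
    total = sum(gates[i] for i in chosen)
    return [(i, gates[i] / total) for i in chosen]
```

The bias only steers which experts are picked, not how much each contributes, which is why the "MoE bias" tensor in a later step has to be wired separately from the gate itself.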

* nits

* mimo2: wire RMS eps + MoE bias + converter guards

* add co-author

Co-authored-by: Aaryan-Kapoor <redacted>
* use add_rope_freq_base_swa
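
`add_rope_freq_base_swa` points at models that use a different RoPE base frequency (theta) on sliding-window layers than on full-attention layers, as Gemma 3 does (e.g. 10k local vs 1M global). A sketch of the per-dimension inverse frequencies; the two theta values below are assumed examples, not MiMo-V2's actual config:

```python
def rope_inv_freq(head_dim, theta):
    # Rotary-embedding inverse frequencies: theta^(-2i/d) for each
    # rotated dimension pair i.
    return [theta ** (-2.0 * i / head_dim) for i in range(head_dim // 2)]

full = rope_inv_freq(128, 1_000_000.0)  # full-attention layers (assumed theta)
swa  = rope_inv_freq(128, 10_000.0)     # sliding-window layers (assumed theta)
```

A larger base stretches the rotation wavelengths, which suits long-range global layers; SWA layers can keep a smaller base since their attention window is short.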

---------

Co-authored-by: Aaryan Kapoor <redacted>
Co-authored-by: Aaryan-Kapoor <redacted>
convert_hf_to_gguf.py
gguf-py/gguf/constants.py
gguf-py/gguf/tensor_mapping.py
src/CMakeLists.txt
src/llama-arch.cpp
src/llama-arch.h
src/llama-hparams.h
src/llama-model.cpp
src/llama-model.h
src/models/mimo2-iswa.cpp [new file with mode: 0644]
src/models/models.h