git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

author	Bartowski <redacted>
	Tue, 30 Sep 2025 20:24:36 +0000 (16:24 -0400)
committer	GitHub <redacted>
	Tue, 30 Sep 2025 20:24:36 +0000 (22:24 +0200)
commit	e74c92e84236b2bab3f3c77bee4ead94928be360
tree	ab4ce50fd16c0ebf55bdfdeabaaa3baf3a89f9d9	tree
parent	b2ba81dbe07b6dbea9c96b13346c66973dede32c	commit \| diff

model : support GLM 4.6 (make a few NextN/MTP tensors not required) (#16359)

* Make a few GLM tensors not required

layer.nextn.shared_head_head and layer.nextn.embed_tokens are both excluded from GLM 4.6 resulting in the model not loading after conversion/quantization, this marks those tensors as not required which makes it work

* Update llama-model.cpp

layer.nextn.shared_head_norm also not required in case of future models

src/llama-model.cpp

diff | blob | history

Packaging of ggml-org/llama.cpp

RSS Atom