]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
model : support GLM 4.6 (make a few NextN/MTP tensors not required) (#16359)
authorBartowski <redacted>
Tue, 30 Sep 2025 20:24:36 +0000 (16:24 -0400)
committerGitHub <redacted>
Tue, 30 Sep 2025 20:24:36 +0000 (22:24 +0200)
commite74c92e84236b2bab3f3c77bee4ead94928be360
treeab4ce50fd16c0ebf55bdfdeabaaa3baf3a89f9d9
parentb2ba81dbe07b6dbea9c96b13346c66973dede32c
model : support GLM 4.6 (make a few NextN/MTP tensors not required) (#16359)

* Make a few GLM tensors not required

layer.nextn.shared_head_head and layer.nextn.embed_tokens are both excluded from GLM 4.6 resulting in the model not loading after conversion/quantization, this marks those tensors as not required which makes it work

* Update llama-model.cpp

layer.nextn.shared_head_norm also not required in case of future models
src/llama-model.cpp