git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	Piotr Wilkin (ilintar) <redacted>
	Sun, 8 Feb 2026 23:24:08 +0000 (00:24 +0100)
committer	GitHub <redacted>
	Sun, 8 Feb 2026 23:24:08 +0000 (00:24 +0100)
commit	39bf692af1cba2a1072e4a42425611bf1ec2807d
tree	cea37ca0bdfd5efc0dd44084f942ca845b2f663c
parent	e06088da0fa86aa444409f38dff274904931c507

[Model] Qwen3.5 dense and MoE support (no vision) (#19435)

* Unified delta net handling

* Remove old methods.

* Refactor and optimize

* Adapt autoregressive version from @ymcki

* Change to decay mask approach

* Fix bad permute

* Qwen 3.5 support

* Apply suggestions from code review

Co-authored-by: Sigbjørn Skjæret <redacted>

* Further fixes

* Use inheritance, remove unneeded conts

* Not like this!

* Remove ggml.h explicit import

* Remove transformers, fix the views

* ACTUALLY fix views, make super calls explicit in conversion.

* Fix conversion again

* Remove extra ggml.h imports

---------

Co-authored-by: Sigbjørn Skjæret <redacted>

convert_hf_to_gguf.py
gguf-py/gguf/constants.py
gguf-py/gguf/tensor_mapping.py
src/CMakeLists.txt
src/llama-arch.cpp
src/llama-arch.h
src/llama-context.cpp
src/llama-model.cpp
src/models/delta.cpp	[new file with mode: 0644]
src/models/kimi-linear.cpp
src/models/models.h
src/models/qwen3-5.cpp	[new file with mode: 0644]
src/models/qwen3-5moe.cpp	[new file with mode: 0644]
src/models/qwen3next.cpp