git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	Xuan-Son Nguyen <redacted>
	Tue, 8 Jul 2025 08:24:06 +0000 (10:24 +0200)
committer	GitHub <redacted>
	Tue, 8 Jul 2025 08:24:06 +0000 (11:24 +0300)
commit	8f22dc0a53338c629c1ef8fa878d8e39bfe627c9
tree	e18533ec37dcc3ac3e254d77ca7b99b895dbb1bd
parent	53903ae6fa5f1caf187889c839cdd1ad25da4018

model : add hunyuan moe (#14425)

* model : add hunyuan moe

* tokenizer ok

* fix tensor name

* cgraph init

* chat template

* wip

* almost working

* skip embed, fix bos

* cleanup

* yarn scaling

* cleanup

* correct rope type

* failed token fix

* ntk alpha freq_base

* tokenization working

* cleanup and pr changes

* vocab_size sanity check

* ntk alpha generic

* Update convert_hf_to_gguf.py

* Apply suggestions from code review

* fix regression

* fix style

---------

Co-authored-by: kooshi <redacted>

12 files changed:

convert_hf_to_gguf.py
convert_hf_to_gguf_update.py
gguf-py/gguf/constants.py
gguf-py/gguf/tensor_mapping.py
include/llama.h
src/llama-arch.cpp
src/llama-arch.h
src/llama-chat.cpp
src/llama-chat.h
src/llama-model.cpp
src/llama-model.h
src/llama-vocab.cpp

Packaging of ggml-org/llama.cpp
