]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
model : add hunyuan moe (#14425)
authorXuan-Son Nguyen <redacted>
Tue, 8 Jul 2025 08:24:06 +0000 (10:24 +0200)
committerGitHub <redacted>
Tue, 8 Jul 2025 08:24:06 +0000 (11:24 +0300)
commit8f22dc0a53338c629c1ef8fa878d8e39bfe627c9
treee18533ec37dcc3ac3e254d77ca7b99b895dbb1bd
parent53903ae6fa5f1caf187889c839cdd1ad25da4018
model : add hunyuan moe (#14425)

* model : add hunyuan moe

* tokenizer ok

* fix tensor name

* cgraph init

* chat template

* wip

* almost working

* skip embed, fix bos

* cleanup

* yarn scaling

* cleanup

* correct rope type

* failed token fix

* ntk alpha freq_base

* tokenization working

* cleanup and pr changes

* vocab_size sanity check

* ntk alpha generic

* Update convert_hf_to_gguf.py

* Apply suggestions from code review

* fix regression

* fix style

---------

Co-authored-by: kooshi <redacted>
12 files changed:
convert_hf_to_gguf.py
convert_hf_to_gguf_update.py
gguf-py/gguf/constants.py
gguf-py/gguf/tensor_mapping.py
include/llama.h
src/llama-arch.cpp
src/llama-arch.h
src/llama-chat.cpp
src/llama-chat.h
src/llama-model.cpp
src/llama-model.h
src/llama-vocab.cpp