git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	Sigbjørn Skjæret <redacted>
	Sun, 14 Sep 2025 21:00:59 +0000 (23:00 +0200)
committer	GitHub <redacted>
	Sun, 14 Sep 2025 21:00:59 +0000 (23:00 +0200)
commit	b8e09f08b9a91c0401bc67d17a17c90756420346
tree	88e907803f3fc85d8d24d682887967e4b51ac875
parent	6c019cb04e86e2dacfe62ce7666c64e9717dde1f

model : add grok-2 support (#15539)

* add grok-2 support

* type fix

* type fix

* type fix

* "fix" vocab for invalid sequences

* fix expert tensor mapping and spaces in vocab

* add chat template

* fix norm tensor mapping

* rename layer_out_norm to ffn_post_norm

* ensure ffn_post_norm is mapped

* fix experts merging

* remove erroneous FFN_GATE entry

* concatenate split tensors and add more metadata

* process all expert layers and try cat instead of hstack

* add support for community BPE vocab

* fix expert feed forward length and ffn_down concat

* commit this too

* add ffn_up/gate/down, unsure if sequence is right

* add ffn_gate/down/up to tensor names

* correct residual moe (still not working)

* mess--

* fix embedding scale being applied twice

* add built in chat template

* change beta fast for grok if default value

* remove spm vocab in favor of community bpe vocab

* change attention temp length metadata type to integer

* update attention temp length metadata

* remove comment

* replace M_SQRT2 with std::sqrt(2)

* add yarn metadata, move defaults to hparams

common/common.h
convert_hf_to_gguf.py
convert_hf_to_gguf_update.py
gguf-py/gguf/constants.py
gguf-py/gguf/gguf_writer.py
gguf-py/gguf/tensor_mapping.py
src/llama-arch.cpp
src/llama-arch.h
src/llama-chat.cpp
src/llama-chat.h
src/llama-context.cpp
src/llama-graph.cpp
src/llama-hparams.h
src/llama-model.cpp
src/llama-vocab.cpp
src/llama-vocab.h