git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

author	fairydreaming <redacted>
	Sat, 4 Jan 2025 20:06:11 +0000 (21:06 +0100)
committer	GitHub <redacted>
	Sat, 4 Jan 2025 20:06:11 +0000 (21:06 +0100)
commit	9394bbd484f802ce80d2858033583af3ef700d25
tree	a4bcd1da4d11d3556d7f369f0d864d731445d55d	tree
parent	f922a9c542ee117550a168395c63ea79261f5c99	commit \| diff

llama : Add support for DeepSeek V3 (#11049)

* convert : extend DEEPSEEK2 model architecture to support DeepseekV3ForCausalLM by adding EXPERT_WEIGHTS_NORM and EXPERT_GATING_FUNC model parameters and FFN_EXP_PROBS_B tensor type

* vocab : add DeepSeek V3 pre-tokenizer regexes

* unicode : handle ACCENT_MARK and SYMBOL categories in regex

* llama : add DeepSeek V3 chat template, handle new model parameters and tensor types

---------

Co-authored-by: Stanisław Szymczyk <redacted>

16 files changed:

convert_hf_to_gguf.py		diff \| blob \| history
convert_hf_to_gguf_update.py		diff \| blob \| history
gguf-py/gguf/constants.py		diff \| blob \| history
gguf-py/gguf/gguf_writer.py		diff \| blob \| history
gguf-py/gguf/tensor_mapping.py		diff \| blob \| history
include/llama.h		diff \| blob \| history
src/llama-arch.cpp		diff \| blob \| history
src/llama-arch.h		diff \| blob \| history
src/llama-chat.cpp		diff \| blob \| history
src/llama-chat.h		diff \| blob \| history
src/llama-hparams.h		diff \| blob \| history
src/llama-model.cpp		diff \| blob \| history
src/llama-model.h		diff \| blob \| history
src/llama-vocab.cpp		diff \| blob \| history
src/llama.cpp		diff \| blob \| history
src/unicode.cpp		diff \| blob \| history

Packaging of ggml-org/llama.cpp

RSS Atom