llama : validate special token ids are in range when loading GGUF model (#3635)
author    Kerfuffle <redacted>
          Sun, 22 Oct 2023 18:14:56 +0000 (12:14 -0600)
committer GitHub <redacted>
          Sun, 22 Oct 2023 18:14:56 +0000 (21:14 +0300)
commit a5e7dbd6141128bfa3c40a19c2945a181df625d3
tree   14cb15291418d4f591d7a58d8239eb02b966b595
parent d3956aea53369455008159cc405ed4c496976692

* Add validation for special token ids to llama.cpp

Small optimization for llama_byte_to_token SPM mode
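
The validation this bullet describes can be sketched as a simple range check (the function name and signature below are illustrative assumptions, not the actual llama.cpp internals): a special token id read from GGUF metadata is only usable if it indexes into the loaded vocabulary.

```python
# Hedged sketch of the range check; names are hypothetical, not llama.cpp's.
def is_valid_special_token_id(token_id, n_vocab):
    """Return True when token_id is a usable index into a vocab of size n_vocab."""
    return token_id is not None and 0 <= token_id < n_vocab

print(is_valid_special_token_id(1, 32000))      # an in-range id is accepted
print(is_valid_special_token_id(40000, 32000))  # an out-of-range id is rejected
```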

* Fix BPE newline check, only I could break something so simple

* Killll meeeeee

* Account for GGUF_KEY_KEY only setting when the key exists

* Minor code cleanups.

* Fix convert.py error msg when added tokens are out of range
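
A hypothetical illustration of the kind of check behind that error message (the range convention and wording are assumptions, not convert.py's exact code): added tokens are expected to occupy the id range immediately after the base vocabulary.

```python
# Illustrative sketch only; added token ids are assumed to follow the
# base vocabulary contiguously.
def check_added_token_range(added_ids, base_vocab_size):
    expected_end = base_vocab_size + len(added_ids)
    for token_id in added_ids:
        if not (base_vocab_size <= token_id < expected_end):
            raise ValueError(
                f"Expected added token ids in range "
                f"{base_vocab_size} - {expected_end - 1}; got {token_id}")
```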

* Make gguf SpecialVocab vocab size-aware

Update conversion scripts accordingly
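
One way to read "vocab size-aware" (a minimal sketch assuming a hypothetical class, not the real gguf.SpecialVocab API): the container knows the vocabulary size and refuses special token ids that fall outside it, instead of writing an invalid id into the GGUF file.

```python
# Illustrative sketch only; class, attribute, and method names are assumptions.
class SizeAwareSpecialVocab:
    def __init__(self, n_vocab=None):
        self.n_vocab = n_vocab          # None means size unknown: accept all ids
        self.special_token_ids = {}

    def set_special_token(self, name, token_id):
        # Drop ids that cannot index the vocabulary rather than emit them.
        if self.n_vocab is not None and not (0 <= token_id < self.n_vocab):
            return False
        self.special_token_ids[name] = token_id
        return True
```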

* Avoid a string copy

Co-authored-by: Georgi Gerganov <redacted>
---------

Co-authored-by: Georgi Gerganov <redacted>
convert-baichuan-hf-to-gguf.py
convert-bloom-hf-to-gguf.py
convert-falcon-hf-to-gguf.py
convert-gptneox-hf-to-gguf.py
convert-llama-ggml-to-gguf.py
convert-mpt-hf-to-gguf.py
convert-refact-hf-to-gguf.py
convert-starcoder-hf-to-gguf.py
convert.py
gguf-py/gguf/gguf.py
llama.cpp