git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
BERT tokenizer fixes (#6498)
author Jared Van Bortel <redacted>
Tue, 9 Apr 2024 17:44:08 +0000 (13:44 -0400)
committer GitHub <redacted>
Tue, 9 Apr 2024 17:44:08 +0000 (13:44 -0400)
commit 1b67731e184e27a465b8c5476061294a4af668ea
tree 15a2d877029fb509a34e462c227475bc7d6dc31e
parent c4a3a4ff47d62d2503ddf9bd91b58c21f04fe3c3

Key changes:
* BERT conversion: fix abuse of LlamaHfVocab, do not set BOS or EOS
* Nomic Embed conversion: pad vocab instead of slicing embedding tensor
* llama_tokenize: handle added special tokens like HF does
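
The last two changes can be illustrated with a small, self-contained Python sketch. It is not the code from this commit: `pad_vocab` and `add_special_tokens` are hypothetical helpers, the `[PAD{i}]` placeholder naming is an assumption, and the `[CLS]`/`[SEP]` wrapping is the usual behavior of BERT-style HF tokenizers rather than something stated in the commit message.

```python
from typing import List


def pad_vocab(tokens: List[str], n_embd_rows: int) -> List[str]:
    """Pad the token list until it has one entry per embedding row.

    Hypothetical helper: the alternative to slicing the embedding tensor
    down to the vocab size is to grow the vocab with placeholder tokens.
    The "[PAD{i}]" naming is an assumption made for this sketch.
    """
    if len(tokens) > n_embd_rows:
        raise ValueError("vocab is larger than the embedding tensor")
    padding = [f"[PAD{i}]" for i in range(len(tokens), n_embd_rows)]
    return tokens + padding


def add_special_tokens(ids: List[int], cls_id: int, sep_id: int,
                       add_special: bool) -> List[int]:
    """Wrap token ids as [CLS] ... [SEP] when add_special is set.

    Hypothetical helper mirroring what BERT-style HF tokenizers do when
    asked to add special tokens; with add_special=False the ids are
    returned unchanged (as a copy).
    """
    if add_special:
        return [cls_id, *ids, sep_id]
    return list(ids)


if __name__ == "__main__":
    # Two real tokens, four embedding rows: two placeholders are appended.
    print(pad_vocab(["hello", "world"], 4))
    # The ids 101 and 102 are illustrative stand-ins for [CLS] and [SEP].
    print(add_special_tokens([7592, 2088], cls_id=101, sep_id=102,
                             add_special=True))
```

Padding keeps the embedding tensor untouched, so the token count and the tensor's row count agree without discarding any trained rows.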
20 files changed:
common/common.cpp
common/common.h
convert-hf-to-gguf.py
convert-persimmon-to-gguf.py
convert.py
examples/embedding/embedding.cpp
examples/imatrix/imatrix.cpp
examples/infill/infill.cpp
examples/llava/llava-cli.cpp
examples/lookahead/lookahead.cpp
examples/lookup/lookup-create.cpp
examples/lookup/lookup-stats.cpp
examples/lookup/lookup.cpp
examples/main/main.cpp
examples/perplexity/perplexity.cpp
examples/server/server.cpp
examples/speculative/speculative.cpp
examples/tokenize/tokenize.cpp
llama.cpp
llama.h