git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
Respect tokenizer.ggml.add_bos_token value when tokenizing (#4040)
author Kerfuffle <redacted>
Fri, 17 Nov 2023 02:14:37 +0000 (19:14 -0700)
committer GitHub <redacted>
Fri, 17 Nov 2023 02:14:37 +0000 (19:14 -0700)
commit 91f6499393d2d999331fbfdba47a7f8b9f913f0d
tree 27caf3ad0b9cec979bb5ed3317b5334bdcd9470c
parent 8da46278e1a57107591653275f8e03a281de94f0

* gguf-py: gguf-dump: Respect --no-tensor flag in JSON mode.

* Respect add_bos_token GGUF metadata value

* gguf-py: Try to fix SpecialVocab giving up too easily for the Nth time
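
The core idea of the second item, honouring the tokenizer.ggml.add_bos_token metadata value instead of always prepending a BOS token, can be illustrated with a small sketch. This is not the code from this commit: the function names, the plain dict standing in for GGUF metadata, the BOS id of 1, and the caller-supplied fallback for a missing key are all assumptions made for illustration.

```python
from typing import Dict, List

# Metadata key named in the commit title.
ADD_BOS_KEY = "tokenizer.ggml.add_bos_token"


def should_add_bos(metadata: Dict[str, object], default: bool = True) -> bool:
    """Decide whether to prepend BOS (hypothetical helper).

    If the key is present, its value wins. If it is absent, fall back to
    `default`; that fallback policy is an assumption of this sketch.
    """
    if ADD_BOS_KEY not in metadata:
        return default
    return bool(metadata[ADD_BOS_KEY])


def prepend_bos(tokens: List[int], metadata: Dict[str, object], bos_id: int = 1) -> List[int]:
    """Return a new token list with BOS prepended only when the metadata allows it.

    `bos_id=1` is an illustrative value, not taken from the commit.
    """
    if should_add_bos(metadata):
        return [bos_id] + list(tokens)
    return list(tokens)


if __name__ == "__main__":
    print(prepend_bos([5, 6], {ADD_BOS_KEY: False}))
    print(prepend_bos([5, 6], {ADD_BOS_KEY: True}))
```

Keeping the decision in a single helper means every caller that tokenizes text applies the same policy, which may be why the change touches several example programs at once.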
12 files changed:
common/common.cpp
common/common.h
examples/infill/infill.cpp
examples/llava/llava-cli.cpp
examples/main/main.cpp
examples/perplexity/perplexity.cpp
examples/server/server.cpp
gguf-py/gguf/vocab.py
gguf-py/pyproject.toml
gguf-py/scripts/gguf-dump.py
llama.cpp
llama.h