git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

author	Pedro Cuenca <redacted>
	Sun, 21 Apr 2024 11:50:41 +0000 (13:50 +0200)
committer	GitHub <redacted>
	Sun, 21 Apr 2024 11:50:41 +0000 (14:50 +0300)
commit	b97bc3966e852adb626c90be64fd48282800f504
tree	178656d15821205889fa03ec603c7327facbb265	tree
parent	b8109bc0139f15a5b321909f47510b89dca47ffc	commit \| diff

llama : support Llama 3 HF conversion (#6745)

* Support Llama 3 conversion

The tokenizer is BPE.

* style

* Accept suggestion

Co-authored-by: Sourab Mangrulkar <redacted>
* llama : add llama_token_is_eog()

ggml-ci

* llama : auto-detect more EOT tokens when missing in KV data

* convert : replacing EOS token is a hack

* llama : fix codegemma EOT token + add TODOs

* llama : fix model type string for 8B model

---------

Co-authored-by: Sourab Mangrulkar <redacted>
Co-authored-by: Georgi Gerganov <redacted>

20 files changed:

convert-hf-to-gguf.py		diff \| blob \| history
convert.py		diff \| blob \| history
examples/batched.swift/Sources/main.swift		diff \| blob \| history
examples/batched/batched.cpp		diff \| blob \| history
examples/beam-search/beam-search.cpp		diff \| blob \| history
examples/infill/infill.cpp		diff \| blob \| history
examples/llama.android/app/src/main/cpp/llama-android.cpp		diff \| blob \| history
examples/llama.swiftui/llama.cpp.swift/LibLlama.swift		diff \| blob \| history
examples/llava/llava-cli.cpp		diff \| blob \| history
examples/lookahead/lookahead.cpp		diff \| blob \| history
examples/lookup/lookup.cpp		diff \| blob \| history
examples/main/main.cpp		diff \| blob \| history
examples/parallel/parallel.cpp		diff \| blob \| history
examples/passkey/passkey.cpp		diff \| blob \| history
examples/server/server.cpp		diff \| blob \| history
examples/server/utils.hpp		diff \| blob \| history
examples/simple/simple.cpp		diff \| blob \| history
examples/speculative/speculative.cpp		diff \| blob \| history
llama.cpp		diff \| blob \| history
llama.h		diff \| blob \| history

Packaging of ggml-org/llama.cpp

RSS Atom