git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	Xuan Son Nguyen <redacted>
	Thu, 24 Oct 2024 19:51:22 +0000 (21:51 +0200)
committer	GitHub <redacted>
	Thu, 24 Oct 2024 19:51:22 +0000 (21:51 +0200)
commit	958367bf530d943a902afa1ce1c342476098576b
tree	2388735e8c1c8db054ccfa4a3f27dfee79b74852
parent	40f2555797f97314de749873cdc29dc102be66e2

server : refactor slot input data, move tokenizer to HTTP thread (#10023)

* server : refactor slot input data, move tokenizer to HTTP thread

* move prompt_tokens.empty() check

* fix incorrect if branch

* fix infinite generation loop

* bring back infill validation

* add infill test

* try fixing format_infill

* fix test

* remove redundant code

* rename completion to inference

* update docs

* use llama_tokens everywhere

examples/server/README.md
examples/server/server.cpp
examples/server/tests/features/infill.feature	[new file with mode: 0644]
examples/server/tests/features/steps/steps.py
examples/server/utils.hpp

Packaging of ggml-org/llama.cpp
