common: llama_load_model_from_url split support (#6192)
author      Pierrick Hymbert <redacted>
Sat, 23 Mar 2024 17:07:00 +0000 (18:07 +0100)
committer   GitHub <redacted>
Sat, 23 Mar 2024 17:07:00 +0000 (18:07 +0100)
commit      f482bb2e4920e544651fb832f2e0bcb4d2ff69ab
tree        9fabefd6f3b34aef6bf13a8469c7cdf363cc88cb
parent      1997577d5e121568ae39f538021733ccd4278c23
common: llama_load_model_from_url split support (#6192)

* llama: llama_split_prefix: fix strncpy not including the string terminator (see the sketch below)
common: llama_load_model_from_url:
 - fix case-sensitive header name matching
 - support downloading additional splits in parallel
 - hide the password in the URL
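
A minimal sketch of the termination pattern behind the llama_split_prefix fix; the function name and signature below are illustrative, not the actual API:

    #include <cstddef>
    #include <cstring>

    // strncpy does not append '\0' when the source is at least `n` characters
    // long, so the terminator has to be written explicitly after the copy.
    static void copy_prefix(char * dest, size_t dest_size, const char * src, size_t prefix_len) {
        const size_t n = prefix_len < dest_size - 1 ? prefix_len : dest_size - 1;
        strncpy(dest, src, n);
        dest[n] = '\0'; // the missing termination
    }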

* common: add EOL at EOF

* common: remove redundant LLAMA_CURL_MAX_PATH_LENGTH definition

* common: change the max URL length

* common: minor comment

* server: support HF URL options

* llama: llama_model_loader: fix log message

* common: use a constant for the max URL length
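
Illustrative only; the constant name follows the commit wording, and the value is an assumption rather than the one used in common.cpp:

    // assumption: a named upper bound replacing a hard-coded URL buffer size
    #define LLAMA_CURL_MAX_URL_LENGTH 2084

    char url[LLAMA_CURL_MAX_URL_LENGTH] = {0};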

* common: clean up curl if file cannot be loaded in gguf
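
A sketch of the added cleanup, assuming the downloaded file is validated with gguf_init_from_file and the curl handle is released when that fails; the helper below is illustrative, not the actual code:

    #include <cstdio>
    #include <curl/curl.h>
    #include "ggml.h"

    // illustrative: verify that `path` is a readable GGUF file; on failure,
    // release the curl handle before bailing out of the download
    static bool check_downloaded_gguf(CURL * curl, const char * path) {
        struct gguf_init_params params = {
            /*.no_alloc =*/ true,
            /*.ctx      =*/ nullptr,
        };
        struct gguf_context * ctx = gguf_init_from_file(path, params);
        if (ctx == nullptr) {
            fprintf(stderr, "failed to load %s as gguf\n", path);
            curl_easy_cleanup(curl); // the cleanup added by this change
            return false;
        }
        gguf_free(ctx);
        return true;
    }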

* server: tests: add split tests and HF option params

* common: move llama_download_hide_password_in_url inside llama_download_file as a lambda
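
A minimal sketch of what such a lambda could look like inside llama_download_file; the masking logic is an assumption, only the name comes from the commit:

    #include <cstddef>
    #include <string>

    // illustrative: mask the userinfo of "scheme://user:pass@host/..." before logging
    auto llama_download_hide_password_in_url = [](const std::string & url) -> std::string {
        const std::size_t protocol_pos = url.find("://");
        if (protocol_pos == std::string::npos) {
            return url; // no scheme, nothing to hide
        }
        const std::size_t at_pos = url.find('@', protocol_pos + 3);
        if (at_pos == std::string::npos) {
            return url; // no userinfo present
        }
        return url.substr(0, protocol_pos + 3) + "********" + url.substr(at_pos);
    };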

* server: tests: re-enable the Release test on PR

* spacing

Co-authored-by: Georgi Gerganov <redacted>
* spacing

Co-authored-by: Georgi Gerganov <redacted>
* spacing

Co-authored-by: Georgi Gerganov <redacted>
---------

Co-authored-by: Georgi Gerganov <redacted>
.github/workflows/server.yml
common/common.cpp
common/common.h
examples/gguf-split/gguf-split.cpp
examples/server/README.md
examples/server/server.cpp
examples/server/tests/features/parallel.feature
examples/server/tests/features/server.feature
examples/server/tests/features/steps/steps.py
llama.cpp