llama.cpp : split llama_context_params into model and context params (#3301)
author    slaren <redacted>
          Thu, 28 Sep 2023 19:42:38 +0000 (21:42 +0200)
committer GitHub <redacted>
          Thu, 28 Sep 2023 19:42:38 +0000 (22:42 +0300)
commit    16bc66d9479edd5ee12ec734973554d4493c5dfa
tree      4cca787ebd86dd55fd176d27112117c74e9b34c6
parent    0512d66670de3f650c579519833c085014b0f200

* llama.cpp : split llama_context_params into model and context params

ggml-ci

* fix metal build

* fix freq_base/scale default to model value

* llama-bench : keep the same model between tests when possible

* move n_threads to llama_context_params, add n_threads_batch

* fix mpi build

* remove kv_size(), cuda scratch fixes

* remove low-vram option

* add n_threads_batch to system info, refactor to get_system_info()

* add documentation about --threads-batch to the READMEs

* llama-bench fix

* main : fix rope freq/scale warning

* llama.cpp : add llama_get_model
common : add llama_tokenize from model

* remove duplicated ctx/model functions

ggml-ci

* cuda : print total VRAM used
27 files changed:
common/common.cpp
common/common.h
common/train.cpp
examples/batched/batched.cpp
examples/beam-search/beam-search.cpp
examples/embd-input/embd-input-lib.cpp
examples/embd-input/embd-input-test.cpp
examples/embedding/embedding.cpp
examples/finetune/finetune.cpp
examples/llama-bench/llama-bench.cpp
examples/main/README.md
examples/main/main.cpp
examples/parallel/parallel.cpp
examples/perplexity/perplexity.cpp
examples/quantize-stats/quantize-stats.cpp
examples/save-load-state/save-load-state.cpp
examples/server/README.md
examples/server/server.cpp
examples/simple/simple.cpp
examples/speculative/speculative.cpp
examples/train-text-from-scratch/train-text-from-scratch.cpp
ggml-cuda.cu
llama.cpp
llama.h
tests/test-tokenizer-0-falcon.cpp
tests/test-tokenizer-0-llama.cpp
tests/test-tokenizer-1-llama.cpp