author    Johannes Gäßler <redacted>
          Mon, 15 Dec 2025 08:24:59 +0000 (09:24 +0100)
committer GitHub <redacted>
          Mon, 15 Dec 2025 08:24:59 +0000 (09:24 +0100)
commit    b1f3a6e5db7b782ef077bd0e8253ce03283b1f37
tree      ba0475a546443cc22e981d7f4056a5c6b7160466
parent    4aced7a63156555911157d3002f9d3ddef4a1e55
llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization (#16653)

* llama: automatically fit args to free memory

llama-fit-params tool

* fix CI

* hints for bug reports, ensure no reallocation

* fix segfault with Vulkan

* add llama-fit-params to CI

* fix CI

* fix CI

* fix CI

* minor adjustments

* fix assignment of 1 dense layer

* fix logger not being reset on model load failure

* remove --n-gpu-layers hint on model load failure

* fix llama-fit-params verbosity

* fix edge case

* fix typo [no ci]
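
As background for the commit messages above, here is a minimal sketch of the idea behind "automatically fit args to free memory": query free device memory through the public ggml-backend API and cap n_gpu_layers so the model fits. The fit_n_gpu_layers helper and the fixed per-layer byte estimate are illustrative assumptions only; the actual change in this commit works from real allocation sizes rather than a guessed per-layer footprint.

// Sketch only: derive a conservative n_gpu_layers from free VRAM.
// ggml_backend_dev_count/get/type/memory are real ggml-backend APIs;
// everything else here is a hypothetical placeholder.
#include <cstddef>
#include <cstdio>

#include "ggml-backend.h"

// Hypothetical helper, not part of llama.cpp: how many layers of an assumed
// per-layer footprint fit into the free memory of the first GPU device?
static int fit_n_gpu_layers(size_t bytes_per_layer, int n_layer_total) {
    for (size_t i = 0; i < ggml_backend_dev_count(); i++) {
        ggml_backend_dev_t dev = ggml_backend_dev_get(i);
        if (ggml_backend_dev_type(dev) != GGML_BACKEND_DEVICE_TYPE_GPU) {
            continue;
        }
        size_t free_mem  = 0;
        size_t total_mem = 0;
        ggml_backend_dev_memory(dev, &free_mem, &total_mem);
        const int n_fit = (int) (free_mem / bytes_per_layer);
        return n_fit < n_layer_total ? n_fit : n_layer_total;
    }
    return 0; // no GPU device found: keep all layers on the CPU
}

int main() {
    const size_t bytes_per_layer = 512u*1024*1024; // assumed 512 MiB per layer
    const int    n_layer_total   = 80;             // assumed model depth
    printf("fitted n_gpu_layers = %d\n", fit_n_gpu_layers(bytes_per_layer, n_layer_total));
    return 0;
}
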
26 files changed:
.github/ISSUE_TEMPLATE/011-bug-results.yml
ci/run.sh
common/arg.cpp
common/common.cpp
common/common.h
ggml/include/ggml-alloc.h
ggml/include/ggml-backend.h
ggml/include/ggml.h
ggml/src/ggml-alloc.c
ggml/src/ggml-backend.cpp
ggml/src/ggml.c
include/llama.h
src/llama-context.cpp
src/llama-context.h
src/llama-hparams.h
src/llama-impl.cpp
src/llama-kv-cache.cpp
src/llama-model-loader.cpp
src/llama-model-loader.h
src/llama-model.cpp
src/llama-quant.cpp
src/llama.cpp
tools/CMakeLists.txt
tools/fit-params/CMakeLists.txt [new file with mode: 0644]
tools/fit-params/README.md [new file with mode: 0644]
tools/fit-params/fit-params.cpp [new file with mode: 0644]