git.djapps.eu Git - pkg/ggml/sources/ggml/commit

author	Johannes Gäßler <redacted>
	Mon, 15 Dec 2025 08:24:59 +0000 (09:24 +0100)
committer	Georgi Gerganov <redacted>
	Wed, 17 Dec 2025 11:55:04 +0000 (13:55 +0200)
commit	0a2b6b20ac232d78185e9b1da833b3cd8af9f8aa
tree	d41d6808c66d201468a1ea994ecc8ffbe274e316	tree
parent	691a7ca3b4e124d5c3e8ce840562e21cd028eb04	commit \| diff

llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization (llama/16653)

* llama: automatically fit args to free memory

llama-fit-params tool

* fix CI

* hints for bug reports, ensure no reallocation

* fix segfault with Vulkan

* add llama-fit-params to CI

* fix CI

* fix CI

* fix CI

* minor adjustments

* fix assignment of 1 dense layer

* fix logger not being reset on model load failure

* remove --n-gpu-layer hint on model load failure

* fix llama-fit-params verbosity

* fix edge case

* fix typo [no ci]

include/ggml-alloc.h		diff \| blob \| history
include/ggml-backend.h		diff \| blob \| history
include/ggml.h		diff \| blob \| history
src/ggml-alloc.c		diff \| blob \| history
src/ggml-backend.cpp		diff \| blob \| history
src/ggml.c		diff \| blob \| history