memory : remove KV cache size padding (#16812)
author    Georgi Gerganov <redacted>
          Tue, 28 Oct 2025 18:19:44 +0000 (20:19 +0200)
committer GitHub <redacted>
          Tue, 28 Oct 2025 18:19:44 +0000 (20:19 +0200)
commit    85a7d8677bf2200981e52f744a21d5267964ffcf
tree      8178fb226ade66e4c296567e9210c31c8452f3c3
parent    a8ca18b4b815a2abdbecb958ee5f4c542d69aac7

* memory : remove KV cache size padding

* cont : restore padding for n_kv tensor shape

* server : use slot context size instead of training context size

* server : simplify context limit logic
src/llama-kv-cache.cpp
src/llama-kv-cache.h
src/llama-model.cpp
src/llama-model.h
tools/server/server.cpp
tools/server/tests/unit/test_ctx_shift.py