git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
kv-cache : pad the cache size to 256 for performance (#17046)
author Georgi Gerganov <redacted>
Fri, 7 Nov 2025 18:03:25 +0000 (20:03 +0200)
committer GitHub <redacted>
Fri, 7 Nov 2025 18:03:25 +0000 (20:03 +0200)
commit 16bcc1259d311d0fd37fe00fefcc7900324d38cb
tree 292229ca321d74433d55b51f0e825d879413f916
parent 9eb9a1331dec83098c858150cd0a8ad9f6d8f46c

* kv-cache : pad the size of the small SWA cache for performance

* context : pad the total context to 256

* cont : future-proof the swa pad

* server : adjust test params to new logic
include/llama.h
src/llama-context.cpp
src/llama-kv-cache-iswa.cpp
tools/server/tests/unit/test_speculative.py