git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
server : context checkpointing for hybrid and recurrent models (#16382)
author    ddh0 <redacted>
Fri, 3 Oct 2025 18:34:51 +0000 (13:34 -0500)
committer GitHub <redacted>
Fri, 3 Oct 2025 18:34:51 +0000 (21:34 +0300)
commit    f6dcda390004b627ef30af378d0c01ad2519289e
tree      cedaf6aa7736de34e5b1e96828a0497f52005e9e
parent    606a73f53175077429484b23dcf799f69a31d0bd
server : context checkpointing for hybrid and recurrent models (#16382)

* initial commit for branch 3

* generalize `swa_checkpoint` to `ctx_checkpoint`

this extends `llama-server`'s SWA checkpointing logic to cover
hybrid and recurrent models such as Jamba and Granite

* oops

* disable debug prints

* keep backwards compat with `--swa-checkpoints`

Co-authored-by: Georgi Gerganov <redacted>
* update prompt re-processing message

* fix off-by-one error per GG

* keep `seq_rm` log per GG

Co-authored-by: Georgi Gerganov <redacted>
* server : fix checkpoint logic to support recurrent caches

* server : cleanup and fixes

---------

Co-authored-by: Georgi Gerganov <redacted>
common/arg.cpp
common/common.h
include/llama.h
src/llama-kv-cache-iswa.cpp
src/llama-memory-hybrid.cpp
src/llama-memory-recurrent.cpp
src/llama-model.cpp
tools/server/server.cpp