git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

author	Georgi Gerganov <redacted>
	Sat, 31 May 2025 12:57:44 +0000 (15:57 +0300)
committer	GitHub <redacted>
	Sat, 31 May 2025 12:57:44 +0000 (15:57 +0300)
commit	3600cc2886956fc0a07ef6ad2f4128ccfdbc8c6f
tree	f4f866af069b8d800ae47b151344b39007486d3f	tree
parent	c7e0a2054b908c28bf93bb18d4b63ccbff2c4127	commit \| diff

llama : use n_swa + n_ubatch cells for SWA cache (#13833)

* llama : use n_swa + n_ubatch cells for SWA cache

ggml-ci

* llama : add warning about multi-sqeuence SWA contexts

Packaging of ggml-org/llama.cpp

RSS Atom

include/llama.h		diff \| blob \| history
src/llama-context.cpp		diff \| blob \| history
src/llama-kv-cache.cpp		diff \| blob \| history
src/llama-kv-cache.h		diff \| blob \| history
src/llama-model.cpp		diff \| blob \| history
tools/server/server.cpp		diff \| blob \| history