]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
llama : use n_swa + n_ubatch cells for SWA cache (#13833)
authorGeorgi Gerganov <redacted>
Sat, 31 May 2025 12:57:44 +0000 (15:57 +0300)
committerGitHub <redacted>
Sat, 31 May 2025 12:57:44 +0000 (15:57 +0300)
commit3600cc2886956fc0a07ef6ad2f4128ccfdbc8c6f
treef4f866af069b8d800ae47b151344b39007486d3f
parentc7e0a2054b908c28bf93bb18d4b63ccbff2c4127
llama : use n_swa + n_ubatch cells for SWA cache (#13833)

* llama : use n_swa + n_ubatch cells for SWA cache

ggml-ci

* llama : add warning about multi-sqeuence SWA contexts
include/llama.h
src/llama-context.cpp
src/llama-kv-cache.cpp
src/llama-kv-cache.h
src/llama-model.cpp
tools/server/server.cpp