]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
server : add arg for disabling prompt caching (#18776)
authorRadoslav Gerganov <redacted>
Mon, 12 Jan 2026 17:21:34 +0000 (19:21 +0200)
committerGitHub <redacted>
Mon, 12 Jan 2026 17:21:34 +0000 (19:21 +0200)
commitbcf7546160982f56bc290d2e538544bbc0772f63
tree26b6f64dec8c66a73e43f051e93080aa0f20437f
parent36c5913c45264b1a38bccd7900d7670590650d55
server : add arg for disabling prompt caching (#18776)

* server : add arg for disabling prompt caching

Disabling prompt caching is useful for clients who are restricted to
sending only OpenAI-compat requests and want deterministic
responses.

* address review comments

* address review comments
common/arg.cpp
common/common.h
tools/server/server-task.cpp