]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
server : do not default to multiple slots with speculative decoding (#17017)
authorGeorgi Gerganov <redacted>
Wed, 5 Nov 2025 12:32:55 +0000 (14:32 +0200)
committerGitHub <redacted>
Wed, 5 Nov 2025 12:32:55 +0000 (14:32 +0200)
commit13b339bcd91de64d59512f308f6f69eaca688103
tree37ac764cd454aa87fd789d979485af7be16a9f33
parent2f0c2db43e2adfa9ffbdfa1176b3b6bd9c9ed536
server : do not default to multiple slots with speculative decoding (#17017)

* server : do not default to multiple slots with speculative decoding

* cont : fix
common/common.h
tools/server/server.cpp