git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

author	Xuan-Son Nguyen <redacted>
	Mon, 8 Dec 2025 13:35:28 +0000 (14:35 +0100)
committer	GitHub <redacted>
	Mon, 8 Dec 2025 13:35:28 +0000 (14:35 +0100)
commit	f896d2c34f7bb502c13986830b3ed7d85aac67d9
tree	15ac8a65596761fba6345ddf0f55cd0db949227a	tree
parent	e4e9c4329c088d3aa97b8c242e18ff79bfe66248	commit \| diff

server: improve speed of speculative decoding (#17808)

* server: improve speed of speculative decoding

* fix small draft case

* add link to the PR

* server : fix generation time measurement

* server : fix draft acceptance logs (add SRV_CNT, SLT_CNT macros)

* server : add comment

* add PR to docs

---------

Co-authored-by: Georgi Gerganov <redacted>

tools/server/README-dev.md		diff \| blob \| history
tools/server/server-common.h		diff \| blob \| history
tools/server/server-context.cpp		diff \| blob \| history

Packaging of ggml-org/llama.cpp

RSS Atom