git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
rpc : cache and reuse compute graphs (llama/15405)
author Radoslav Gerganov <redacted>
Fri, 28 Nov 2025 08:33:51 +0000 (10:33 +0200)
committer Georgi Gerganov <redacted>
Fri, 12 Dec 2025 15:53:11 +0000 (17:53 +0200)
commit d26d1c8b8595a1cfd05dc4e5f324d17bcd9c2fcd
tree a9e1f58c6daa15b09ecdee18ecd6b584ca845519
parent f92d542d4dfe162cae036705b8b406b71dac70a0

Store the last computed graph and reuse it when possible.
Also, do not return a response from GRAPH_COMPUTE; assume that it always
completes successfully. If it does not, the server closes
the connection. This saves a network round trip to the server.
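
A minimal sketch of the idea, not the code from this commit: the client
remembers the last graph it sent and, when the next graph is identical,
sends only a short "recompute" command. No reply is read after either
command; a failure shows up as a closed connection. All names here
(Cmd, FakeServer, CachingClient, demo_bytes_sent) are hypothetical, and
comparing serialized bytes is an assumption about how graph equality
could be detected.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical command IDs for this sketch; not the real ggml-rpc wire protocol.
enum class Cmd : uint8_t { GraphCompute, GraphRecompute };

// Stand-in for the remote server: it keeps the last graph it received.
struct FakeServer {
    std::vector<uint8_t> cached_graph;
    size_t bytes_received = 0;
    int    computes       = 0;
    bool   connected      = true;

    // Sends nothing back. On failure it only drops the connection.
    void handle(Cmd cmd, const std::vector<uint8_t> & payload) {
        if (!connected) return;
        bytes_received += 1 + payload.size();   // 1 command byte + payload
        if (cmd == Cmd::GraphCompute) {
            cached_graph = payload;             // store for later reuse
        } else if (cached_graph.empty()) {
            connected = false;                  // nothing cached: fatal
            return;
        }
        computes++;
    }
};

// Client side: caches the last serialized graph and avoids resending it.
struct CachingClient {
    FakeServer &         server;
    std::vector<uint8_t> last_sent;
    bool                 has_last = false;

    explicit CachingClient(FakeServer & s) : server(s) {}

    // Returns whether the connection is still open after sending.
    // There is no per-call status reply to wait for.
    bool graph_compute(const std::vector<uint8_t> & serialized) {
        if (!server.connected) return false;
        if (has_last && serialized == last_sent) {
            server.handle(Cmd::GraphRecompute, {});      // reuse cached graph
        } else {
            server.handle(Cmd::GraphCompute, serialized); // send full graph
            last_sent = serialized;
            has_last  = true;
        }
        return server.connected;
    }
};

// Example run: the same 64-byte graph is computed three times.
inline size_t demo_bytes_sent() {
    FakeServer    server;
    CachingClient client(server);
    std::vector<uint8_t> graph(64, 0xAB);
    for (int i = 0; i < 3; i++) {
        bool ok = client.graph_compute(graph);
        assert(ok);
        (void) ok;
    }
    return server.bytes_received;
}
```

In this sketch only the first of the three calls carries the full graph;
the two repeats cost one command byte each, and none of the calls waits
for a reply.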
ggml/include/ggml-rpc.h
ggml/src/ggml-rpc/ggml-rpc.cpp