git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
server: Add "tokens per second" information in the backend (#10548)
author haopeng <redacted>
Mon, 2 Dec 2024 13:45:54 +0000 (21:45 +0800)
committer GitHub <redacted>
Mon, 2 Dec 2024 13:45:54 +0000 (14:45 +0100)
commit 64ed2091b24b2f9747148fdf49a34ed5938762c3
tree 01e69f3a46744b868b8930941882928927395c9b
parent 991f8aabeec89d801300bb179e52013fb0eb0584

* add cmake rvv support

* add timings

* remove space

* update readme

* fix

* fix code

* remove empty line

* add test

---------

Co-authored-by: Xuan Son Nguyen <redacted>
common/common.h
examples/server/README.md
examples/server/server.cpp
examples/server/tests/unit/test_chat_completion.py
examples/server/utils.hpp