]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
server: continuous performance monitoring and PR comment (#6283)
authorPierrick Hymbert <redacted>
Wed, 27 Mar 2024 19:26:49 +0000 (20:26 +0100)
committerGitHub <redacted>
Wed, 27 Mar 2024 19:26:49 +0000 (20:26 +0100)
commita016026a3ac16d8c9b993a3573f19b9556d67de4
treeeb72bc0e48589c195d4523dd61511eb8f69c0dcc
parent53c7ec53d5eca26b2c0c648605543a5fa6c12817
server: continuous performance monitoring and PR comment (#6283)

* server: bench: init

* server: bench: reduce list of GPU nodes

* server: bench: fix graph, fix output artifact

* ci: bench: add mermaid in case of image cannot be uploaded

* ci: bench: more resilient, more metrics

* ci: bench: trigger build

* ci: bench: fix duration

* ci: bench: fix typo

* ci: bench: fix mermaid values, markdown generated

* typo on the step name

Co-authored-by: Xuan Son Nguyen <redacted>
* ci: bench: trailing spaces

* ci: bench: move images in a details section

* ci: bench: reduce bullet point size

---------

Co-authored-by: Xuan Son Nguyen <redacted>
.github/workflows/bench.yml [new file with mode: 0644]
examples/server/bench/bench.py [new file with mode: 0644]
examples/server/bench/prometheus.yml [new file with mode: 0644]
examples/server/bench/requirements.txt [new file with mode: 0644]
examples/server/tests/features/steps/steps.py