git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	Pierrick Hymbert <redacted>
	Wed, 27 Mar 2024 19:26:49 +0000 (20:26 +0100)
committer	GitHub <redacted>
	Wed, 27 Mar 2024 19:26:49 +0000 (20:26 +0100)
commit	a016026a3ac16d8c9b993a3573f19b9556d67de4
tree	eb72bc0e48589c195d4523dd61511eb8f69c0dcc	tree
parent	53c7ec53d5eca26b2c0c648605543a5fa6c12817	commit \| diff

server: continuous performance monitoring and PR comment (#6283)

* server: bench: init

* server: bench: reduce list of GPU nodes

* server: bench: fix graph, fix output artifact

* ci: bench: add mermaid in case of image cannot be uploaded

* ci: bench: more resilient, more metrics

* ci: bench: trigger build

* ci: bench: fix duration

* ci: bench: fix typo

* ci: bench: fix mermaid values, markdown generated

* typo on the step name

Co-authored-by: Xuan Son Nguyen <redacted>
* ci: bench: trailing spaces

* ci: bench: move images in a details section

* ci: bench: reduce bullet point size

---------

Co-authored-by: Xuan Son Nguyen <redacted>

.github/workflows/bench.yml	[new file with mode: 0644]	blob
examples/server/bench/bench.py	[new file with mode: 0644]	blob
examples/server/bench/prometheus.yml	[new file with mode: 0644]	blob
examples/server/bench/requirements.txt	[new file with mode: 0644]	blob
examples/server/tests/features/steps/steps.py		diff \| blob \| history