]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
server: init functional tests (#5566)
authorPierrick Hymbert <redacted>
Sat, 24 Feb 2024 11:28:55 +0000 (12:28 +0100)
committerGitHub <redacted>
Sat, 24 Feb 2024 11:28:55 +0000 (12:28 +0100)
commit525213d2f5da1eaf4b922b6b792cb52b2c613368
tree8400e8a97d231b13a2df0c9d8b7c8fa945d24d5e
parentfd43d66f46ee3b5345fb8a74a252d86ccd34a409
server: init functional tests (#5566)

* server: tests: init scenarios
 - health and slots endpoints
 - completion endpoint
 - OAI compatible chat completion requests w/ and without streaming
 - completion multi users scenario
 - multi users scenario on OAI compatible endpoint with streaming
 - multi users with total number of tokens to predict exceeds the KV Cache size
 - server wrong usage scenario, like in Infinite loop of "context shift" #3969
 - slots shifting
 - continuous batching
 - embeddings endpoint
 - multi users embedding endpoint: Segmentation fault #5655
 - OpenAI-compatible embeddings API
 - tokenize endpoint
 - CORS and api key scenario

* server: CI GitHub workflow

---------

Co-authored-by: Georgi Gerganov <redacted>
14 files changed:
.github/ISSUE_TEMPLATE/bug.md
.github/workflows/server.yml [new file with mode: 0644]
examples/server/README.md
examples/server/server.cpp
examples/server/tests/README.md [new file with mode: 0644]
examples/server/tests/features/environment.py [new file with mode: 0644]
examples/server/tests/features/issues.feature [new file with mode: 0644]
examples/server/tests/features/parallel.feature [new file with mode: 0644]
examples/server/tests/features/security.feature [new file with mode: 0644]
examples/server/tests/features/server.feature [new file with mode: 0644]
examples/server/tests/features/steps/steps.py [new file with mode: 0644]
examples/server/tests/features/wrong_usages.feature [new file with mode: 0644]
examples/server/tests/requirements.txt [new file with mode: 0644]
examples/server/tests/tests.sh [new file with mode: 0755]