]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
server: continue to update other slots on embedding concurrent request (#5699)
authorPierrick Hymbert <redacted>
Sat, 24 Feb 2024 18:16:04 +0000 (19:16 +0100)
committerGitHub <redacted>
Sat, 24 Feb 2024 18:16:04 +0000 (19:16 +0100)
commit9e359a4f47c1b2dceb99e29706c9f7403d32ab5e
treeaa491d0744940ccce9ff69fe1bcc9e1f16b7a1ff
parent4c4cb30736582cacb1a164a9d4bc8e17b1014be7
server: continue to update other slots on embedding concurrent request (#5699)

* server: #5655 - continue to update other slots on embedding concurrent request.

* server: tests: add multi users embeddings as fixed

* server: tests: adding OAI compatible embedding concurrent endpoint

* server: tests: adding OAI compatible embedding with multiple inputs
examples/server/server.cpp
examples/server/tests/features/issues.feature
examples/server/tests/features/parallel.feature
examples/server/tests/features/server.feature
examples/server/tests/features/steps/steps.py