From: Daniel Bevenius
Date: Fri, 5 Sep 2025 02:36:23 +0000 (+0200)
Subject: model-conversion : add --embeddings flag to modelcard.template [no ci] (#15801)
X-Git-Tag: upstream/0.0.6527~139
X-Git-Url: https://git.djapps.eu/?a=commitdiff_plain;h=5d6688de08e73acc2532d668380801ed79d704eb;p=pkg%2Fggml%2Fsources%2Fllama.cpp

model-conversion : add --embeddings flag to modelcard.template [no ci] (#15801)

This commit updates the modelcard.template file used in the model
conversion scripts for embedding models to include the llama-server
--embeddings flag in the recommended command to run the model.

The motivation for this change was that when using the model-conversion
"tool" to upload the EmbeddingGemma models to Hugging Face, this flag was
missing and the embedding endpoint was therefore not available when
copy-pasting the command.
---

diff --git a/examples/model-conversion/scripts/embedding/modelcard.template b/examples/model-conversion/scripts/embedding/modelcard.template
index 75c58052..9e63042b 100644
--- a/examples/model-conversion/scripts/embedding/modelcard.template
+++ b/examples/model-conversion/scripts/embedding/modelcard.template
@@ -7,7 +7,7 @@ base_model:
 Recommended way to run this model:
 ```sh
-llama-server -hf {namespace}/{model_name}-GGUF
+llama-server -hf {namespace}/{model_name}-GGUF --embeddings
 ```
 Then the endpoint can be accessed at http://localhost:8080/embedding, for