From: Daniel Bevenius <redacted>
Date: Fri, 5 Sep 2025 02:36:23 +0000 (+0200)
Subject: model-conversion : add --embeddings flag to modelcard.template [no ci] (#15801)
X-Git-Tag: upstream/0.0.6527~139
X-Git-Url: https://git.djapps.eu/?a=commitdiff_plain;h=5d6688de08e73acc2532d668380801ed79d704eb;p=pkg%2Fggml%2Fsources%2Fllama.cpp

model-conversion : add --embeddings flag to modelcard.template [no ci] (#15801)

This commit updates the modelcard.template file used in the model
conversion scripts for embedding models to include the llama-server
--embeddings flag in the recommended command to run the model.

The motivation for this change was that when using the model-conversion
"tool" to upload the EmbeddingGemma models to Hugging Face this flag was
missing and the embedding endpoint was there for not available when
copy&pasting the command.
---

diff --git a/examples/model-conversion/scripts/embedding/modelcard.template b/examples/model-conversion/scripts/embedding/modelcard.template
index 75c580524..9e63042b7 100644
--- a/examples/model-conversion/scripts/embedding/modelcard.template
+++ b/examples/model-conversion/scripts/embedding/modelcard.template
@@ -7,7 +7,7 @@ base_model:
 Recommended way to run this model:
 
 ```sh
-llama-server -hf {namespace}/{model_name}-GGUF
+llama-server -hf {namespace}/{model_name}-GGUF --embeddings
 ```
 
 Then the endpoint can be accessed at http://localhost:8080/embedding, for