From: Daniel Bevenius Date: Mon, 22 Jan 2024 11:11:01 +0000 (+0100) Subject: finetune : print sample-start/include-sample-start (#5072) X-Git-Tag: upstream/0.0.4488~2546 X-Git-Url: https://git.djapps.eu/?a=commitdiff_plain;h=152d9d05e097e35f1cac21262946d57faec7542a;p=pkg%2Fggml%2Fsources%2Fllama.cpp finetune : print sample-start/include-sample-start (#5072) This commit adds `--sample-start` and `--include-sample-start` to the output from the main function in finetune.cpp. The motivation for this is that even though these are set explicitly by the user via the command line, if one forgets to set them then it is useful to have their values printed out. Otherwise it is possible to go through the whole training process before realizing that the values are not what one expected. Signed-off-by: Daniel Bevenius --- diff --git a/examples/finetune/finetune.cpp b/examples/finetune/finetune.cpp index 11fcbf44..b7e19c5f 100644 --- a/examples/finetune/finetune.cpp +++ b/examples/finetune/finetune.cpp @@ -1800,6 +1800,8 @@ int main(int argc, char ** argv) { std::vector train_samples_begin; std::vector train_samples_size; printf("%s: tokenize training data from %s\n", __func__, params.common.fn_train_data); + printf("%s: sample-start: %s\n", __func__, params.common.sample_start.c_str()); + printf("%s: include-sample-start: %s\n", __func__, params.common.include_sample_start ? "true" : "false"); tokenize_file(lctx, params.common.fn_train_data, params.common.sample_start,