From: David Yang Date: Sat, 17 Jun 2023 06:51:54 +0000 (+0800) Subject: train : get raw text instead of page with html (#1905) X-Git-Tag: gguf-v0.4.0~616 X-Git-Url: https://git.djapps.eu/?a=commitdiff_plain;h=92f20d9942c86daeb78637bdad7296a572f4da28;p=pkg%2Fggml%2Fsources%2Fllama.cpp train : get raw text instead of page with html (#1905) We probably want to train using just the text of Shakespeare instead of the html of the page displaying his work. --- diff --git a/examples/train-text-from-scratch/README.md b/examples/train-text-from-scratch/README.md index 5344d1f5..726ec47c 100644 --- a/examples/train-text-from-scratch/README.md +++ b/examples/train-text-from-scratch/README.md @@ -4,7 +4,7 @@ Basic usage instructions: ```bash # get training data -wget https://github.com/brunoklein99/deep-learning-notes/blob/master/shakespeare.txt +wget https://raw.githubusercontent.com/brunoklein99/deep-learning-notes/master/shakespeare.txt # train ./bin/train-text-from-scratch \