Update README.md (#3289)

author Lee Drake <redacted>

Thu, 21 Sep 2023 19:00:24 +0000 (13:00 -0600)

committer GitHub <redacted>

Thu, 21 Sep 2023 19:00:24 +0000 (21:00 +0200)
author Lee Drake <redacted>
Thu, 21 Sep 2023 19:00:24 +0000 (13:00 -0600)
committer GitHub <redacted>
Thu, 21 Sep 2023 19:00:24 +0000 (21:00 +0200)
diff --git a/README.md b/README.md

index 670e2e6734f644a026812a7edb974ab295bbd38c..42686aacc970bc2e47458659f88cff7923e785eb 100644 (file)
--- a/README.md
+++ b/README.md
@@ -557,6 +557,10 @@ python3 convert.py models/7B/
  # quantize the model to 4-bits (using q4_0 method)
  ./quantize ./models/7B/ggml-model-f16.gguf ./models/7B/ggml-model-q4_0.gguf q4_0
  
+# update the gguf filetype to current if older version is unsupported by another application
+./quantize ./models/7B/ggml-model-q4_0.gguf ./models/7B/ggml-model-q4_0-v2.gguf COPY
+
+
  # run the inference
  ./main -m ./models/7B/ggml-model-q4_0.gguf -n 128
  ```
author	Lee Drake <redacted>
	Thu, 21 Sep 2023 19:00:24 +0000 (13:00 -0600)
committer	GitHub <redacted>
	Thu, 21 Sep 2023 19:00:24 +0000 (21:00 +0200)