From: Georgi Gerganov Date: Thu, 29 Sep 2022 20:48:01 +0000 (+0300) Subject: Update README.md X-Git-Tag: upstream/1.7.4~2012 X-Git-Url: https://git.djapps.eu/?a=commitdiff_plain;h=310f4883d1d4bbba56a69f560870566ed1e4ed76;p=pkg%2Fggml%2Fsources%2Fwhisper.cpp Update README.md --- diff --git a/README.md b/README.md index 9185a3c8..5b06c126 100644 --- a/README.md +++ b/README.md @@ -96,6 +96,17 @@ main: total time = 908.15 ms The command downloads the `base.en` model converted to custom `ggml` format and runs the inference on all `.wav` samples in the folder `samples`. +For detailed usage instructions, run: `./main -h` + +Note that `whisper.cpp` currently runs only with 16-bit WAV files, so make sure to convert your input before running the tool. +For example, you can use `ffmpeg` like this: + +```java +ffmpeg -i input.mp3 -ar 16000 -ac 1 -c:a pcm_s16le output.wav +``` + +## More audio samples + If you want some extra audio samples to play with, simply run: ``` @@ -118,14 +129,7 @@ make medium make large ``` -For detailed usage instructions, run: `./main -h` - -Note that `whisper.cpp` runs only with 16-bit WAV files, so make sure to convert your input before running the tool. -For example, you can use `ffmpeg` like this: - -```java -ffmpeg -i input.mp3 -ar 16000 -ac 1 -c:a pcm_s16le output.wav -``` +## Another example Here is another example of transcribing a [3:24 min speech](https://upload.wikimedia.org/wikipedia/commons/1/1f/George_W_Bush_Columbia_FINAL.ogg) in less than a minute, using `medium.en` model: