From: KITAITI Makoto Date: Fri, 23 May 2025 08:38:26 +0000 (+0900) Subject: docs : fix VAD section heading levels (#3186) X-Git-Url: https://git.djapps.eu/?a=commitdiff_plain;h=13d92d08ae26031545921243256aaaf0ee057943;p=pkg%2Fggml%2Fsources%2Fwhisper.cpp docs : fix VAD section heading levels (#3186) --- diff --git a/README.md b/README.md index 8b010a72..44ebc41e 100644 --- a/README.md +++ b/README.md @@ -733,7 +733,7 @@ let package = Package( ) ``` -### Voice Activity Detection (VAD) +## Voice Activity Detection (VAD) Support for Voice Activity Detection (VAD) can be enabled using the `--vad` argument to `whisper-cli`. In addition to this option a VAD model is also required. @@ -747,7 +747,7 @@ transcription process. The following VAD models are currently supported: -#### Silero-VAD +### Silero-VAD [Silero-vad](https://github.com/snakers4/silero-vad) is a lightweight VAD model written in Python that is fast and accurate. @@ -792,7 +792,7 @@ $ ./build/bin/whisper-cli \ --vad-model ./models/silero-v5.1.2-ggml.bin ``` -#### VAD Options +### VAD Options * --vad-threshold: Threshold probability for speech detection. A probability for a speech segment/frame above this threshold will be considered as speech.