git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
whisper : suppress tokens with a regex (#1997)
author ulatekh <redacted>
Tue, 9 Apr 2024 15:27:28 +0000 (08:27 -0700)
committer GitHub <redacted>
Tue, 9 Apr 2024 15:27:28 +0000 (18:27 +0300)
commit c8eeb93a6ad575f56eae1fa1e8235d995f218ee3
tree a263be34880163d2e8c1612240b917670be55326
parent 319fe5146e7869b496b77b6a8e541cb520eaa465

* Allow a regular expression to describe tokens to suppress.

Example: --suppress-tokens-re "[,\.]|[ ]?[0-9]+" will suppress commas, periods, and numeric tokens.

Technique inspired by https://github.com/openai/whisper/discussions/1041

Co-authored-by: Georgi Gerganov <redacted>
* Blind change to fix Java test.

---------

Co-authored-by: Georgi Gerganov <redacted>
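
The regex-based suppression described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the code from this commit: `vocab_map` and `suppress_tokens_re` are hypothetical names, and the vocabulary is modeled as a plain map from token text to token id. The idea is to set the logit of every token whose text matches the pattern to negative infinity, so the decoder can never sample it.

```cpp
#include <cmath>
#include <map>
#include <regex>
#include <string>
#include <vector>

// Hypothetical stand-in for the model vocabulary: token text -> token id.
using vocab_map = std::map<std::string, int>;

// Sets the logit of every token whose full text matches `pattern` to -infinity
// and returns how many tokens were suppressed. Ids outside `logits` are skipped.
inline int suppress_tokens_re(const vocab_map & token_to_id,
                              const std::string & pattern,
                              std::vector<float> & logits) {
    const std::regex re(pattern); // ECMAScript grammar (the std::regex default)
    int n_suppressed = 0;
    for (const auto & kv : token_to_id) {
        const int id = kv.second;
        if (id < 0 || id >= static_cast<int>(logits.size())) {
            continue;
        }
        // regex_match requires the whole token text to match, not a substring.
        if (std::regex_match(kv.first, re)) {
            logits[id] = -INFINITY;
            ++n_suppressed;
        }
    }
    return n_suppressed;
}
```

With the pattern from the example, "[,\.]|[ ]?[0-9]+", tokens such as "," or " 42" would be suppressed in this sketch, while a token such as "a1" would not, because the match is applied to the entire token text.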
bindings/java/src/main/java/io/github/ggerganov/whispercpp/params/WhisperFullParams.java
examples/command/command.cpp
examples/main/main.cpp
whisper.cpp
whisper.h