]>
git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/log
Georgi Gerganov [Mon, 10 Apr 2023 20:18:29 +0000 (23:18 +0300)]
ggml : fix WASM build
Georgi Gerganov [Mon, 10 Apr 2023 20:09:15 +0000 (23:09 +0300)]
talk-llama : increase context to 2048
Georgi Gerganov [Mon, 10 Apr 2023 19:59:13 +0000 (22:59 +0300)]
talk-llama : update to latest llama.cpp (improved performance)
Georgi Gerganov [Mon, 10 Apr 2023 19:28:54 +0000 (22:28 +0300)]
ggml : backport llama.cpp updates (close #709)
- About x2 overall performance improvement on Apple Silicon
- Results should now be the same for different number of threads (not
tested)
pajowu [Thu, 30 Mar 2023 17:29:29 +0000 (19:29 +0200)]
whisper : add progress callback (#600)
Zigfrid Zvezdin [Thu, 30 Mar 2023 04:51:33 +0000 (01:51 -0300)]
misc : typo (#688)
InconsolableCellist [Wed, 29 Mar 2023 21:10:20 +0000 (15:10 -0600)]
talk-llama : fixing usage message for talk-llama (#687)
"-ml" instead of "-mg" for specifying the llama file
Georgi Gerganov [Wed, 29 Mar 2023 20:59:45 +0000 (23:59 +0300)]
main : add <cstring> header
Lucas Zanek [Wed, 29 Mar 2023 20:59:17 +0000 (17:59 -0300)]
whisper.addon : fixed test to new async implementation (#686)
* fixed blocking code on node addon
* modify the example to run async
* format
* added logic to see the whisper output
* added logic to see the whisper output
* removed extra function for more clean example
* fixed whisper test to new async implementation
be-next [Wed, 29 Mar 2023 20:38:33 +0000 (22:38 +0200)]
models : handle spaces and special characters in shell script paths (#677)
This commit modifies the `get_script_path` function to correctly handle
spaces and special characters in directory paths. The fix involves adding
double quotes around variables and commands where needed to ensure proper
parsing of paths with spaces and special characters.
Egor Egorov [Wed, 29 Mar 2023 20:26:39 +0000 (23:26 +0300)]
main : fix typo in JSON output (#648)
* typo in JSON output
* fix double quotes in JSON output
Jhen-Jie Hong [Wed, 29 Mar 2023 20:23:23 +0000 (04:23 +0800)]
whisper : add initial_prompt param (#645)
clach04 [Wed, 29 Mar 2023 20:11:35 +0000 (13:11 -0700)]
make : 32-bit ARM flags (#486)
* issue #470 - working 32-bit ARM
* Update Makefile
* Update Makefile
---------
Co-authored-by: Georgi Gerganov <redacted>
Jonno [Wed, 29 Mar 2023 20:04:38 +0000 (06:04 +1000)]
whisper.swiftui : update README.md (#682)
- Slight tweaks to README for improved comprehension.
Evan Jones [Wed, 29 Mar 2023 20:01:14 +0000 (16:01 -0400)]
talk-llama : add alpaca support (#668)
Georgi Gerganov [Tue, 28 Mar 2023 07:50:49 +0000 (10:50 +0300)]
whisper : bump "large" scratch buffer even mode (close #671)
Georgi Gerganov [Tue, 28 Mar 2023 07:36:16 +0000 (10:36 +0300)]
whisper : increase scratch buffers after recent change (#671)
Should fix the error:
ggml_new_tensor_impl: not enough space in the scratch memory
Georgi Gerganov [Tue, 28 Mar 2023 07:11:34 +0000 (10:11 +0300)]
talk-llama : add discussion link
Georgi Gerganov [Mon, 27 Mar 2023 18:28:00 +0000 (21:28 +0300)]
talk-llama : try to fix windows build ..
Georgi Gerganov [Mon, 27 Mar 2023 18:02:35 +0000 (21:02 +0300)]
readme : add talk-llama example to the table
Georgi Gerganov [Mon, 27 Mar 2023 18:00:32 +0000 (21:00 +0300)]
talk-llama : add new example + sync ggml from llama.cpp (#664)
* talk-llama : talk with LLaMA AI
* talk.llama : disable EOS token
* talk-llama : add README instructions
* ggml : fix build in debug
Georgi Gerganov [Wed, 22 Mar 2023 20:34:39 +0000 (22:34 +0200)]
whisper : disable fallbacks until the performance is improved (#588)
Andrew Huynh [Wed, 22 Mar 2023 20:30:40 +0000 (13:30 -0700)]
cmake : add a flag to disable F16C (#628)
jwijffels [Wed, 22 Mar 2023 20:28:22 +0000 (21:28 +0100)]
Include link to R wrapper in README (#626)
Lucas Zanek [Wed, 22 Mar 2023 20:19:22 +0000 (17:19 -0300)]
Nodejs Addon blocking main thread. Implemented Napi::AsyncWorker (#642)
* fixed blocking code on node addon
* modify the example to run async
* format
* added logic to see the whisper output
* added logic to see the whisper output
* removed extra function for more clean example
Jhen-Jie Hong [Wed, 22 Mar 2023 20:16:04 +0000 (04:16 +0800)]
whisper.objc : add `-O3 -DNDEBUG` in release mode (#640)
sandrohanea [Wed, 22 Mar 2023 19:47:09 +0000 (20:47 +0100)]
fixed language auto-detection for state provided processing (#627)
Co-authored-by: Sandro Hanea <redacted>
Jhen-Jie Hong [Wed, 22 Mar 2023 19:39:02 +0000 (03:39 +0800)]
readme : add react-native bindings (#619)
Leo Moll [Wed, 22 Mar 2023 19:37:36 +0000 (20:37 +0100)]
main : provide option for creating JSON output (#615)
* examples : provide option for exporting also as JSON file (ggerganov/whisper.cpp#614)
* main : remove leftovers
---------
Co-authored-by: Georgi Gerganov <redacted>
Kamilake [Wed, 22 Mar 2023 19:17:24 +0000 (04:17 +0900)]
models : change default encoding to utf8 (#605)
Georgi Gerganov [Wed, 22 Mar 2023 18:51:42 +0000 (20:51 +0200)]
make : fix MUSL Linux build (#576)
Georgi Gerganov [Wed, 22 Mar 2023 18:44:56 +0000 (20:44 +0200)]
models : change HF hosting from dataset to model
Takeshi Inoue [Tue, 7 Mar 2023 19:36:30 +0000 (04:36 +0900)]
whisper.android : support benchmark for Android example. (#542)
* whisper.android: Support benchmark for Android example.
* whisper.android: update screenshot in README.
* update: Make text selectable for copy & paste.
* Update whisper.h to restore API name
Co-authored-by: Georgi Gerganov <redacted>
* whisper.android: Restore original API names.
---------
Co-authored-by: tinoue <redacted>
Co-authored-by: Georgi Gerganov <redacted>
Georgi Gerganov [Mon, 6 Mar 2023 19:06:27 +0000 (21:06 +0200)]
readme : add bench-wts.sh demo
Georgi Gerganov [Mon, 6 Mar 2023 19:02:24 +0000 (21:02 +0200)]
bench-wts.sh : rename script + add execute permission
venkr [Mon, 6 Mar 2023 17:18:11 +0000 (09:18 -0800)]
qual-bench.sh : add quality comparison tool, and update main.cpp to allow using a font file (#569)
Takeshi Inoue [Mon, 6 Mar 2023 17:15:57 +0000 (02:15 +0900)]
whisper.android : enable fp16 instrinsics (FP16_VA) which is supported by ARMv8.2 or later. (#572)
sandrohanea [Sun, 5 Mar 2023 19:42:19 +0000 (20:42 +0100)]
whisper : add whisper_state + default state on the whisper_context (#523)
* Added whisper state + default state on the whisper_context
* Fixed some examples and bindings
* Fixed whisper_n_len (which was used in some binding) and added whisper_n_len_from_state
* Fixed comments
* whisper : reuse kv_cache_free() and fix compiler warnings
* whisper : clean-up the API comments
---------
Co-authored-by: Sandro Hanea <redacted>
Co-authored-by: Georgi Gerganov <redacted>
Georgi Gerganov [Sun, 5 Mar 2023 18:53:43 +0000 (20:53 +0200)]
whisper : set no_context == true by default (#537)
polarmoon [Sun, 5 Mar 2023 18:50:25 +0000 (10:50 -0800)]
go : NewContext now returns a clean context (#537)
Co-authored-by: Ming <redacted>
HY. Kelvin Lee [Thu, 2 Mar 2023 16:32:16 +0000 (11:32 -0500)]
main : add csv header (#552)
Georgi Gerganov [Tue, 28 Feb 2023 21:27:54 +0000 (23:27 +0200)]
make : add -DNDEBUG compile flag
Georgi Gerganov [Tue, 28 Feb 2023 20:29:12 +0000 (22:29 +0200)]
release : v1.2.1
FlippFuzz [Mon, 27 Feb 2023 19:04:16 +0000 (03:04 +0800)]
make : add "-mcpu=native" when building for aarch64 (#532)
Aaron Pham [Mon, 27 Feb 2023 19:02:11 +0000 (11:02 -0800)]
readme : add pybind11 bindings (#538)
Georgi Gerganov [Fri, 24 Feb 2023 06:46:06 +0000 (08:46 +0200)]
readme : add cython bindings (#9)
Georgi Gerganov [Tue, 21 Feb 2023 17:00:42 +0000 (19:00 +0200)]
whisper : zero-initialize some more context variables
Just in case
Finn Voorhees [Tue, 21 Feb 2023 11:42:10 +0000 (11:42 +0000)]
whisper : fix uninitialized exp_n_audio_ctx
Georgi Gerganov [Sun, 19 Feb 2023 16:35:01 +0000 (18:35 +0200)]
whisper : add API for applying custom logits filters during decoding
Georgi Gerganov [Sat, 18 Feb 2023 07:42:31 +0000 (09:42 +0200)]
yt-wsp.sh : print help on empty args
Georgi Gerganov [Wed, 15 Feb 2023 19:48:49 +0000 (21:48 +0200)]
whisper : by default disable non-speech tokens suppression (#473)
This seems to be causing hallucinations in the end of the audio, e.g.:
"Thank you for listening"
"Amen"
..
Georgi Gerganov [Wed, 15 Feb 2023 17:51:54 +0000 (19:51 +0200)]
readme : add Ruby discussion + update .NET discussion
Todd [Wed, 15 Feb 2023 17:46:55 +0000 (12:46 -0500)]
bindings : add Ruby (#500)
* adding ruby bindings
* avoid adding these they are copied in via extconf.rb
* ignore these files here
* add definitions for boolean params
* initial transcribe for ruby
* use en model and transcribe jfk with assertion
* possibly this works for building ruby binding
* ci : try to add ruby workflow
---------
Co-authored-by: Georgi Gerganov <redacted>
conradg [Wed, 15 Feb 2023 17:31:16 +0000 (17:31 +0000)]
main : fix std in input (#503)
if we don't add this as an explicit check, then we get an "error: unknown argument: -" later on
Georgi Gerganov [Wed, 15 Feb 2023 17:28:10 +0000 (19:28 +0200)]
examples : refactor in order to reuse code and reduce duplication (#482)
* examples : refactor common code into a library
* examples : refactor common SDL code into a library
* make : update Makefile to use common libs
* common : fix MSVC M_PI ..
* addon.node : link common lib
shikokuchuo [Wed, 15 Feb 2023 17:08:25 +0000 (17:08 +0000)]
whisper : fix signedness compiler warning (#506)
genevera (she/her) [Tue, 14 Feb 2023 18:12:51 +0000 (13:12 -0500)]
yt-wsp.sh : add unique filename generation (#495)
Co-authored-by: genevera <redacted>
Georgi Gerganov [Tue, 14 Feb 2023 18:04:03 +0000 (20:04 +0200)]
readme : add another .NET repo (#303)
Georgi Gerganov [Sat, 11 Feb 2023 15:35:33 +0000 (17:35 +0200)]
readme : add .NET repo (#303)
Avik Sengupta [Sat, 11 Feb 2023 07:13:32 +0000 (07:13 +0000)]
cmake : install whisper.h header (#485)
Including the header file in the install bundle helps projects that ship binaries.
shibukazu [Wed, 8 Feb 2023 07:05:34 +0000 (16:05 +0900)]
whisper : suppress non-speech-related token outputs (#473)
* add non-speech-token suppression
* add suppress non-speech_tokens param
sandrohanea [Wed, 8 Feb 2023 07:01:47 +0000 (08:01 +0100)]
whisper : fixed Beam Search Strategy and exposed whisper_pcm_to_mel_phase_vocoder (#474)
Co-authored-by: Sandro Hanea <redacted>
boolemancer [Wed, 8 Feb 2023 06:43:23 +0000 (22:43 -0800)]
whisper : only trim if split_on_word is true (#476)
Qianhe Chen [Sun, 5 Feb 2023 13:02:08 +0000 (21:02 +0800)]
ci : add node addon test and optimize compilation configuration (#468)
* addon: implement node addon call whisper through cpp
* addon: modify the license to MIT
* addon: remove iostream
* addon: rename dir
* addon: fix typo
* addon: configure cmake to build when cmake-js is used
* ci: add addon.node test ci
* addon: remove build WHISPER_BUILD_TESTS
* addon: update build command
* addon: add test
* addon: add test file
* addon: adapt to compile on Windows
* addon: fix typo
* addon: reuse jfk.wav
Co-authored-by: Georgi Gerganov <redacted>
* addon: reuse jfk.wav
---------
Co-authored-by: Georgi Gerganov <redacted>
kamranjon [Sun, 5 Feb 2023 12:46:26 +0000 (04:46 -0800)]
whisper : add whisper_full_lang_id() for getting the context lang (#461)
Matija Pevec [Sun, 5 Feb 2023 12:44:23 +0000 (13:44 +0100)]
whisper : add "split_on_word" flag when using using "max_len" option (#455)
* Update whisper.cpp
* fix: trim function
* feat: added flag to split on word
* fix: arguments for main
Georgi Gerganov [Sat, 4 Feb 2023 07:49:49 +0000 (09:49 +0200)]
release : v1.2.0
Georgi Gerganov [Sat, 4 Feb 2023 07:45:52 +0000 (09:45 +0200)]
whisper : reduce memory usage during inference (#431)
* ggml : add "scratch" buffer support
* ggml : support for scratch ring-buffer
* ggml : bug fix in ggml_repeat()
* ggml : error on scratch buffer overflow
* whisper : use scratch buffers during inference (base model only)
* whisper : update memory usage for all models
* whisper : fix encoder memory usage
* whisper : use whisper_context functions instead of macros
* whisper : fix FF + remove it from README
* ggml : reuse ggml_new_i32
* ggml : refactor the scratch buffer storage
* whisper : reorder scratch buffers in the decoder
* main : add option to disable temp fallback
* Update README.md
Qianhe Chen [Sat, 4 Feb 2023 07:10:25 +0000 (15:10 +0800)]
addon.node : using whisper as a Node.js addon (#443)
* addon: implement node addon call whisper through cpp
* addon: modify the license to MIT
* addon: remove iostream
* addon: rename dir
* addon: fix typo
* addon: configure cmake to build when cmake-js is used
polarmoon [Sat, 4 Feb 2023 07:09:27 +0000 (23:09 -0800)]
go : support "auto" as an option when set language (#462)
Co-authored-by: Ming <redacted>
Georgi Gerganov [Sat, 4 Feb 2023 07:03:14 +0000 (09:03 +0200)]
whisper : suppress task tokens (#442)
Taisei Mima [Sat, 4 Feb 2023 06:49:15 +0000 (15:49 +0900)]
wasm : fix typo in helper.js (#459)
Alex Bacart [Sat, 4 Feb 2023 06:48:35 +0000 (09:48 +0300)]
main : CSV format export trimmed spaces fix (#444)
* Update main.cpp
Removed string trimming
* Update main.cpp
* Update main.cpp
* Revert "Update main.cpp"
This reverts commit
d8924fdcfe8a2980db0af47d42ec206fc1a3e9d0 .
* Revert "Update main.cpp"
This reverts commit
252e508d8597231c61380ebea42b3707714345cc .
Lukas Rist [Sat, 28 Jan 2023 16:44:56 +0000 (17:44 +0100)]
go : add wrapper for system info (#456)
Robin [Thu, 26 Jan 2023 23:14:20 +0000 (15:14 -0800)]
go : add WhisperLangAutoDetect method to go binding (#451)
Eric Tendian [Thu, 26 Jan 2023 23:13:31 +0000 (17:13 -0600)]
livestream.sh : run main with model arg instead of default (#453)
Actually utilizes the $model var when calling ./main.
Georgi Gerganov [Wed, 25 Jan 2023 17:59:57 +0000 (19:59 +0200)]
whisper : do not provide past prompt when n_max_text_ctx == 0
Georgi Gerganov [Wed, 25 Jan 2023 17:59:15 +0000 (19:59 +0200)]
wasm : check if navigator.storage.estimate() is available
Safari does not support it
Lukas Rist [Wed, 25 Jan 2023 16:57:30 +0000 (17:57 +0100)]
go : added wrappers to reset and print timings (#436)
Georgi Gerganov [Wed, 25 Jan 2023 16:50:50 +0000 (18:50 +0200)]
ci : run workflows on pull requests + bindings depend on .h (#446)
Ondrej Kokes [Mon, 23 Jan 2023 18:49:00 +0000 (19:49 +0100)]
whisper.wasm : add labels for easier radio selection (#435)
Georgi Gerganov [Mon, 23 Jan 2023 18:48:26 +0000 (20:48 +0200)]
whisper : condition timestamps to be monotonically increasing (#425)
fitzsim [Mon, 23 Jan 2023 18:48:10 +0000 (18:48 +0000)]
whisper : PPC64 big-endian support (#398)
* ggml : set cache line size to 128 on POWER9
* whisper : add PPC64 big endian support
Georgi Gerganov [Mon, 23 Jan 2023 18:23:44 +0000 (20:23 +0200)]
release : v1.1.1
Georgi Gerganov [Mon, 23 Jan 2023 18:19:04 +0000 (20:19 +0200)]
.gitignore : add arm_neon.h
Georgi Gerganov [Sun, 22 Jan 2023 08:47:01 +0000 (10:47 +0200)]
whisper : fix condition for providing past prompt (critical)
This bug has been present since v1.1.0.
Effectively, the past transcribed text wasn't being used for following
transcriptions, which likely significantly reduces the transcription
quality.
Likely related to #419
Georgi Gerganov [Thu, 19 Jan 2023 16:50:33 +0000 (18:50 +0200)]
whisper : extend information in whisper_print_timings()
Georgi Gerganov [Wed, 18 Jan 2023 20:52:18 +0000 (22:52 +0200)]
whisper : perform entropy check only when we have at least 32 tokens (#412)
Georgi Gerganov [Wed, 18 Jan 2023 20:51:41 +0000 (22:51 +0200)]
main : we had accidentally disabled the temperature fallback .. (#291)
Georgi Gerganov [Wed, 18 Jan 2023 19:58:55 +0000 (21:58 +0200)]
whisper.wasm : add small and small.en models
Georgi Gerganov [Wed, 18 Jan 2023 19:39:32 +0000 (21:39 +0200)]
bench : minor fixes
Chia-Hsiang Cheng [Wed, 18 Jan 2023 19:26:31 +0000 (03:26 +0800)]
main : add an option to accept optional output filenames (#424)
* Add an option to accept optional output filenames
* Format the file
Co-authored-by: Chia-Hsiang Cheng <redacted>
Georgi Gerganov [Wed, 18 Jan 2023 19:22:52 +0000 (21:22 +0200)]
stream : fix handling of --step == --length (#416)
Georgi Gerganov [Wed, 18 Jan 2023 19:00:41 +0000 (21:00 +0200)]
bench : fix Windows linkage by moving ggml benches in whisper lib ..
Georgi Gerganov [Wed, 18 Jan 2023 18:31:46 +0000 (20:31 +0200)]
bench : add memcpy and ggml_mul_mat benchmarks
Digipom [Mon, 16 Jan 2023 19:57:35 +0000 (14:57 -0500)]
whisper.android : add support for loading directly from asset in C (#415)
Georgi Gerganov [Mon, 16 Jan 2023 19:44:40 +0000 (21:44 +0200)]
whisper : fix possible uninitialized variables (#291)
Georgi Gerganov [Mon, 16 Jan 2023 17:37:06 +0000 (19:37 +0200)]
stream : fix --keep_context argument to be used correctly (#354)
Damian Czaja [Mon, 16 Jan 2023 17:18:10 +0000 (18:18 +0100)]
go : remove sample_best and sample_timestamp bindings (#409)
Georgi Gerganov [Mon, 16 Jan 2023 16:47:31 +0000 (18:47 +0200)]
Update README.md