]>
git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/log
Egor Egorov [Wed, 29 Mar 2023 20:26:39 +0000 (23:26 +0300)]
main : fix typo in JSON output (#648)
* typo in JSON output
* fix double quotes in JSON output
Jhen-Jie Hong [Wed, 29 Mar 2023 20:23:23 +0000 (04:23 +0800)]
whisper : add initial_prompt param (#645)
clach04 [Wed, 29 Mar 2023 20:11:35 +0000 (13:11 -0700)]
make : 32-bit ARM flags (#486)
* issue #470 - working 32-bit ARM
* Update Makefile
* Update Makefile
---------
Co-authored-by: Georgi Gerganov <redacted>
Jonno [Wed, 29 Mar 2023 20:04:38 +0000 (06:04 +1000)]
whisper.swiftui : update README.md (#682)
- Slight tweaks to README for improved comprehension.
Evan Jones [Wed, 29 Mar 2023 20:01:14 +0000 (16:01 -0400)]
talk-llama : add alpaca support (#668)
Georgi Gerganov [Tue, 28 Mar 2023 07:50:49 +0000 (10:50 +0300)]
whisper : bump "large" scratch buffer even mode (close #671)
Georgi Gerganov [Tue, 28 Mar 2023 07:36:16 +0000 (10:36 +0300)]
whisper : increase scratch buffers after recent change (#671)
Should fix the error:
ggml_new_tensor_impl: not enough space in the scratch memory
Georgi Gerganov [Tue, 28 Mar 2023 07:11:34 +0000 (10:11 +0300)]
talk-llama : add discussion link
Georgi Gerganov [Mon, 27 Mar 2023 18:28:00 +0000 (21:28 +0300)]
talk-llama : try to fix windows build ..
Georgi Gerganov [Mon, 27 Mar 2023 18:02:35 +0000 (21:02 +0300)]
readme : add talk-llama example to the table
Georgi Gerganov [Mon, 27 Mar 2023 18:00:32 +0000 (21:00 +0300)]
talk-llama : add new example + sync ggml from llama.cpp (#664)
* talk-llama : talk with LLaMA AI
* talk.llama : disable EOS token
* talk-llama : add README instructions
* ggml : fix build in debug
Georgi Gerganov [Wed, 22 Mar 2023 20:34:39 +0000 (22:34 +0200)]
whisper : disable fallbacks until the performance is improved (#588)
Andrew Huynh [Wed, 22 Mar 2023 20:30:40 +0000 (13:30 -0700)]
cmake : add a flag to disable F16C (#628)
jwijffels [Wed, 22 Mar 2023 20:28:22 +0000 (21:28 +0100)]
Include link to R wrapper in README (#626)
Lucas Zanek [Wed, 22 Mar 2023 20:19:22 +0000 (17:19 -0300)]
Nodejs Addon blocking main thread. Implemented Napi::AsyncWorker (#642)
* fixed blocking code on node addon
* modify the example to run async
* format
* added logic to see the whisper output
* added logic to see the whisper output
* removed extra function for more clean example
Jhen-Jie Hong [Wed, 22 Mar 2023 20:16:04 +0000 (04:16 +0800)]
whisper.objc : add `-O3 -DNDEBUG` in release mode (#640)
sandrohanea [Wed, 22 Mar 2023 19:47:09 +0000 (20:47 +0100)]
fixed language auto-detection for state provided processing (#627)
Co-authored-by: Sandro Hanea <redacted>
Jhen-Jie Hong [Wed, 22 Mar 2023 19:39:02 +0000 (03:39 +0800)]
readme : add react-native bindings (#619)
Leo Moll [Wed, 22 Mar 2023 19:37:36 +0000 (20:37 +0100)]
main : provide option for creating JSON output (#615)
* examples : provide option for exporting also as JSON file (ggerganov/whisper.cpp#614)
* main : remove leftovers
---------
Co-authored-by: Georgi Gerganov <redacted>
Kamilake [Wed, 22 Mar 2023 19:17:24 +0000 (04:17 +0900)]
models : change default encoding to utf8 (#605)
Georgi Gerganov [Wed, 22 Mar 2023 18:51:42 +0000 (20:51 +0200)]
make : fix MUSL Linux build (#576)
Georgi Gerganov [Wed, 22 Mar 2023 18:44:56 +0000 (20:44 +0200)]
models : change HF hosting from dataset to model
Takeshi Inoue [Tue, 7 Mar 2023 19:36:30 +0000 (04:36 +0900)]
whisper.android : support benchmark for Android example. (#542)
* whisper.android: Support benchmark for Android example.
* whisper.android: update screenshot in README.
* update: Make text selectable for copy & paste.
* Update whisper.h to restore API name
Co-authored-by: Georgi Gerganov <redacted>
* whisper.android: Restore original API names.
---------
Co-authored-by: tinoue <redacted>
Co-authored-by: Georgi Gerganov <redacted>
Georgi Gerganov [Mon, 6 Mar 2023 19:06:27 +0000 (21:06 +0200)]
readme : add bench-wts.sh demo
Georgi Gerganov [Mon, 6 Mar 2023 19:02:24 +0000 (21:02 +0200)]
bench-wts.sh : rename script + add execute permission
venkr [Mon, 6 Mar 2023 17:18:11 +0000 (09:18 -0800)]
qual-bench.sh : add quality comparison tool, and update main.cpp to allow using a font file (#569)
Takeshi Inoue [Mon, 6 Mar 2023 17:15:57 +0000 (02:15 +0900)]
whisper.android : enable fp16 instrinsics (FP16_VA) which is supported by ARMv8.2 or later. (#572)
sandrohanea [Sun, 5 Mar 2023 19:42:19 +0000 (20:42 +0100)]
whisper : add whisper_state + default state on the whisper_context (#523)
* Added whisper state + default state on the whisper_context
* Fixed some examples and bindings
* Fixed whisper_n_len (which was used in some binding) and added whisper_n_len_from_state
* Fixed comments
* whisper : reuse kv_cache_free() and fix compiler warnings
* whisper : clean-up the API comments
---------
Co-authored-by: Sandro Hanea <redacted>
Co-authored-by: Georgi Gerganov <redacted>
Georgi Gerganov [Sun, 5 Mar 2023 18:53:43 +0000 (20:53 +0200)]
whisper : set no_context == true by default (#537)
polarmoon [Sun, 5 Mar 2023 18:50:25 +0000 (10:50 -0800)]
go : NewContext now returns a clean context (#537)
Co-authored-by: Ming <redacted>
HY. Kelvin Lee [Thu, 2 Mar 2023 16:32:16 +0000 (11:32 -0500)]
main : add csv header (#552)
Georgi Gerganov [Tue, 28 Feb 2023 21:27:54 +0000 (23:27 +0200)]
make : add -DNDEBUG compile flag
Georgi Gerganov [Tue, 28 Feb 2023 20:29:12 +0000 (22:29 +0200)]
release : v1.2.1
FlippFuzz [Mon, 27 Feb 2023 19:04:16 +0000 (03:04 +0800)]
make : add "-mcpu=native" when building for aarch64 (#532)
Aaron Pham [Mon, 27 Feb 2023 19:02:11 +0000 (11:02 -0800)]
readme : add pybind11 bindings (#538)
Georgi Gerganov [Fri, 24 Feb 2023 06:46:06 +0000 (08:46 +0200)]
readme : add cython bindings (#9)
Georgi Gerganov [Tue, 21 Feb 2023 17:00:42 +0000 (19:00 +0200)]
whisper : zero-initialize some more context variables
Just in case
Finn Voorhees [Tue, 21 Feb 2023 11:42:10 +0000 (11:42 +0000)]
whisper : fix uninitialized exp_n_audio_ctx
Georgi Gerganov [Sun, 19 Feb 2023 16:35:01 +0000 (18:35 +0200)]
whisper : add API for applying custom logits filters during decoding
Georgi Gerganov [Sat, 18 Feb 2023 07:42:31 +0000 (09:42 +0200)]
yt-wsp.sh : print help on empty args
Georgi Gerganov [Wed, 15 Feb 2023 19:48:49 +0000 (21:48 +0200)]
whisper : by default disable non-speech tokens suppression (#473)
This seems to be causing hallucinations in the end of the audio, e.g.:
"Thank you for listening"
"Amen"
..
Georgi Gerganov [Wed, 15 Feb 2023 17:51:54 +0000 (19:51 +0200)]
readme : add Ruby discussion + update .NET discussion
Todd [Wed, 15 Feb 2023 17:46:55 +0000 (12:46 -0500)]
bindings : add Ruby (#500)
* adding ruby bindings
* avoid adding these they are copied in via extconf.rb
* ignore these files here
* add definitions for boolean params
* initial transcribe for ruby
* use en model and transcribe jfk with assertion
* possibly this works for building ruby binding
* ci : try to add ruby workflow
---------
Co-authored-by: Georgi Gerganov <redacted>
conradg [Wed, 15 Feb 2023 17:31:16 +0000 (17:31 +0000)]
main : fix std in input (#503)
if we don't add this as an explicit check, then we get an "error: unknown argument: -" later on
Georgi Gerganov [Wed, 15 Feb 2023 17:28:10 +0000 (19:28 +0200)]
examples : refactor in order to reuse code and reduce duplication (#482)
* examples : refactor common code into a library
* examples : refactor common SDL code into a library
* make : update Makefile to use common libs
* common : fix MSVC M_PI ..
* addon.node : link common lib
shikokuchuo [Wed, 15 Feb 2023 17:08:25 +0000 (17:08 +0000)]
whisper : fix signedness compiler warning (#506)
genevera (she/her) [Tue, 14 Feb 2023 18:12:51 +0000 (13:12 -0500)]
yt-wsp.sh : add unique filename generation (#495)
Co-authored-by: genevera <redacted>
Georgi Gerganov [Tue, 14 Feb 2023 18:04:03 +0000 (20:04 +0200)]
readme : add another .NET repo (#303)
Georgi Gerganov [Sat, 11 Feb 2023 15:35:33 +0000 (17:35 +0200)]
readme : add .NET repo (#303)
Avik Sengupta [Sat, 11 Feb 2023 07:13:32 +0000 (07:13 +0000)]
cmake : install whisper.h header (#485)
Including the header file in the install bundle helps projects that ship binaries.
shibukazu [Wed, 8 Feb 2023 07:05:34 +0000 (16:05 +0900)]
whisper : suppress non-speech-related token outputs (#473)
* add non-speech-token suppression
* add suppress non-speech_tokens param
sandrohanea [Wed, 8 Feb 2023 07:01:47 +0000 (08:01 +0100)]
whisper : fixed Beam Search Strategy and exposed whisper_pcm_to_mel_phase_vocoder (#474)
Co-authored-by: Sandro Hanea <redacted>
boolemancer [Wed, 8 Feb 2023 06:43:23 +0000 (22:43 -0800)]
whisper : only trim if split_on_word is true (#476)
Qianhe Chen [Sun, 5 Feb 2023 13:02:08 +0000 (21:02 +0800)]
ci : add node addon test and optimize compilation configuration (#468)
* addon: implement node addon call whisper through cpp
* addon: modify the license to MIT
* addon: remove iostream
* addon: rename dir
* addon: fix typo
* addon: configure cmake to build when cmake-js is used
* ci: add addon.node test ci
* addon: remove build WHISPER_BUILD_TESTS
* addon: update build command
* addon: add test
* addon: add test file
* addon: adapt to compile on Windows
* addon: fix typo
* addon: reuse jfk.wav
Co-authored-by: Georgi Gerganov <redacted>
* addon: reuse jfk.wav
---------
Co-authored-by: Georgi Gerganov <redacted>
kamranjon [Sun, 5 Feb 2023 12:46:26 +0000 (04:46 -0800)]
whisper : add whisper_full_lang_id() for getting the context lang (#461)
Matija Pevec [Sun, 5 Feb 2023 12:44:23 +0000 (13:44 +0100)]
whisper : add "split_on_word" flag when using using "max_len" option (#455)
* Update whisper.cpp
* fix: trim function
* feat: added flag to split on word
* fix: arguments for main
Georgi Gerganov [Sat, 4 Feb 2023 07:49:49 +0000 (09:49 +0200)]
release : v1.2.0
Georgi Gerganov [Sat, 4 Feb 2023 07:45:52 +0000 (09:45 +0200)]
whisper : reduce memory usage during inference (#431)
* ggml : add "scratch" buffer support
* ggml : support for scratch ring-buffer
* ggml : bug fix in ggml_repeat()
* ggml : error on scratch buffer overflow
* whisper : use scratch buffers during inference (base model only)
* whisper : update memory usage for all models
* whisper : fix encoder memory usage
* whisper : use whisper_context functions instead of macros
* whisper : fix FF + remove it from README
* ggml : reuse ggml_new_i32
* ggml : refactor the scratch buffer storage
* whisper : reorder scratch buffers in the decoder
* main : add option to disable temp fallback
* Update README.md
Qianhe Chen [Sat, 4 Feb 2023 07:10:25 +0000 (15:10 +0800)]
addon.node : using whisper as a Node.js addon (#443)
* addon: implement node addon call whisper through cpp
* addon: modify the license to MIT
* addon: remove iostream
* addon: rename dir
* addon: fix typo
* addon: configure cmake to build when cmake-js is used
polarmoon [Sat, 4 Feb 2023 07:09:27 +0000 (23:09 -0800)]
go : support "auto" as an option when set language (#462)
Co-authored-by: Ming <redacted>
Georgi Gerganov [Sat, 4 Feb 2023 07:03:14 +0000 (09:03 +0200)]
whisper : suppress task tokens (#442)
Taisei Mima [Sat, 4 Feb 2023 06:49:15 +0000 (15:49 +0900)]
wasm : fix typo in helper.js (#459)
Alex Bacart [Sat, 4 Feb 2023 06:48:35 +0000 (09:48 +0300)]
main : CSV format export trimmed spaces fix (#444)
* Update main.cpp
Removed string trimming
* Update main.cpp
* Update main.cpp
* Revert "Update main.cpp"
This reverts commit
d8924fdcfe8a2980db0af47d42ec206fc1a3e9d0.
* Revert "Update main.cpp"
This reverts commit
252e508d8597231c61380ebea42b3707714345cc.
Lukas Rist [Sat, 28 Jan 2023 16:44:56 +0000 (17:44 +0100)]
go : add wrapper for system info (#456)
Robin [Thu, 26 Jan 2023 23:14:20 +0000 (15:14 -0800)]
go : add WhisperLangAutoDetect method to go binding (#451)
Eric Tendian [Thu, 26 Jan 2023 23:13:31 +0000 (17:13 -0600)]
livestream.sh : run main with model arg instead of default (#453)
Actually utilizes the $model var when calling ./main.
Georgi Gerganov [Wed, 25 Jan 2023 17:59:57 +0000 (19:59 +0200)]
whisper : do not provide past prompt when n_max_text_ctx == 0
Georgi Gerganov [Wed, 25 Jan 2023 17:59:15 +0000 (19:59 +0200)]
wasm : check if navigator.storage.estimate() is available
Safari does not support it
Lukas Rist [Wed, 25 Jan 2023 16:57:30 +0000 (17:57 +0100)]
go : added wrappers to reset and print timings (#436)
Georgi Gerganov [Wed, 25 Jan 2023 16:50:50 +0000 (18:50 +0200)]
ci : run workflows on pull requests + bindings depend on .h (#446)
Ondrej Kokes [Mon, 23 Jan 2023 18:49:00 +0000 (19:49 +0100)]
whisper.wasm : add labels for easier radio selection (#435)
Georgi Gerganov [Mon, 23 Jan 2023 18:48:26 +0000 (20:48 +0200)]
whisper : condition timestamps to be monotonically increasing (#425)
fitzsim [Mon, 23 Jan 2023 18:48:10 +0000 (18:48 +0000)]
whisper : PPC64 big-endian support (#398)
* ggml : set cache line size to 128 on POWER9
* whisper : add PPC64 big endian support
Georgi Gerganov [Mon, 23 Jan 2023 18:23:44 +0000 (20:23 +0200)]
release : v1.1.1
Georgi Gerganov [Mon, 23 Jan 2023 18:19:04 +0000 (20:19 +0200)]
.gitignore : add arm_neon.h
Georgi Gerganov [Sun, 22 Jan 2023 08:47:01 +0000 (10:47 +0200)]
whisper : fix condition for providing past prompt (critical)
This bug has been present since v1.1.0.
Effectively, the past transcribed text wasn't being used for following
transcriptions, which likely significantly reduces the transcription
quality.
Likely related to #419
Georgi Gerganov [Thu, 19 Jan 2023 16:50:33 +0000 (18:50 +0200)]
whisper : extend information in whisper_print_timings()
Georgi Gerganov [Wed, 18 Jan 2023 20:52:18 +0000 (22:52 +0200)]
whisper : perform entropy check only when we have at least 32 tokens (#412)
Georgi Gerganov [Wed, 18 Jan 2023 20:51:41 +0000 (22:51 +0200)]
main : we had accidentally disabled the temperature fallback .. (#291)
Georgi Gerganov [Wed, 18 Jan 2023 19:58:55 +0000 (21:58 +0200)]
whisper.wasm : add small and small.en models
Georgi Gerganov [Wed, 18 Jan 2023 19:39:32 +0000 (21:39 +0200)]
bench : minor fixes
Chia-Hsiang Cheng [Wed, 18 Jan 2023 19:26:31 +0000 (03:26 +0800)]
main : add an option to accept optional output filenames (#424)
* Add an option to accept optional output filenames
* Format the file
Co-authored-by: Chia-Hsiang Cheng <redacted>
Georgi Gerganov [Wed, 18 Jan 2023 19:22:52 +0000 (21:22 +0200)]
stream : fix handling of --step == --length (#416)
Georgi Gerganov [Wed, 18 Jan 2023 19:00:41 +0000 (21:00 +0200)]
bench : fix Windows linkage by moving ggml benches in whisper lib ..
Georgi Gerganov [Wed, 18 Jan 2023 18:31:46 +0000 (20:31 +0200)]
bench : add memcpy and ggml_mul_mat benchmarks
Digipom [Mon, 16 Jan 2023 19:57:35 +0000 (14:57 -0500)]
whisper.android : add support for loading directly from asset in C (#415)
Georgi Gerganov [Mon, 16 Jan 2023 19:44:40 +0000 (21:44 +0200)]
whisper : fix possible uninitialized variables (#291)
Georgi Gerganov [Mon, 16 Jan 2023 17:37:06 +0000 (19:37 +0200)]
stream : fix --keep_context argument to be used correctly (#354)
Damian Czaja [Mon, 16 Jan 2023 17:18:10 +0000 (18:18 +0100)]
go : remove sample_best and sample_timestamp bindings (#409)
Georgi Gerganov [Mon, 16 Jan 2023 16:47:31 +0000 (18:47 +0200)]
Update README.md
Georgi Gerganov [Sun, 15 Jan 2023 12:22:25 +0000 (14:22 +0200)]
cmake : bump version to 1.1.0
Georgi Gerganov [Sun, 15 Jan 2023 12:08:12 +0000 (14:08 +0200)]
Update README.md
Georgi Gerganov [Sun, 15 Jan 2023 10:41:42 +0000 (12:41 +0200)]
whisper : account speed_up flag for short audio (close #405)
Georgi Gerganov [Sun, 15 Jan 2023 09:36:20 +0000 (11:36 +0200)]
Update README.md
Georgi Gerganov [Sun, 15 Jan 2023 09:34:03 +0000 (11:34 +0200)]
bench.wasm : print system info
Georgi Gerganov [Sun, 15 Jan 2023 09:29:57 +0000 (11:29 +0200)]
Improve decoding (#291)
* whisper : prepare infra for new decoding strategies
* whisper : apply logit filters and compute logprobs
* whisper : add whisper_get_logits()
* whisper : separate self and cross attention memory
Initial step needed for supporting parallel decoders
* whisper : move probs_id buffer to whisper_context
* whisper : refactor kv cache into separate struct
* whisper : move self-attention kv cache to whisper_decoder
* whisper : wip decoding parameters + strategies
* whisper : wip decoding parameters + strategies (part 2)
* whisper : wip decoding parameters + strategies (part 3)
* whisper : wip decoding parameters + strategies (part 4)
* whisper : fix prompt_past update to not include prompt_init
* whisper : temperature + best_of support
* whisper : support for compression_ration_threshold
We actually use entropy, but it is similar
* command : fix example to use logits instead of obsolete probs
* whisper : handle empty sequence ranking
* whisper : add WHISPER_DEBUG + diagnostic prints + new main args
* whisper : minor fixes
* whisper : add beam-search support
* whisper : bug fix when there no previous context
* whisper : add comments
* stream : disable temperature fallback
For real-time processing, we always want a single decoder running at T=0
* whisper.swiftui : update example - fix paths + add empty folders
Georgi Gerganov [Sat, 14 Jan 2023 17:20:47 +0000 (19:20 +0200)]
stream : fix a bug that inserted a lot of empty audio at the start
The quality was terrible due to this
Georgi Gerganov [Sun, 8 Jan 2023 10:35:56 +0000 (12:35 +0200)]
ggml : remove obsolete zeroing + comment fixes (#390)
Ian Bicking [Sun, 8 Jan 2023 18:18:41 +0000 (12:18 -0600)]
readme : clarify main and stream usage (#391)
Give an example of ./main that uses a sample file that's already there, and make the stream example clarify you need `make stream`
Abitofevrything [Sun, 8 Jan 2023 18:06:09 +0000 (19:06 +0100)]
ggml : correct behaviour of ggml_vec_sum_f32 (#390)