]>
git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/log
Georgi Gerganov [Thu, 27 Oct 2022 15:31:49 +0000 (18:31 +0300)]
Add OpenBLAS support
Supported via CMake - just add:
cmake .. -DWHISPER_SUPPORT_OPENBLAS=ON
On Ubuntu, you have to install the library like this:
apt install libopenblas-dev
Unfortunately, I don't observe any benefit compared to the
original AVX2 + FP16 implementation. Maybe I'm missing something
Georgi Gerganov [Thu, 27 Oct 2022 14:22:10 +0000 (17:22 +0300)]
Print system info at start of program
Mikhail Grigorev [Thu, 27 Oct 2022 09:59:02 +0000 (14:59 +0500)]
Fixed compile definitions and link libraries for MSVC
Georgi Gerganov [Wed, 26 Oct 2022 20:19:58 +0000 (23:19 +0300)]
Add helper script to benchmark all models
Simply run:
$ ./extra/bench-all.sh
Georgi Gerganov [Wed, 26 Oct 2022 19:54:09 +0000 (22:54 +0300)]
Print system info in main
Georgi Gerganov [Wed, 26 Oct 2022 15:14:10 +0000 (18:14 +0300)]
Create README.md
andypayne [Wed, 26 Oct 2022 00:35:11 +0000 (17:35 -0700)]
Changes to work by default on macOS - use curl when wget is not available, and use an alternative method to get the script path when realpath is not available.
Georgi Gerganov [Tue, 25 Oct 2022 17:51:56 +0000 (20:51 +0300)]
Update README.md
Georgi Gerganov [Tue, 25 Oct 2022 17:47:31 +0000 (20:47 +0300)]
Update README.md
Georgi Gerganov [Tue, 25 Oct 2022 17:43:10 +0000 (20:43 +0300)]
Update README.md
Georgi Gerganov [Tue, 25 Oct 2022 17:28:47 +0000 (20:28 +0300)]
Update README.md
Georgi Gerganov [Tue, 25 Oct 2022 17:25:23 +0000 (20:25 +0300)]
Update README.md
Georgi Gerganov [Tue, 25 Oct 2022 17:23:39 +0000 (20:23 +0300)]
Update README.md
Georgi Gerganov [Tue, 25 Oct 2022 17:18:26 +0000 (20:18 +0300)]
ggml : add system info functions
Georgi Gerganov [Tue, 25 Oct 2022 16:13:08 +0000 (19:13 +0300)]
refactoring : move main + stream in examples + other stuff
Georgi Gerganov [Mon, 24 Oct 2022 15:28:23 +0000 (18:28 +0300)]
main : fix SRT timestamp to use comma "," instead of dot "."
Georgi Gerganov [Mon, 24 Oct 2022 15:26:21 +0000 (18:26 +0300)]
Update README.md
Georgi Gerganov [Sun, 23 Oct 2022 09:51:09 +0000 (12:51 +0300)]
Update README.md
Georgi Gerganov [Sun, 23 Oct 2022 09:24:03 +0000 (12:24 +0300)]
objc : polishing the sample application
Georgi Gerganov [Sun, 23 Oct 2022 08:36:36 +0000 (11:36 +0300)]
Create README.md
Georgi Gerganov [Sun, 23 Oct 2022 08:10:15 +0000 (11:10 +0300)]
ios : whisper.objc example
Georgi Gerganov [Sun, 23 Oct 2022 10:29:36 +0000 (13:29 +0300)]
ref #68, #79 : fix segment time output
Georgi Gerganov [Sun, 23 Oct 2022 09:47:51 +0000 (12:47 +0300)]
Update README.md
Georgi Gerganov [Sun, 23 Oct 2022 09:23:04 +0000 (12:23 +0300)]
Merge pull request #78 from jokkebk/Specify-utf8-for-vocab.json
Add enconding parameter to vocab.json opening to fix errors
Joonas Pihlajamaa [Sun, 23 Oct 2022 08:55:01 +0000 (11:55 +0300)]
Add enconding parameter to vocab.json opening to fix errors
Georgi Gerganov [Sun, 23 Oct 2022 07:24:36 +0000 (10:24 +0300)]
Update README.md
Georgi Gerganov [Sun, 23 Oct 2022 07:12:10 +0000 (10:12 +0300)]
Update README.md
Georgi Gerganov [Sun, 23 Oct 2022 05:04:33 +0000 (08:04 +0300)]
Update README.md
Georgi Gerganov [Sat, 22 Oct 2022 18:16:08 +0000 (21:16 +0300)]
Update README.md
Georgi Gerganov [Sat, 22 Oct 2022 18:09:30 +0000 (21:09 +0300)]
main : print colors + no timestamps
Georgi Gerganov [Sat, 22 Oct 2022 18:06:50 +0000 (21:06 +0300)]
whisper : add new-segment callback
Can be used to process new segments as they are being generated.
Sample usage in main, for printing the resulting segments during the
inference.
Georgi Gerganov [Sat, 22 Oct 2022 17:42:11 +0000 (20:42 +0300)]
main : refactor subtitle output
Georgi Gerganov [Fri, 21 Oct 2022 14:33:59 +0000 (17:33 +0300)]
wip : experimental color coding of tokens based on probabilities
Georgi Gerganov [Sat, 22 Oct 2022 16:30:35 +0000 (19:30 +0300)]
Update README.md
Georgi Gerganov [Sat, 22 Oct 2022 16:00:25 +0000 (19:00 +0300)]
Update README.md
Georgi Gerganov [Sat, 22 Oct 2022 15:17:08 +0000 (18:17 +0300)]
Update README.md and finalize the whisper.wasm example
Georgi Gerganov [Sat, 22 Oct 2022 13:57:54 +0000 (16:57 +0300)]
wip : polishing WASM example
Georgi Gerganov [Sat, 22 Oct 2022 10:45:43 +0000 (13:45 +0300)]
wip : WASM 128-bit SIMD support
Georgi Gerganov [Sat, 22 Oct 2022 10:07:59 +0000 (13:07 +0300)]
wip : initial WASM port
Works but it is very slow because no SIMD is used.
For example, jfk.wav is processed in ~23 seconds using "tiny.en" model
Georgi Gerganov [Fri, 21 Oct 2022 15:14:53 +0000 (18:14 +0300)]
ref #68 : add option "-on" to specify segment index offset for SRT
Also, change option "-o" to "-ot"
Georgi Gerganov [Fri, 21 Oct 2022 12:57:20 +0000 (15:57 +0300)]
ci : fix and re-enable tests (2nd try)
Georgi Gerganov [Fri, 21 Oct 2022 12:36:19 +0000 (15:36 +0300)]
Revert "ci : fix and re-enable tests"
This reverts commit
80aefc9514262a026e013902fa3722e53181cd82 .
Georgi Gerganov [Fri, 21 Oct 2022 12:27:30 +0000 (15:27 +0300)]
ci : fix and re-enable tests
Georgi Gerganov [Thu, 20 Oct 2022 14:52:59 +0000 (17:52 +0300)]
Update README.md
Georgi Gerganov [Thu, 20 Oct 2022 14:43:56 +0000 (17:43 +0300)]
Update README.md
Georgi Gerganov [Thu, 20 Oct 2022 14:39:31 +0000 (17:39 +0300)]
Update README.md
Georgi Gerganov [Wed, 19 Oct 2022 06:29:43 +0000 (09:29 +0300)]
Merge pull request #67 from undefdev/defensive-apple-arm-make
added handling for falsely as x86_64 announced ARM Macs
Georgi Gerganov [Wed, 19 Oct 2022 05:36:59 +0000 (08:36 +0300)]
ci : disable tests temporarily
undef [Tue, 18 Oct 2022 23:01:53 +0000 (01:01 +0200)]
added handling for falsely as x86_64 announced ARM Macs
Georgi Gerganov [Tue, 18 Oct 2022 19:20:17 +0000 (22:20 +0300)]
Update README.md
Georgi Gerganov [Tue, 18 Oct 2022 17:15:06 +0000 (20:15 +0300)]
stream : print warning when processing is not fast enough
Georgi Gerganov [Tue, 18 Oct 2022 16:48:34 +0000 (19:48 +0300)]
Add reference to FP16 repo
Georgi Gerganov [Tue, 18 Oct 2022 16:33:10 +0000 (19:33 +0300)]
ref #52 : improve greedy sampling strategy
Force timestamp token to be sampled if the probability sum over all
timestamp tokens is above the probability of any other token
Georgi Gerganov [Tue, 18 Oct 2022 15:51:59 +0000 (18:51 +0300)]
CMake support for Accelerate framework
Georgi Gerganov [Tue, 18 Oct 2022 15:42:08 +0000 (18:42 +0300)]
Correct implementation of FP16 GELU
Can toggle it via the GGML_GELU_FP16 macro
Georgi Gerganov [Tue, 18 Oct 2022 15:27:57 +0000 (18:27 +0300)]
close #32 : add comment about thread-safety of the C-style API
Georgi Gerganov [Tue, 18 Oct 2022 15:17:24 +0000 (18:17 +0300)]
ref #57, #62, #63 : remove unions in C-api + remove designated initializers
We are not ready for designated initializers - many compilers do not
support this C++ feature yet, so removing it's non-trivial usages.
Georgi Gerganov [Tue, 18 Oct 2022 07:56:37 +0000 (10:56 +0300)]
ref #62 : fix build for some compilers
For some reason, new version of GCC panic when the struct type is not
specified explicitly
Georgi Gerganov [Tue, 18 Oct 2022 05:26:16 +0000 (08:26 +0300)]
Revert recent sampling change
It does not actually help and seems to produce worse results on some of
the samples
Georgi Gerganov [Mon, 17 Oct 2022 21:53:06 +0000 (00:53 +0300)]
Fix EOT token handling
If it is the end of the audio, pick all sampled tokens.
Otherwise, print error message.
Georgi Gerganov [Mon, 17 Oct 2022 21:45:08 +0000 (00:45 +0300)]
Revert GELU change
Seems it does not work on x86 for some reason
Georgi Gerganov [Mon, 17 Oct 2022 21:01:31 +0000 (00:01 +0300)]
Link Accelerate framework to "stream" example
Georgi Gerganov [Mon, 17 Oct 2022 20:52:24 +0000 (23:52 +0300)]
Try to improve the sampling strategy a bit
It sill fails sometimes when it does not sample a timestamp token for
the entire segment. We now print a message in such cases
Georgi Gerganov [Mon, 17 Oct 2022 20:50:44 +0000 (23:50 +0300)]
Fix memory sizes
Georgi Gerganov [Mon, 17 Oct 2022 18:44:16 +0000 (21:44 +0300)]
Use Accelerate framework on Apple silicon
Huge performance improvement in the Encode (almost x2 on MacBook M1 Pro)
Also various extra optimizations:
- Multi-threaded NORM operator
- Faster GELU via F16 cast
Georgi Gerganov [Mon, 17 Oct 2022 15:37:06 +0000 (18:37 +0300)]
Adding helper script for converting the PT models
Georgi Gerganov [Mon, 17 Oct 2022 15:09:17 +0000 (18:09 +0300)]
close #56 : build on FreeBSD
Thanks to @abelbabel for the contribution
Georgi Gerganov [Mon, 17 Oct 2022 15:06:02 +0000 (18:06 +0300)]
Merge pull request #58 from r0y6a3n0/master
fix decode missing token issue
RyanChang [Mon, 17 Oct 2022 13:19:45 +0000 (21:19 +0800)]
fix missing token issue
Georgi Gerganov [Sat, 15 Oct 2022 06:55:28 +0000 (09:55 +0300)]
ref #48 : clear results at the start of whisper_full
This way, even if the input audio is empty, the previous results will be
removed.
Georgi Gerganov [Sat, 15 Oct 2022 06:40:08 +0000 (09:40 +0300)]
Update README.md (ref #50)
Georgi Gerganov [Wed, 12 Oct 2022 18:25:42 +0000 (21:25 +0300)]
Update README.md
0/0 [Tue, 11 Oct 2022 17:49:48 +0000 (11:49 -0600)]
check if spectogram length is <100 before doing anything else
fixes #39
Topping1 [Wed, 12 Oct 2022 04:32:14 +0000 (23:32 -0500)]
Update README.md (#43)
* Update README.md
Updated README.md to list new features, such as subtitle file support (VTT and SRT)
* Update README.md
Co-authored-by: Georgi Gerganov <redacted>
Co-authored-by: Georgi Gerganov <redacted>
Georgi Gerganov [Wed, 12 Oct 2022 04:31:41 +0000 (07:31 +0300)]
Merge pull request #42 from iboB/msvc-build
ref #5 : MSVC build
Borislav Stanimirov [Tue, 11 Oct 2022 17:57:52 +0000 (20:57 +0300)]
Building with MSVC
Borislav Stanimirov [Tue, 11 Oct 2022 17:57:33 +0000 (20:57 +0300)]
Visual Studio ignored dirs
Georgi Gerganov [Mon, 10 Oct 2022 21:36:32 +0000 (00:36 +0300)]
Update README.md
Georgi Gerganov [Mon, 10 Oct 2022 19:16:25 +0000 (22:16 +0300)]
Update README.md
Georgi Gerganov [Mon, 10 Oct 2022 19:06:18 +0000 (22:06 +0300)]
stream : improve real-time transcription
Georgi Gerganov [Mon, 10 Oct 2022 19:06:03 +0000 (22:06 +0300)]
Minor
Georgi Gerganov [Mon, 10 Oct 2022 19:05:37 +0000 (22:05 +0300)]
Update README.md
Georgi Gerganov [Mon, 10 Oct 2022 06:13:31 +0000 (09:13 +0300)]
Merge pull request #36 from Topping1/master
Fix SRT timestamp format from mm:ss.sss to hh:mm:ss.sss
Georgi Gerganov [Mon, 10 Oct 2022 05:11:18 +0000 (08:11 +0300)]
ref #35 : add <stdbool.h> to whisper.h
"bool" type is not implicitly defined for some compilers.
Georgi Gerganov [Mon, 10 Oct 2022 05:05:57 +0000 (08:05 +0300)]
Merge pull request #34 from tazz4843/master
Add static library make target
Topping1 [Mon, 10 Oct 2022 04:35:10 +0000 (23:35 -0500)]
Update main.cpp
0/0 [Mon, 10 Oct 2022 01:16:42 +0000 (19:16 -0600)]
add static library make target
Georgi Gerganov [Sun, 9 Oct 2022 14:52:46 +0000 (17:52 +0300)]
Merge pull request #31 from lkwq007/master
Add MinGW support
lnyan [Sun, 9 Oct 2022 14:26:37 +0000 (22:26 +0800)]
Add MinGW support
Georgi Gerganov [Sat, 8 Oct 2022 15:13:26 +0000 (18:13 +0300)]
Minor
Georgi Gerganov [Sat, 8 Oct 2022 15:09:56 +0000 (18:09 +0300)]
ref #9 : add API documentation in whisper.h
Georgi Gerganov [Sat, 8 Oct 2022 14:35:55 +0000 (17:35 +0300)]
Fix Makefile for MacBook Intel
Georgi Gerganov [Sat, 8 Oct 2022 14:28:06 +0000 (17:28 +0300)]
ref #17 : print whisper logs to stderr
Only the transcribed/translted text is printed to stdout.
This way, one can redirect the result to a file.
Georgi Gerganov [Sat, 8 Oct 2022 14:22:22 +0000 (17:22 +0300)]
ref #17 : add options to output result to file
Support for:
- plain text
- VTT
- SRT
Georgi Gerganov [Sat, 8 Oct 2022 08:46:34 +0000 (11:46 +0300)]
Update README.md
Georgi Gerganov [Sat, 8 Oct 2022 08:34:20 +0000 (11:34 +0300)]
Update tests
Georgi Gerganov [Sat, 8 Oct 2022 08:17:41 +0000 (11:17 +0300)]
ci : add base model tests to GH Actions
Georgi Gerganov [Sat, 8 Oct 2022 08:17:29 +0000 (11:17 +0300)]
Update README.md
Georgi Gerganov [Sat, 8 Oct 2022 08:16:37 +0000 (11:16 +0300)]
Create README.md
Georgi Gerganov [Sat, 8 Oct 2022 07:57:42 +0000 (10:57 +0300)]
Adding dummy models for testing purposes