]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/log
pkg/ggml/sources/whisper.cpp
2 years agoios : whisper.objc example
Georgi Gerganov [Sun, 23 Oct 2022 08:10:15 +0000 (11:10 +0300)]
ios : whisper.objc example

2 years agoref #68, #79 : fix segment time output
Georgi Gerganov [Sun, 23 Oct 2022 10:29:36 +0000 (13:29 +0300)]
ref #68, #79 : fix segment time output

2 years agoUpdate README.md
Georgi Gerganov [Sun, 23 Oct 2022 09:47:51 +0000 (12:47 +0300)]
Update README.md

2 years agoMerge pull request #78 from jokkebk/Specify-utf8-for-vocab.json
Georgi Gerganov [Sun, 23 Oct 2022 09:23:04 +0000 (12:23 +0300)]
Merge pull request #78 from jokkebk/Specify-utf8-for-vocab.json

Add enconding parameter to vocab.json opening to fix errors

2 years agoAdd enconding parameter to vocab.json opening to fix errors
Joonas Pihlajamaa [Sun, 23 Oct 2022 08:55:01 +0000 (11:55 +0300)]
Add enconding parameter to vocab.json opening to fix errors

2 years agoUpdate README.md
Georgi Gerganov [Sun, 23 Oct 2022 07:24:36 +0000 (10:24 +0300)]
Update README.md

2 years agoUpdate README.md
Georgi Gerganov [Sun, 23 Oct 2022 07:12:10 +0000 (10:12 +0300)]
Update README.md

2 years agoUpdate README.md
Georgi Gerganov [Sun, 23 Oct 2022 05:04:33 +0000 (08:04 +0300)]
Update README.md

2 years agoUpdate README.md
Georgi Gerganov [Sat, 22 Oct 2022 18:16:08 +0000 (21:16 +0300)]
Update README.md

2 years agomain : print colors + no timestamps
Georgi Gerganov [Sat, 22 Oct 2022 18:09:30 +0000 (21:09 +0300)]
main : print colors + no timestamps

2 years agowhisper : add new-segment callback
Georgi Gerganov [Sat, 22 Oct 2022 18:06:50 +0000 (21:06 +0300)]
whisper : add new-segment callback

Can be used to process new segments as they are being generated.
Sample usage in main, for printing the resulting segments during the
inference.

2 years agomain : refactor subtitle output
Georgi Gerganov [Sat, 22 Oct 2022 17:42:11 +0000 (20:42 +0300)]
main : refactor subtitle output

2 years agowip : experimental color coding of tokens based on probabilities
Georgi Gerganov [Fri, 21 Oct 2022 14:33:59 +0000 (17:33 +0300)]
wip : experimental color coding of tokens based on probabilities

2 years agoUpdate README.md
Georgi Gerganov [Sat, 22 Oct 2022 16:30:35 +0000 (19:30 +0300)]
Update README.md

2 years agoUpdate README.md
Georgi Gerganov [Sat, 22 Oct 2022 16:00:25 +0000 (19:00 +0300)]
Update README.md

2 years agoUpdate README.md and finalize the whisper.wasm example
Georgi Gerganov [Sat, 22 Oct 2022 15:17:08 +0000 (18:17 +0300)]
Update README.md and finalize the whisper.wasm example

2 years agowip : polishing WASM example
Georgi Gerganov [Sat, 22 Oct 2022 13:57:54 +0000 (16:57 +0300)]
wip : polishing WASM example

2 years agowip : WASM 128-bit SIMD support
Georgi Gerganov [Sat, 22 Oct 2022 10:45:43 +0000 (13:45 +0300)]
wip : WASM 128-bit SIMD support

2 years agowip : initial WASM port
Georgi Gerganov [Sat, 22 Oct 2022 10:07:59 +0000 (13:07 +0300)]
wip : initial WASM port

Works but it is very slow because no SIMD is used.
For example, jfk.wav is processed in ~23 seconds using "tiny.en" model

2 years agoref #68 : add option "-on" to specify segment index offset for SRT
Georgi Gerganov [Fri, 21 Oct 2022 15:14:53 +0000 (18:14 +0300)]
ref #68 : add option "-on" to specify segment index offset for SRT

Also, change option "-o" to "-ot"

2 years agoci : fix and re-enable tests (2nd try)
Georgi Gerganov [Fri, 21 Oct 2022 12:57:20 +0000 (15:57 +0300)]
ci : fix and re-enable tests (2nd try)

2 years agoRevert "ci : fix and re-enable tests"
Georgi Gerganov [Fri, 21 Oct 2022 12:36:19 +0000 (15:36 +0300)]
Revert "ci : fix and re-enable tests"

This reverts commit 80aefc9514262a026e013902fa3722e53181cd82.

2 years agoci : fix and re-enable tests
Georgi Gerganov [Fri, 21 Oct 2022 12:27:30 +0000 (15:27 +0300)]
ci : fix and re-enable tests

2 years agoUpdate README.md
Georgi Gerganov [Thu, 20 Oct 2022 14:52:59 +0000 (17:52 +0300)]
Update README.md

2 years agoUpdate README.md
Georgi Gerganov [Thu, 20 Oct 2022 14:43:56 +0000 (17:43 +0300)]
Update README.md

2 years agoUpdate README.md
Georgi Gerganov [Thu, 20 Oct 2022 14:39:31 +0000 (17:39 +0300)]
Update README.md

2 years agoMerge pull request #67 from undefdev/defensive-apple-arm-make
Georgi Gerganov [Wed, 19 Oct 2022 06:29:43 +0000 (09:29 +0300)]
Merge pull request #67 from undefdev/defensive-apple-arm-make

added handling for falsely as x86_64 announced ARM Macs

2 years agoci : disable tests temporarily
Georgi Gerganov [Wed, 19 Oct 2022 05:36:59 +0000 (08:36 +0300)]
ci : disable tests temporarily

2 years agoadded handling for falsely as x86_64 announced ARM Macs
undef [Tue, 18 Oct 2022 23:01:53 +0000 (01:01 +0200)]
added handling for falsely as x86_64 announced ARM Macs

2 years agoUpdate README.md
Georgi Gerganov [Tue, 18 Oct 2022 19:20:17 +0000 (22:20 +0300)]
Update README.md

2 years agostream : print warning when processing is not fast enough
Georgi Gerganov [Tue, 18 Oct 2022 17:15:06 +0000 (20:15 +0300)]
stream : print warning when processing is not fast enough

2 years agoAdd reference to FP16 repo
Georgi Gerganov [Tue, 18 Oct 2022 16:48:34 +0000 (19:48 +0300)]
Add reference to FP16 repo

2 years agoref #52 : improve greedy sampling strategy
Georgi Gerganov [Tue, 18 Oct 2022 16:33:10 +0000 (19:33 +0300)]
ref #52 : improve greedy sampling strategy

Force timestamp token to be sampled if the probability sum over all
timestamp tokens is above the probability of any other token

2 years agoCMake support for Accelerate framework
Georgi Gerganov [Tue, 18 Oct 2022 15:51:59 +0000 (18:51 +0300)]
CMake support for Accelerate framework

2 years agoCorrect implementation of FP16 GELU
Georgi Gerganov [Tue, 18 Oct 2022 15:42:08 +0000 (18:42 +0300)]
Correct implementation of FP16 GELU

Can toggle it via the GGML_GELU_FP16 macro

2 years agoclose #32 : add comment about thread-safety of the C-style API
Georgi Gerganov [Tue, 18 Oct 2022 15:27:57 +0000 (18:27 +0300)]
close #32 : add comment about thread-safety of the C-style API

2 years agoref #57, #62, #63 : remove unions in C-api + remove designated initializers
Georgi Gerganov [Tue, 18 Oct 2022 15:17:24 +0000 (18:17 +0300)]
ref #57, #62, #63 : remove unions in C-api + remove designated initializers

We are not ready for designated initializers - many compilers do not
support this C++ feature yet, so removing it's non-trivial usages.

2 years agoref #62 : fix build for some compilers
Georgi Gerganov [Tue, 18 Oct 2022 07:56:37 +0000 (10:56 +0300)]
ref #62 : fix build for some compilers

For some reason, new version of GCC panic when the struct type is not
specified explicitly

2 years agoRevert recent sampling change
Georgi Gerganov [Tue, 18 Oct 2022 05:26:16 +0000 (08:26 +0300)]
Revert recent sampling change

It does not actually help and seems to produce worse results on some of
the samples

2 years agoFix EOT token handling
Georgi Gerganov [Mon, 17 Oct 2022 21:53:06 +0000 (00:53 +0300)]
Fix EOT token handling

If it is the end of the audio, pick all sampled tokens.
Otherwise, print error message.

2 years agoRevert GELU change
Georgi Gerganov [Mon, 17 Oct 2022 21:45:08 +0000 (00:45 +0300)]
Revert GELU change

Seems it does not work on x86 for some reason

2 years agoLink Accelerate framework to "stream" example
Georgi Gerganov [Mon, 17 Oct 2022 21:01:31 +0000 (00:01 +0300)]
Link Accelerate framework to "stream" example

2 years agoTry to improve the sampling strategy a bit
Georgi Gerganov [Mon, 17 Oct 2022 20:52:24 +0000 (23:52 +0300)]
Try to improve the sampling strategy a bit

It sill fails sometimes when it does not sample a timestamp token for
the entire segment. We now print a message in such cases

2 years agoFix memory sizes
Georgi Gerganov [Mon, 17 Oct 2022 20:50:44 +0000 (23:50 +0300)]
Fix memory sizes

2 years agoUse Accelerate framework on Apple silicon
Georgi Gerganov [Mon, 17 Oct 2022 18:44:16 +0000 (21:44 +0300)]
Use Accelerate framework on Apple silicon

Huge performance improvement in the Encode (almost x2 on MacBook M1 Pro)

Also various extra optimizations:

- Multi-threaded NORM operator
- Faster GELU via F16 cast

2 years agoAdding helper script for converting the PT models
Georgi Gerganov [Mon, 17 Oct 2022 15:37:06 +0000 (18:37 +0300)]
Adding helper script for converting the PT models

2 years agoclose #56 : build on FreeBSD
Georgi Gerganov [Mon, 17 Oct 2022 15:09:17 +0000 (18:09 +0300)]
close #56 : build on FreeBSD

Thanks to @abelbabel for the contribution

2 years agoMerge pull request #58 from r0y6a3n0/master
Georgi Gerganov [Mon, 17 Oct 2022 15:06:02 +0000 (18:06 +0300)]
Merge pull request #58 from r0y6a3n0/master

fix decode missing token issue

2 years agofix missing token issue
RyanChang [Mon, 17 Oct 2022 13:19:45 +0000 (21:19 +0800)]
fix missing token issue

2 years agoref #48 : clear results at the start of whisper_full
Georgi Gerganov [Sat, 15 Oct 2022 06:55:28 +0000 (09:55 +0300)]
ref #48 : clear results at the start of whisper_full

This way, even if the input audio is empty, the previous results will be
removed.

2 years agoUpdate README.md (ref #50)
Georgi Gerganov [Sat, 15 Oct 2022 06:40:08 +0000 (09:40 +0300)]
Update README.md (ref #50)

2 years agoUpdate README.md
Georgi Gerganov [Wed, 12 Oct 2022 18:25:42 +0000 (21:25 +0300)]
Update README.md

2 years agocheck if spectogram length is <100 before doing anything else
0/0 [Tue, 11 Oct 2022 17:49:48 +0000 (11:49 -0600)]
check if spectogram length is <100 before doing anything else

fixes #39

2 years agoUpdate README.md (#43)
Topping1 [Wed, 12 Oct 2022 04:32:14 +0000 (23:32 -0500)]
Update README.md (#43)

* Update README.md

Updated README.md to list new features, such as subtitle file support (VTT and SRT)

* Update README.md

Co-authored-by: Georgi Gerganov <redacted>
Co-authored-by: Georgi Gerganov <redacted>
2 years agoMerge pull request #42 from iboB/msvc-build
Georgi Gerganov [Wed, 12 Oct 2022 04:31:41 +0000 (07:31 +0300)]
Merge pull request #42 from iboB/msvc-build

ref #5 : MSVC build

2 years agoBuilding with MSVC
Borislav Stanimirov [Tue, 11 Oct 2022 17:57:52 +0000 (20:57 +0300)]
Building with MSVC

2 years agoVisual Studio ignored dirs
Borislav Stanimirov [Tue, 11 Oct 2022 17:57:33 +0000 (20:57 +0300)]
Visual Studio ignored dirs

2 years agoUpdate README.md
Georgi Gerganov [Mon, 10 Oct 2022 21:36:32 +0000 (00:36 +0300)]
Update README.md

2 years agoUpdate README.md
Georgi Gerganov [Mon, 10 Oct 2022 19:16:25 +0000 (22:16 +0300)]
Update README.md

2 years agostream : improve real-time transcription
Georgi Gerganov [Mon, 10 Oct 2022 19:06:18 +0000 (22:06 +0300)]
stream : improve real-time transcription

2 years agoMinor
Georgi Gerganov [Mon, 10 Oct 2022 19:06:03 +0000 (22:06 +0300)]
Minor

2 years agoUpdate README.md
Georgi Gerganov [Mon, 10 Oct 2022 19:05:37 +0000 (22:05 +0300)]
Update README.md

2 years agoMerge pull request #36 from Topping1/master
Georgi Gerganov [Mon, 10 Oct 2022 06:13:31 +0000 (09:13 +0300)]
Merge pull request #36 from Topping1/master

Fix SRT timestamp format from mm:ss.sss to hh:mm:ss.sss

2 years agoref #35 : add <stdbool.h> to whisper.h
Georgi Gerganov [Mon, 10 Oct 2022 05:11:18 +0000 (08:11 +0300)]
ref #35 : add <stdbool.h> to whisper.h

"bool" type is not implicitly defined for some compilers.

2 years agoMerge pull request #34 from tazz4843/master
Georgi Gerganov [Mon, 10 Oct 2022 05:05:57 +0000 (08:05 +0300)]
Merge pull request #34 from tazz4843/master

Add static library make target

2 years agoUpdate main.cpp
Topping1 [Mon, 10 Oct 2022 04:35:10 +0000 (23:35 -0500)]
Update main.cpp

2 years agoadd static library make target
0/0 [Mon, 10 Oct 2022 01:16:42 +0000 (19:16 -0600)]
add static library make target

2 years agoMerge pull request #31 from lkwq007/master
Georgi Gerganov [Sun, 9 Oct 2022 14:52:46 +0000 (17:52 +0300)]
Merge pull request #31 from lkwq007/master

Add MinGW support

2 years agoAdd MinGW support
lnyan [Sun, 9 Oct 2022 14:26:37 +0000 (22:26 +0800)]
Add MinGW support

2 years agoMinor
Georgi Gerganov [Sat, 8 Oct 2022 15:13:26 +0000 (18:13 +0300)]
Minor

2 years agoref #9 : add API documentation in whisper.h
Georgi Gerganov [Sat, 8 Oct 2022 15:09:56 +0000 (18:09 +0300)]
ref #9 : add API documentation in whisper.h

2 years agoFix Makefile for MacBook Intel
Georgi Gerganov [Sat, 8 Oct 2022 14:35:55 +0000 (17:35 +0300)]
Fix Makefile for MacBook Intel

2 years agoref #17 : print whisper logs to stderr
Georgi Gerganov [Sat, 8 Oct 2022 14:28:06 +0000 (17:28 +0300)]
ref #17 : print whisper logs to stderr

Only the transcribed/translted text is printed to stdout.
This way, one can redirect the result to a file.

2 years agoref #17 : add options to output result to file
Georgi Gerganov [Sat, 8 Oct 2022 14:22:22 +0000 (17:22 +0300)]
ref #17 : add options to output result to file

Support for:

- plain text
- VTT
- SRT

2 years agoUpdate README.md
Georgi Gerganov [Sat, 8 Oct 2022 08:46:34 +0000 (11:46 +0300)]
Update README.md

2 years agoUpdate tests
Georgi Gerganov [Sat, 8 Oct 2022 08:34:20 +0000 (11:34 +0300)]
Update tests

2 years agoci : add base model tests to GH Actions
Georgi Gerganov [Sat, 8 Oct 2022 08:17:41 +0000 (11:17 +0300)]
ci : add base model tests to GH Actions

2 years agoUpdate README.md
Georgi Gerganov [Sat, 8 Oct 2022 08:17:29 +0000 (11:17 +0300)]
Update README.md

2 years agoCreate README.md
Georgi Gerganov [Sat, 8 Oct 2022 08:16:37 +0000 (11:16 +0300)]
Create README.md

2 years agoAdding dummy models for testing purposes
Georgi Gerganov [Sat, 8 Oct 2022 07:57:42 +0000 (10:57 +0300)]
Adding dummy models for testing purposes

2 years agoAdding sanitizer tests
Georgi Gerganov [Sat, 8 Oct 2022 07:56:59 +0000 (10:56 +0300)]
Adding sanitizer tests

2 years agoCleanup CMakeLists.txt
Georgi Gerganov [Sat, 8 Oct 2022 06:00:59 +0000 (09:00 +0300)]
Cleanup CMakeLists.txt

2 years agocmake : fixes
Georgi Gerganov [Fri, 7 Oct 2022 21:21:16 +0000 (00:21 +0300)]
cmake : fixes

2 years agoci : add cmake builds
Georgi Gerganov [Fri, 7 Oct 2022 21:14:34 +0000 (00:14 +0300)]
ci : add cmake builds

2 years agowhisper : fix bug in token sampling logic
Georgi Gerganov [Fri, 7 Oct 2022 21:14:05 +0000 (00:14 +0300)]
whisper : fix bug in token sampling logic

Could overflow buffer

2 years agoAdd CMake support
Georgi Gerganov [Fri, 7 Oct 2022 20:53:12 +0000 (23:53 +0300)]
Add CMake support

2 years agoref #10 : option to keep context in "stream" example
Georgi Gerganov [Fri, 7 Oct 2022 19:30:44 +0000 (22:30 +0300)]
ref #10 : option to keep context in "stream" example

Seems the results become worse when we keep the context, so by default
this is not enabled

2 years agoref #10 : add "step" argument for "stream" example
Georgi Gerganov [Fri, 7 Oct 2022 19:07:24 +0000 (22:07 +0300)]
ref #10 : add "step" argument for "stream" example

Controls how often we run the inference.
By default, we run it every 3 seconds.

2 years agoref #16, #22 : add "offset" argument
Georgi Gerganov [Fri, 7 Oct 2022 19:00:40 +0000 (22:00 +0300)]
ref #16, #22 : add "offset" argument

Allows to start processing the input audio at some offset from the
beginning. Useful for splitting a long job into multiple tasks.

2 years agoref #11, #18, #26 : fix CACHE_LINE_SIZE constant
Georgi Gerganov [Fri, 7 Oct 2022 18:56:44 +0000 (21:56 +0300)]
ref #11, #18, #26 : fix CACHE_LINE_SIZE constant

2 years agoAdd CI using Github Actions
Georgi Gerganov [Fri, 7 Oct 2022 15:32:18 +0000 (18:32 +0300)]
Add CI using Github Actions

2 years agoref #22 : add option to provide multiple input .wav files
Georgi Gerganov [Wed, 5 Oct 2022 20:44:10 +0000 (23:44 +0300)]
ref #22 : add option to provide multiple input .wav files

2 years agoUpdate README.md
Georgi Gerganov [Wed, 5 Oct 2022 20:13:15 +0000 (23:13 +0300)]
Update README.md

2 years agoMinor updates
Georgi Gerganov [Wed, 5 Oct 2022 20:11:02 +0000 (23:11 +0300)]
Minor updates

2 years agowip : rpi4 support
Georgi Gerganov [Wed, 5 Oct 2022 18:34:41 +0000 (21:34 +0300)]
wip : rpi4 support

2 years agowip : improve makefile
Georgi Gerganov [Wed, 5 Oct 2022 17:41:35 +0000 (20:41 +0300)]
wip : improve makefile

2 years agoMerge pull request #20 from ArtyomZemlyak/master
Georgi Gerganov [Wed, 5 Oct 2022 04:27:29 +0000 (07:27 +0300)]
Merge pull request #20 from ArtyomZemlyak/master

Fix: main get language from cli args

2 years agoFix: main get n_threads from cli
Артём Земляк [Wed, 5 Oct 2022 02:47:48 +0000 (09:47 +0700)]
Fix: main get n_threads from cli

2 years agoFix: main get language from cli args
Артём Земляк [Wed, 5 Oct 2022 02:24:53 +0000 (09:24 +0700)]
Fix: main get language from cli args

2 years agoUpdate README.md
Georgi Gerganov [Tue, 4 Oct 2022 20:27:25 +0000 (23:27 +0300)]
Update README.md