]>
git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/log
summary |
shortlog | log |
commit |
commitdiff |
tree
first ⋅ prev ⋅ next
Georgi Gerganov [Wed, 5 Oct 2022 04:27:29 +0000 (07:27 +0300)]
Merge pull request #20 from ArtyomZemlyak/master
Fix: main get language from cli args
Артём Земляк [Wed, 5 Oct 2022 02:47:48 +0000 (09:47 +0700)]
Fix: main get n_threads from cli
Артём Земляк [Wed, 5 Oct 2022 02:24:53 +0000 (09:24 +0700)]
Fix: main get language from cli args
Georgi Gerganov [Tue, 4 Oct 2022 20:27:25 +0000 (23:27 +0300)]
Update README.md
Georgi Gerganov [Tue, 4 Oct 2022 20:16:33 +0000 (23:16 +0300)]
Improve result printing
Georgi Gerganov [Tue, 4 Oct 2022 19:43:37 +0000 (22:43 +0300)]
Extend C-style API with full inference methods
Georgi Gerganov [Tue, 4 Oct 2022 17:35:01 +0000 (20:35 +0300)]
Initial C-style interface for whisper.cpp
Georgi Gerganov [Sun, 2 Oct 2022 17:11:17 +0000 (20:11 +0300)]
ref #10 : handle Ctrl+C in "stream" app
Georgi Gerganov [Sun, 2 Oct 2022 15:19:22 +0000 (18:19 +0300)]
Update README.md
Georgi Gerganov [Sun, 2 Oct 2022 14:55:45 +0000 (17:55 +0300)]
ref #10 : quick-and-dirty attempt for real-time audio transciption
- Processes input in chunks of 3 seconds.
- Padding audio with silence
- Uses 1 second audio from previous pass
- No text context
Georgi Gerganov [Sun, 2 Oct 2022 14:46:21 +0000 (17:46 +0300)]
Fix bug in FFT
The FFT routine does not work for odd N
Solution is to add DFT and use it when N is odd
Georgi Gerganov [Sat, 1 Oct 2022 05:41:57 +0000 (08:41 +0300)]
Fix reading of stereo WAV files
Georgi Gerganov [Fri, 30 Sep 2022 21:01:04 +0000 (00:01 +0300)]
Update README.md
Georgi Gerganov [Fri, 30 Sep 2022 17:37:29 +0000 (20:37 +0300)]
Bug fix
Longer prompts could cause out-of-bounds access
Georgi Gerganov [Fri, 30 Sep 2022 16:33:09 +0000 (19:33 +0300)]
Reduce memory usage even more + better sampling
- The encode/decode memory buffers are now reused
- If the 30-sec segment goes for too long without a timestamp token, we
force one. Improves transcription for large model
- Stereo support
- Add "micro-machines.wav" sample
Georgi Gerganov [Thu, 29 Sep 2022 20:48:01 +0000 (23:48 +0300)]
Update README.md
Georgi Gerganov [Thu, 29 Sep 2022 20:37:59 +0000 (23:37 +0300)]
Update README.md
Georgi Gerganov [Thu, 29 Sep 2022 20:09:04 +0000 (23:09 +0300)]
ref #4 : added transcription timestamps
Can be turned off with "-nt" argument.
Performance has also improved.
Georgi Gerganov [Wed, 28 Sep 2022 19:06:09 +0000 (22:06 +0300)]
Merge pull request #3 from cdosoftei/master
Pass -pthread to linker
cdosoftei [Wed, 28 Sep 2022 19:01:54 +0000 (15:01 -0400)]
Pass -pthread to linker
Georgi Gerganov [Wed, 28 Sep 2022 18:13:32 +0000 (21:13 +0300)]
Update README.md
Georgi Gerganov [Wed, 28 Sep 2022 17:46:05 +0000 (20:46 +0300)]
Flash + language support (ref #2)
- Achieved big performance improvement + memory usage reduction
- Can now translate / transcribe different languages
Georgi Gerganov [Mon, 26 Sep 2022 08:58:44 +0000 (11:58 +0300)]
ref #1 : add -pthread to compilation flags
Georgi Gerganov [Mon, 26 Sep 2022 06:36:51 +0000 (09:36 +0300)]
Update README.md and simplify usage
Georgi Gerganov [Sun, 25 Sep 2022 19:35:26 +0000 (22:35 +0300)]
Create README.md
Georgi Gerganov [Sun, 25 Sep 2022 19:15:44 +0000 (22:15 +0300)]
Create LICENSE
Georgi Gerganov [Sun, 25 Sep 2022 18:23:15 +0000 (21:23 +0300)]
Initial release