]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/log
pkg/ggml/sources/whisper.cpp
2 years agoMinor updates
Georgi Gerganov [Wed, 5 Oct 2022 20:11:02 +0000 (23:11 +0300)]
Minor updates

2 years agowip : rpi4 support
Georgi Gerganov [Wed, 5 Oct 2022 18:34:41 +0000 (21:34 +0300)]
wip : rpi4 support

2 years agowip : improve makefile
Georgi Gerganov [Wed, 5 Oct 2022 17:41:35 +0000 (20:41 +0300)]
wip : improve makefile

2 years agoMerge pull request #20 from ArtyomZemlyak/master
Georgi Gerganov [Wed, 5 Oct 2022 04:27:29 +0000 (07:27 +0300)]
Merge pull request #20 from ArtyomZemlyak/master

Fix: main get language from cli args

2 years agoFix: main get n_threads from cli
Артём Земляк [Wed, 5 Oct 2022 02:47:48 +0000 (09:47 +0700)]
Fix: main get n_threads from cli

2 years agoFix: main get language from cli args
Артём Земляк [Wed, 5 Oct 2022 02:24:53 +0000 (09:24 +0700)]
Fix: main get language from cli args

2 years agoUpdate README.md
Georgi Gerganov [Tue, 4 Oct 2022 20:27:25 +0000 (23:27 +0300)]
Update README.md

2 years agoImprove result printing
Georgi Gerganov [Tue, 4 Oct 2022 20:16:33 +0000 (23:16 +0300)]
Improve result printing

2 years agoExtend C-style API with full inference methods
Georgi Gerganov [Tue, 4 Oct 2022 19:43:37 +0000 (22:43 +0300)]
Extend C-style API with full inference methods

2 years agoInitial C-style interface for whisper.cpp
Georgi Gerganov [Tue, 4 Oct 2022 17:35:01 +0000 (20:35 +0300)]
Initial C-style interface for whisper.cpp

2 years agoref #10 : handle Ctrl+C in "stream" app
Georgi Gerganov [Sun, 2 Oct 2022 17:11:17 +0000 (20:11 +0300)]
ref #10 : handle Ctrl+C in "stream" app

2 years agoUpdate README.md
Georgi Gerganov [Sun, 2 Oct 2022 15:19:22 +0000 (18:19 +0300)]
Update README.md

2 years agoref #10 : quick-and-dirty attempt for real-time audio transciption
Georgi Gerganov [Sun, 2 Oct 2022 14:55:45 +0000 (17:55 +0300)]
ref #10 : quick-and-dirty attempt for real-time audio transciption

- Processes input in chunks of 3 seconds.
- Padding audio with silence
- Uses 1 second audio from previous pass
- No text context

2 years agoFix bug in FFT
Georgi Gerganov [Sun, 2 Oct 2022 14:46:21 +0000 (17:46 +0300)]
Fix bug in FFT

The FFT routine does not work for odd N
Solution is to add DFT and use it when N is odd

2 years agoFix reading of stereo WAV files
Georgi Gerganov [Sat, 1 Oct 2022 05:41:57 +0000 (08:41 +0300)]
Fix reading of stereo WAV files

2 years agoUpdate README.md
Georgi Gerganov [Fri, 30 Sep 2022 21:01:04 +0000 (00:01 +0300)]
Update README.md

2 years agoBug fix
Georgi Gerganov [Fri, 30 Sep 2022 17:37:29 +0000 (20:37 +0300)]
Bug fix

Longer prompts could cause out-of-bounds access

2 years agoReduce memory usage even more + better sampling
Georgi Gerganov [Fri, 30 Sep 2022 16:33:09 +0000 (19:33 +0300)]
Reduce memory usage even more + better sampling

- The encode/decode memory buffers are now reused
- If the 30-sec segment goes for too long without a timestamp token, we
  force one. Improves transcription for large model
- Stereo support
- Add "micro-machines.wav" sample

2 years agoUpdate README.md
Georgi Gerganov [Thu, 29 Sep 2022 20:48:01 +0000 (23:48 +0300)]
Update README.md

2 years agoUpdate README.md
Georgi Gerganov [Thu, 29 Sep 2022 20:37:59 +0000 (23:37 +0300)]
Update README.md

2 years agoref #4 : added transcription timestamps
Georgi Gerganov [Thu, 29 Sep 2022 20:09:04 +0000 (23:09 +0300)]
ref #4 : added transcription timestamps

Can be turned off with "-nt" argument.
Performance has also improved.

2 years agoMerge pull request #3 from cdosoftei/master
Georgi Gerganov [Wed, 28 Sep 2022 19:06:09 +0000 (22:06 +0300)]
Merge pull request #3 from cdosoftei/master

Pass -pthread to linker

2 years agoPass -pthread to linker
cdosoftei [Wed, 28 Sep 2022 19:01:54 +0000 (15:01 -0400)]
Pass -pthread to linker

2 years agoUpdate README.md
Georgi Gerganov [Wed, 28 Sep 2022 18:13:32 +0000 (21:13 +0300)]
Update README.md

2 years agoFlash + language support (ref #2)
Georgi Gerganov [Wed, 28 Sep 2022 17:46:05 +0000 (20:46 +0300)]
Flash + language support (ref #2)

- Achieved big performance improvement + memory usage reduction
- Can now translate / transcribe different languages

2 years agoref #1 : add -pthread to compilation flags
Georgi Gerganov [Mon, 26 Sep 2022 08:58:44 +0000 (11:58 +0300)]
ref #1 : add -pthread to compilation flags

2 years agoUpdate README.md and simplify usage
Georgi Gerganov [Mon, 26 Sep 2022 06:36:51 +0000 (09:36 +0300)]
Update README.md and simplify usage

2 years agoCreate README.md
Georgi Gerganov [Sun, 25 Sep 2022 19:35:26 +0000 (22:35 +0300)]
Create README.md

2 years agoCreate LICENSE
Georgi Gerganov [Sun, 25 Sep 2022 19:15:44 +0000 (22:15 +0300)]
Create LICENSE

2 years agoInitial release
Georgi Gerganov [Sun, 25 Sep 2022 18:23:15 +0000 (21:23 +0300)]
Initial release