git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
whisper : add integer quantization support (#540)
author Georgi Gerganov <redacted>
Sun, 30 Apr 2023 15:51:57 +0000 (18:51 +0300)
committer GitHub <redacted>
Sun, 30 Apr 2023 15:51:57 +0000 (18:51 +0300)
commit 794b162a464cc5f3af62d012bc243641fd18b60a
tree 50f07e79539088c9987a6afd85aa788012256332
parent 5fd1bdd7fc501d1a94dcedf80ec539f696deaf3f
* whisper : add integer quantization support

* examples : add common-ggml + prepare to add "quantize" tool

* whisper : quantization tool ready

* whisper : fix F32 support

* whisper : try to fix shared lib linkage

* wasm : update quantized models to Q5

* bench.wasm : remove "medium" button

* bench.wasm : fix custom model button

* ggml : add Q5_0 and Q5_1 WASM SIMD

* wasm : add quantized models to all WASM examples

* wasm : bump DB version number to 2

* talk-llama : update example to latest llama.cpp

* node : increase test timeout to 10s

* readme : add information for model quantization

* wasm : add links to other examples
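
For readers unfamiliar with the term, the general idea behind block-wise integer quantization can be sketched as below. This is only an illustrative sketch under stated assumptions: the names `QuantBlock`, `quantize_block` and `dequantize_block` are hypothetical, the scheme is a plain symmetric one (one float scale per block, signed integer codes), and it is not the bit-packed Q5_0/Q5_1 layout that this commit adds to ggml.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical container: one scale per block plus one signed code per weight.
// Real formats pack the codes into fewer bits; here each code gets a full byte.
struct QuantBlock {
    float scale;
    std::vector<int8_t> q;
};

// Symmetric quantization of one block of weights to `bits`-bit signed codes.
QuantBlock quantize_block(const std::vector<float>& x, int bits) {
    assert(bits >= 2 && bits <= 8);
    const int qmax = (1 << (bits - 1)) - 1;  // e.g. 15 when bits == 5

    // The largest magnitude in the block determines the scale.
    float amax = 0.0f;
    for (float v : x) amax = std::max(amax, std::fabs(v));

    QuantBlock b;
    b.scale = amax > 0.0f ? amax / static_cast<float>(qmax) : 1.0f;
    b.q.reserve(x.size());
    for (float v : x) {
        // Round to the nearest code and clamp into [-qmax, qmax].
        long q = std::lround(v / b.scale);
        q = std::max<long>(-qmax, std::min<long>(qmax, q));
        b.q.push_back(static_cast<int8_t>(q));
    }
    return b;
}

// Reconstruct approximate weights: x[i] is approximated by scale * q[i].
std::vector<float> dequantize_block(const QuantBlock& b) {
    std::vector<float> y;
    y.reserve(b.q.size());
    for (int8_t q : b.q) y.push_back(b.scale * static_cast<float>(q));
    return y;
}
```

In this sketch, each reconstructed value differs from the original by at most about half of the block's scale, which is the accuracy cost paid for storing small integers instead of floats.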
42 files changed:
.gitignore
CMakeLists.txt
Makefile
README.md
bindings/javascript/whisper.js
examples/CMakeLists.txt
examples/addon.node/__test__/whisper.spec.js
examples/bench.wasm/CMakeLists.txt
examples/bench.wasm/index-tmpl.html
examples/command.wasm/index-tmpl.html
examples/common-ggml.cpp [new file with mode: 0644]
examples/common-ggml.h [new file with mode: 0644]
examples/common.cpp
examples/common.h
examples/helpers.js
examples/main/main.cpp
examples/quantize/CMakeLists.txt [new file with mode: 0644]
examples/quantize/README.md [new file with mode: 0644]
examples/quantize/quantize.cpp [new file with mode: 0644]
examples/stream.wasm/index-tmpl.html
examples/talk-llama/llama-util.h [new file with mode: 0644]
examples/talk-llama/llama.cpp
examples/talk-llama/llama.h
examples/talk-llama/llama_internal.h [deleted file]
examples/talk-llama/llama_util.h [deleted file]
examples/talk-llama/talk-llama.cpp
examples/talk.wasm/CMakeLists.txt
examples/talk.wasm/gpt-2.cpp
examples/talk.wasm/gpt-2.h
examples/talk.wasm/index-tmpl.html
examples/talk/CMakeLists.txt
examples/talk/gpt-2.cpp
examples/talk/gpt-2.h
examples/whisper.wasm/CMakeLists.txt
examples/whisper.wasm/emscripten.cpp
examples/whisper.wasm/index-tmpl.html
extra/quantize-all.sh [new file with mode: 0755]
extra/sync-ggml.sh
ggml.c
ggml.h
whisper.cpp
whisper.h