whisper : use flash attention (#2152)
author    Georgi Gerganov <redacted>
          Wed, 15 May 2024 06:38:19 +0000 (09:38 +0300)
committer GitHub <redacted>
          Wed, 15 May 2024 06:38:19 +0000 (09:38 +0300)
commit 7094ea5e750266e16c16c7aecac8fc03294ecaa3
tree   1166f219a2d57b2da63273ab840e9c4701c28a84
parent 9d5771ae43d7fc7cca9d31dd924b13a29144e476
whisper : use flash attention (#2152)

* whisper : use flash attention in the encoder

* whisper : add kv_pad

* whisper : remove extra backend instance (huh?)

* whisper : use FA for cross-attention

* whisper : use FA for self-attention

* whisper : simplify encoder FA

* whisper : add flash_attn runtime parameter

* scripts : add bench log

* scripts : add M1 Pro bench log
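The `flash_attn` runtime parameter introduced by this commit is a field of `whisper_context_params`, so callers opt in at context creation time. A minimal sketch of enabling it through the C API (the model path is a placeholder; error handling kept to the essentials):

```cpp
// Sketch: enabling the flash attention path added by this commit.
// Assumes the post-commit whisper.h API; model path is a placeholder.
#include "whisper.h"

int main() {
    struct whisper_context_params cparams = whisper_context_default_params();
    cparams.flash_attn = true; // runtime parameter added by this commit

    struct whisper_context * ctx =
        whisper_init_from_file_with_params("models/ggml-base.en.bin", cparams);
    if (ctx == NULL) {
        return 1; // model failed to load
    }

    // ... run transcription via whisper_full() as usual ...

    whisper_free(ctx);
    return 0;
}
```

The changed example programs (bench, main, server, stream, etc.) plumb the same boolean through their command-line parsing, so the feature can be toggled per run without rebuilding.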
13 files changed:
examples/bench/bench.cpp
examples/command/command.cpp
examples/lsp/lsp.cpp
examples/main/main.cpp
examples/server/server.cpp
examples/stream/stream.cpp
examples/talk-llama/talk-llama.cpp
examples/talk/talk.cpp
examples/wchess/wchess.cmd/wchess.cmd.cpp
scripts/bench-all-gg.txt [new file with mode: 0644]
scripts/bench-all.sh
whisper.cpp
whisper.h