git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit


author	Max Krasnyansky <redacted>
	Thu, 30 Oct 2025 12:26:05 +0000 (05:26 -0700)
committer	Georgi Gerganov <redacted>
	Sun, 9 Nov 2025 21:38:03 +0000 (23:38 +0200)
commit	f1fdb91e95f9941fedbdb718dfa2e233716639b0
tree	db6814c9a26748296c2945bd2b4eaaada7aa7c7e
parent	f7dfa39104dbb756fc0d839698edaffaf3c7ddaa

cpu: introduce chunking for flash attention (llama/16829)

Factor out the core FA loop into flash_atten_f16_one_chunk and add an outer loop
on top that handles the chunks.
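
The shape of this refactor can be illustrated with a minimal single-threaded
sketch. It is not the code from ops.cpp: process_one_chunk, process_in_chunks,
and the visits vector are hypothetical names chosen for illustration, and the
rounded-up chunk size is an assumption. The sketch only shows the pattern of a
per-chunk core function driven by an outer loop over row ranges.

```cpp
#include <algorithm>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for the factored-out core loop:
// handles the half-open row range [ir0, ir1).
static void process_one_chunk(int64_t ir0, int64_t ir1, std::vector<int> & visits) {
    for (int64_t ir = ir0; ir < ir1; ++ir) {
        visits[ir] += 1; // placeholder for the per-row attention work
    }
}

// Outer loop: split nr rows into at most nchunk contiguous chunks and
// run the core function on each. Returns the number of chunks processed.
static int64_t process_in_chunks(int64_t nr, int64_t nchunk, std::vector<int> & visits) {
    if (nr <= 0 || nchunk <= 0) {
        return 0;
    }
    const int64_t dr = (nr + nchunk - 1) / nchunk; // rows per chunk, rounded up
    int64_t processed = 0;
    for (int64_t chunk = 0; chunk < nchunk; ++chunk) {
        const int64_t ir0 = dr * chunk;
        const int64_t ir1 = std::min(ir0 + dr, nr);
        if (ir0 >= ir1) {
            break; // no rows left for this or any later chunk
        }
        process_one_chunk(ir0, ir1, visits);
        ++processed;
    }
    return processed;
}
```

In a multithreaded setting the outer loop would typically hand chunks to
worker threads instead of iterating over them sequentially; how the actual
change distributes chunks is not stated in this commit message.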

ggml/src/ggml-cpu/ops.cpp


Packaging of ggerganov/whisper.cpp
