git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit

overview / pkg / ggml / sources / whisper.cpp / commit

author	agray3 <redacted>
	Wed, 15 May 2024 13:44:49 +0000 (14:44 +0100)
committer	Georgi Gerganov <redacted>
	Sun, 16 Jun 2024 15:19:48 +0000 (18:19 +0300)
commit	8d55ccdb8cafe5a5e9b5b8ed5b6ce0e9a6a642af
tree	a0f9f5fb0ea4fc07932b57939fd87111c0743e4d	tree
parent	37a72cb1703edf9bdc97d3873f7fec72f542edc0	commit \| diff

Avoid unnecessarily disabling CUDA graphs (llama/7302)

As discussed in PR #6766, CUDA graphs were being disabled in the presence of long prompts.
This fixes the issue by avoiding the consective update counter from incrementing unnecessarily
for tokens in which cuda graphs are disabled due to batch size > 1.

ggml-cuda.cu

diff | blob | history

Packaging of ggerganov/whisper.cpp

RSS Atom