git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit

overview / pkg / ggml / sources / whisper.cpp / commit

author	Oliver Simons <redacted>
	Fri, 18 Jul 2025 11:35:32 +0000 (13:35 +0200)
committer	Georgi Gerganov <redacted>
	Sat, 19 Jul 2025 21:23:50 +0000 (00:23 +0300)
commit	d4a7ea1634b0b07b52be9fb889d9f437177cc021
tree	9e0e25f986eeaa183fe067d3bb3337a4232e3bfa	tree
parent	9a07cb064aa1f5b94b0868ee294b98a2e5b01b9a	commit \| diff

cuda : Fix Gemma3n not executed as CUDA_GRAPH on NVGPUs (llama/14741)

* Fix Gemma3n not executed as CUDA_GRAPH on NVGPUs

Gemma3n uses Matrix-Matrix addition as part of their input processing,
wrongly triggering CUDA_GRAPH disablement on NVGPUs even when batch-size
of 1 is used.

* Exclude `project_per_layer_input` by matching node names

This ensures that all other graphs which don't exhibit this pattern do
not have their behavior changed.

* Revert unnecessary formatting changes

ggml/src/ggml-cuda/ggml-cuda.cu

diff | blob | history

Packaging of ggerganov/whisper.cpp

RSS Atom