]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
cuda : enable CUDA graphs for MMID 1 <= BS <= 4 (llama/19645)
authorGeorgi Gerganov <redacted>
Tue, 17 Feb 2026 10:31:49 +0000 (12:31 +0200)
committerGeorgi Gerganov <redacted>
Fri, 27 Feb 2026 18:57:58 +0000 (20:57 +0200)
commitcf4bd07028c007d42aeaa1f987a8159f7cb0cc92
tree0c7100a80b026e8422f6551b513ecec065aeb92c
parent5ee5748722ab9674c6b1c9147bea9ffe06d6ebf8
cuda : enable CUDA graphs for MMID 1 <= BS <= 4 (llama/19645)

* cuda : enable CUDA graphs for MMID BS <= 4

* cont : add stream capture check

Co-authored-by: Oliver Simons <redacted>
* cont : add MMVQ_MMID_MAX_BATCH_SIZE

---------

Co-authored-by: Oliver Simons <redacted>
ggml/src/ggml-cuda/ggml-cuda.cu
ggml/src/ggml-cuda/mmvq.cuh