git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
cuda: refactored ssm_scan and use CUB (llama/13291)
author David Zhao <redacted>
Sat, 9 Aug 2025 18:29:43 +0000 (13:29 -0500)
committer Georgi Gerganov <redacted>
Mon, 18 Aug 2025 17:30:45 +0000 (20:30 +0300)
commit 457eadfe6f61b6492e3ae5d0585b8e221c7e2dfa
tree ce47432ab442de51aadd1e2fc11c5ec55f324de9
parent 93c7a08019ba3f5ff6addf71dcbcd5b2109481ab

* cuda: refactored ssm_scan to use CUB

* fixed compilation error when not using CUB

* assign L to constant and use size_t instead of int

* deduplicated functions

* change min blocks per mp to 1

* Use CUB load and store with warp transpose

* suppress clang warning
ggml/src/ggml-cuda/ssm-scan.cu
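
For readers unfamiliar with the CUB primitives named in the commit message, the following is a minimal sketch of block-wide load and store with the warp-transpose algorithms, together with a launch bound of one minimum block per multiprocessor and `size_t` indexing. It is not the code from ssm-scan.cu: the kernel name `scale_tile`, the tile sizes, and the doubling operation are all hypothetical, chosen only to illustrate the pattern. It assumes the input length is a whole number of tiles and that the block size is a multiple of the warp size, which the warp-transpose algorithms require.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>
#include <cub/cub.cuh>

// Hypothetical kernel, NOT taken from ssm-scan.cu: doubles every element of a tile.
// The second __launch_bounds__ argument is the minimum number of blocks per multiprocessor.
template <int BLOCK_THREADS, int ITEMS_PER_THREAD>
__global__ void __launch_bounds__(BLOCK_THREADS, 1)
scale_tile(const float * __restrict__ src, float * __restrict__ dst) {
    using BlockLoad  = cub::BlockLoad<float, BLOCK_THREADS, ITEMS_PER_THREAD,
                                      cub::BLOCK_LOAD_WARP_TRANSPOSE>;
    using BlockStore = cub::BlockStore<float, BLOCK_THREADS, ITEMS_PER_THREAD,
                                       cub::BLOCK_STORE_WARP_TRANSPOSE>;

    // Load and store are never active at the same time, so their scratch space can overlap.
    __shared__ union {
        typename BlockLoad::TempStorage  load;
        typename BlockStore::TempStorage store;
    } temp;

    // size_t avoids overflow of the tile offset for large inputs.
    const size_t tile = (size_t) blockIdx.x * BLOCK_THREADS * ITEMS_PER_THREAD;

    float items[ITEMS_PER_THREAD];
    BlockLoad(temp.load).Load(src + tile, items);   // coalesced read, blocked arrangement

    #pragma unroll
    for (int i = 0; i < ITEMS_PER_THREAD; ++i) {
        items[i] *= 2.0f;
    }

    __syncthreads();                                // temp storage is reused below
    BlockStore(temp.store).Store(dst + tile, items);
}

int main() {
    constexpr int    threads = 128;                 // multiple of the warp size
    constexpr int    items   = 4;
    constexpr int    blocks  = 8;
    constexpr size_t n       = (size_t) blocks * threads * items;

    std::vector<float> host(n);
    for (size_t i = 0; i < n; ++i) {
        host[i] = (float) i;
    }

    float * src = nullptr;
    float * dst = nullptr;
    cudaMalloc(&src, n * sizeof(float));
    cudaMalloc(&dst, n * sizeof(float));
    cudaMemcpy(src, host.data(), n * sizeof(float), cudaMemcpyHostToDevice);

    scale_tile<threads, items><<<blocks, threads>>>(src, dst);
    cudaMemcpy(host.data(), dst, n * sizeof(float), cudaMemcpyDeviceToHost);

    // Verify on the host that every element was doubled.
    int errors = 0;
    for (size_t i = 0; i < n; ++i) {
        if (host[i] != 2.0f * (float) i) {
            ++errors;
        }
    }
    std::printf("mismatches: %d\n", errors);

    cudaFree(src);
    cudaFree(dst);
    return errors == 0 ? 0 : 1;
}
```

With the warp-transpose variants, each warp reads and writes global memory in a striped, coalesced pattern and then transposes through shared memory, so every thread ends up holding consecutive items. A per-thread sequential computation such as a scan generally needs that blocked arrangement.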