]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: fuse rope + set_rows (llama/16884)
authorAman Gupta <redacted>
Thu, 13 Nov 2025 00:50:01 +0000 (08:50 +0800)
committerGeorgi Gerganov <redacted>
Mon, 17 Nov 2025 19:05:46 +0000 (21:05 +0200)
commit84275fc4930230dc6366531688d15e20f7337789
tree2ee18d87bdb73e9121d11a5def699bc658f43f01
parent566c4c4469f6deb54dfa39cef306840f1ecbff36
CUDA: fuse rope + set_rows (llama/16884)

* CUDA: add fused rope

* move k forward_expand up

* create helper function instead of re-using params

* make assert statement more in line with comment

* rope_norm: coalesced writes to global mem
ggml/src/ggml-cuda/ggml-cuda.cu
ggml/src/ggml-cuda/rope.cu
ggml/src/ggml-cuda/rope.cuh