]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: fuse rope + set_rows (llama/16884)
authorAman Gupta <redacted>
Thu, 13 Nov 2025 00:50:01 +0000 (08:50 +0800)
committerGeorgi Gerganov <redacted>
Mon, 17 Nov 2025 13:34:43 +0000 (15:34 +0200)
commit26c1225719c619856109055b7875540e871e638a
tree7a54602733afc0d57faec3fa7b9ccbabf5e1d010
parent505ad1f8c65542b94096510166a5ff97d91225db
CUDA: fuse rope + set_rows (llama/16884)

* CUDA: add fused rope

* move k forward_expand up

* create helper function instead of re-using params

* make assert statement more in line with comment

* rope_norm: coalesced writes to global mem
src/ggml-cuda/ggml-cuda.cu
src/ggml-cuda/rope.cu
src/ggml-cuda/rope.cuh