]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: add set rows for f32 and f16 (llama/14551)
authorAman Gupta <redacted>
Sat, 12 Jul 2025 13:31:38 +0000 (21:31 +0800)
committerGeorgi Gerganov <redacted>
Sat, 19 Jul 2025 14:47:23 +0000 (17:47 +0300)
commitac6782b3824f874a8e7f8a690310ad474c5c3823
tree86a85d02ef765dded8703b82f2a94b65abe5bd3e
parent3323219cd3cc050e5c7133cd4fc1e50d1f590faf
CUDA: add set rows for f32 and f16 (llama/14551)

* CUDA: add set rows for f32 and f16

* Review: change kernel params, use strides from host

* Use 1-d kernel

* Review: use int64_t for blockDim.x, rename nb->s for clarity
src/ggml-cuda/ggml-cuda.cu
src/ggml-cuda/set-rows.cu [new file with mode: 0644]
src/ggml-cuda/set-rows.cuh [new file with mode: 0644]