]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: add set rows for f32 and f16 (llama/14551)
authorAman Gupta <redacted>
Sat, 12 Jul 2025 13:31:38 +0000 (21:31 +0800)
committerGeorgi Gerganov <redacted>
Sat, 19 Jul 2025 21:23:50 +0000 (00:23 +0300)
commit66b3a39bdc166e42874a257b796559d1c5a382bf
tree5ee605f3a195362791671678b6360241f62b0e49
parent032697b9a850dc2615555e2a93a683cc3dd58559
CUDA: add set rows for f32 and f16 (llama/14551)

* CUDA: add set rows for f32 and f16

* Review: change kernel params, use strides from host

* Use 1-d kernel

* Review: use int64_t for blockDim.x, rename nb->s for clarity
ggml/src/ggml-cuda/ggml-cuda.cu
ggml/src/ggml-cuda/set-rows.cu [new file with mode: 0644]
ggml/src/ggml-cuda/set-rows.cuh [new file with mode: 0644]