]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: fuse adds, fuse add with rms norm (llama/15631)
authorAman Gupta <redacted>
Fri, 29 Aug 2025 03:35:58 +0000 (11:35 +0800)
committerGeorgi Gerganov <redacted>
Sat, 20 Sep 2025 10:42:44 +0000 (13:42 +0300)
commit6d7ddaf793b1b9b467ca57bd415d438b321c4ea9
treebe7fc3fd484d2b95e95aacf50403c9ed23b5d983
parentdc9f55bbb036c24d602272b612aea0cb4fc59dc6
CUDA: fuse adds, fuse add with rms norm (llama/15631)

* CUDA: fused add with rms_norm_mul

* Non-broadcast fuse works

* Add fused adds

* format

* Remove n_fuse from template params

* Address review comments

* Move template inside binbcast
ggml/src/ggml-cuda/binbcast.cu
ggml/src/ggml-cuda/binbcast.cuh
ggml/src/ggml-cuda/ggml-cuda.cu
ggml/src/ggml-cuda/norm.cu
ggml/src/ggml-cuda/norm.cuh