git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: fix bug in rms_norm fusion (#15660)
author Aman Gupta <redacted>
Fri, 29 Aug 2025 13:30:06 +0000 (21:30 +0800)
committer GitHub <redacted>
Fri, 29 Aug 2025 13:30:06 +0000 (21:30 +0800)
commit 81017865ee444cf49ce0136f2be1e41a0270ff91
tree a05e6014f850ef01f6f6b911894d448cc130ee23
parent 60e5eee31f1af9bb579ac45380e3857d610020b9

* CUDA: fix bug in rms_norm fusion

* Fix bug for OP_REPEAT

* Fix index for add
ggml/src/ggml-cuda/binbcast.cu
ggml/src/ggml-cuda/ggml-cuda.cu
ggml/src/ggml-cuda/norm.cu