]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: Do not mutate cgraph for fused ADDs (llama/19566)
authorOliver Simons <redacted>
Fri, 13 Feb 2026 09:37:55 +0000 (10:37 +0100)
committerGeorgi Gerganov <redacted>
Sun, 15 Feb 2026 19:44:37 +0000 (21:44 +0200)
commit3eb4905af17b327a3b78376ed1ff8883b77e6572
treed5f847d2bb3ac8ceefe8f795559ff82231121800
parent0e94faa19cdcb1e1f779b4c86a3087b715785e19
CUDA: Do not mutate cgraph for fused ADDs (llama/19566)

* Do not mutate cgraph for fused ADDs

1. We should try to minimize in-place changes to the incoming
   ggml_cgraph where possible (those should happen in graph_optimize)
2. Modifying in-place leads to an additional, unnecessary graph capture
   step as we store the properties before modifying the graph in-place
   in the cuda-backend

* Assert ggml_tensor is trivially copyable

* Update ggml/src/ggml-cuda/ggml-cuda.cu

Co-authored-by: Aman Gupta <redacted>
---------

Co-authored-by: Aman Gupta <redacted>
ggml/src/ggml-cuda/ggml-cuda.cu