]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: loop over ne2*ne3 in case it overflows (llama/19538)
authorAman Gupta <redacted>
Fri, 13 Feb 2026 11:31:40 +0000 (17:01 +0530)
committerGeorgi Gerganov <redacted>
Sat, 14 Feb 2026 22:20:18 +0000 (00:20 +0200)
commit686bf7dcd72ce60a344fe110a7ea8da3c8231624
treee6468339f0024d3072e8678aaf32224ebadf9ce8
parent2285fa5abbd3be7d3c333cf120aa59d0b54638a6
CUDA: loop over ne2*ne3 in case it overflows (llama/19538)

* CUDA: loop over ne2*ne3 in case it overflows

* use fastdiv
src/ggml-cuda/convert.cu