]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
(Bugfix, ggml-cuda) Pool alloc count fix + small size computation type adjustment...
authorpl752 <redacted>
Sat, 3 Jan 2026 10:13:40 +0000 (15:13 +0500)
committerGeorgi Gerganov <redacted>
Sun, 11 Jan 2026 09:02:08 +0000 (11:02 +0200)
commita90aefcb055624e3e6786cce84c223acdad4cf3a
treeb6889d684d82abafdb555afe135667ad1c9bc43a
parentf0858b15cfda3b9c4b236faa8430f4a31954be57
(Bugfix, ggml-cuda) Pool alloc count fix + small size computation type adjustment (llama/18559)

* CUDA: Fixed obj byte size instead of obj count being passed to pool alloc (fattn-common, dst_tmp_meta)

* CUDA: Explicitly casted some of the int alloc counts before multiplication in argsort

---------

Co-authored-by: pl752 <redacted>
src/ggml-cuda/argsort.cu
src/ggml-cuda/fattn-common.cuh