]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
Fix more int overflow during quant (PPL/CUDA). (#6563)
authorDAN™ <redacted>
Sun, 28 Apr 2024 22:38:44 +0000 (18:38 -0400)
committerGitHub <redacted>
Sun, 28 Apr 2024 22:38:44 +0000 (00:38 +0200)
commite00b4a8f816ebc45b98a46e5f5231359b9a017e0
treef47ae2efca8cc0a5b82152b850ee8708fb500365
parent7bb36ccf91b8a2e92b182dd75624f1fd7cb205ac
Fix more int overflow during quant (PPL/CUDA). (#6563)

* Fix more int overflow during quant.

* Fix some more int overflow in softmax.

* Revert back to int64_t.
ggml-cuda/convert.cu
ggml-cuda/softmax.cu