]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
Fix more int overflow during quant (PPL/CUDA). (llama/6563)
authorDAN™ <redacted>
Sun, 28 Apr 2024 22:38:44 +0000 (18:38 -0400)
committerGeorgi Gerganov <redacted>
Sat, 11 May 2024 18:30:08 +0000 (21:30 +0300)
commit0aafe4f5356603cd12ee8f6462d8ddc5950ac9ed
tree74ede5937cb0e00065e4aaa5c87048dbe9b54bf9
parente6db6ac63a079026ee56a10360c80fa980275743
Fix more int overflow during quant (PPL/CUDA). (llama/6563)

* Fix more int overflow during quant.

* Fix some more int overflow in softmax.

* Revert back to int64_t.
src/ggml-cuda/convert.cu
src/ggml-cuda/softmax.cu