]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: fix MMQ stream-k rounding if ne00 % 128 != 0 (llama/8311)
authorJohannes Gäßler <redacted>
Fri, 5 Jul 2024 07:05:34 +0000 (09:05 +0200)
committerGeorgi Gerganov <redacted>
Mon, 8 Jul 2024 10:03:28 +0000 (13:03 +0300)
commitace94813b15537f8aa0c21285f581d2181c2ac75
tree2dc683019a2416f8e34536877778ace122b376bb
parentea98f39dedcd0391c75ca4f1c5a35d5becea8dbc
CUDA: fix MMQ stream-k rounding if ne00 % 128 != 0 (llama/8311)
src/ggml-cuda/mmq.cuh