]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: fix MMQ stream-k rounding if ne00 % 128 != 0 (llama/8311)
authorJohannes Gäßler <redacted>
Fri, 5 Jul 2024 07:05:34 +0000 (09:05 +0200)
committerGeorgi Gerganov <redacted>
Mon, 8 Jul 2024 11:53:55 +0000 (14:53 +0300)
commite89fdceec218fb94e7fd3afc184826b168ab7406
treee9165a7756a7be670dfd4737d9a39ff9adb6b918
parent29a2739d279c871b0a0ec5fb00586cd158aab0e7
CUDA: fix MMQ stream-k rounding if ne00 % 128 != 0 (llama/8311)
ggml/src/ggml-cuda/mmq.cuh