git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: revert part of the RDNA1 optimizations (llama/8309)
author Daniele <redacted>
Fri, 5 Jul 2024 07:06:09 +0000 (07:06 +0000)
committer Georgi Gerganov <redacted>
Mon, 8 Jul 2024 10:03:28 +0000 (13:03 +0300)
commit a0e55a4255638ad89d3057f243016e9b9d7c3dfc
tree 1fac310afe026285633fc2fb083886599c9bbe7e
parent ace94813b15537f8aa0c21285f581d2181c2ac75
The change to the launch_bounds was causing a small performance drop of 25 t/s in perplexity runs.
src/ggml-cuda/mmq.cuh