]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
Faster Q5_K and Q6_K on Metal (#2294)
authorKawrakow <redacted>
Thu, 20 Jul 2023 15:19:45 +0000 (18:19 +0300)
committerGitHub <redacted>
Thu, 20 Jul 2023 15:19:45 +0000 (18:19 +0300)
commite782c9e735f93ab4767ffc37462c523b73a17ddc
treeb5c87fd34707ec9aa7a7a9716b2ef96157c033c5
parent785829dfe8baf0213f2ff66963d28c62f92d7930
Faster Q5_K and Q6_K on Metal (#2294)

* Faster Q6_K on Metal

* Faster Q5_K on Metal

* Another Q5_K speedup

---------

Co-authored-by: Iwan Kawrakow <redacted>
ggml-metal.m
ggml-metal.metal