]>
author | Johannes Gäßler <redacted> | |
Fri, 5 Jul 2024 07:06:31 +0000 (09:06 +0200) | ||
committer | Georgi Gerganov <redacted> | |
Mon, 8 Jul 2024 10:03:28 +0000 (13:03 +0300) | ||
commit | d9aecdf3f55f1c639a40267428079913ce8fb177 | |
tree | 1773cb00388b8bd4c1f923588d61e4661f3b7331 | tree |
parent | a0e55a4255638ad89d3057f243016e9b9d7c3dfc | commit | diff |
src/ggml-cuda/fattn-common.cuh | diff | blob | history | |
src/ggml-cuda/mmq.cu | diff | blob | history | |
src/ggml-cuda/mmq.cuh | diff | blob | history | |
src/ggml-cuda/template-instances/generate_cu_files.py | diff | blob | history | |
src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu | [new file with mode: 0644] | blob |
src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu | [new file with mode: 0644] | blob |
src/ggml-cuda/vecdotq.cuh | diff | blob | history |