git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: MMQ support for iq4_nl, iq4_xs (llama/8278)
author Johannes Gäßler <redacted>
Fri, 5 Jul 2024 07:06:31 +0000 (09:06 +0200)
committer Georgi Gerganov <redacted>
Mon, 8 Jul 2024 10:03:28 +0000 (13:03 +0300)
commit d9aecdf3f55f1c639a40267428079913ce8fb177
tree 1773cb00388b8bd4c1f923588d61e4661f3b7331
parent a0e55a4255638ad89d3057f243016e9b9d7c3dfc
src/ggml-cuda/fattn-common.cuh
src/ggml-cuda/mmq.cu
src/ggml-cuda/mmq.cuh
src/ggml-cuda/template-instances/generate_cu_files.py
src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu [new file with mode: 0644]
src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu [new file with mode: 0644]
src/ggml-cuda/vecdotq.cuh