CUDA: MMQ support for iq4_nl, iq4_xs (#8278)
author    Johannes Gäßler <redacted>
Fri, 5 Jul 2024 07:06:31 +0000 (09:06 +0200)
committer GitHub <redacted>
Fri, 5 Jul 2024 07:06:31 +0000 (09:06 +0200)
commit    8e558309dc149dc1f9fd159185b0b9071527ffb5
tree      a91d3dbc1b50e1ef4eff8bd5d568817ce6c08f94
parent    0a423800ffe4e5da3d83527ef3473da88cd78146
ggml/src/ggml-cuda/fattn-common.cuh
ggml/src/ggml-cuda/mmq.cu
ggml/src/ggml-cuda/mmq.cuh
ggml/src/ggml-cuda/template-instances/generate_cu_files.py
ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu [new file with mode: 0644]
ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu [new file with mode: 0644]
ggml/src/ggml-cuda/vecdotq.cuh
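
Background: IQ4_NL and IQ4_XS store 4-bit indices into a fixed non-linear codebook rather than linearly scaled quants, which is what the new mmq-instance-iq4_nl.cu and mmq-instance-iq4_xs.cu template instances have to handle. The following is a minimal, hypothetical sketch of that codebook lookup, not the kernels added by this commit: the table values follow ggml's kvalues_iq4nl, but the kernel name, block layout, and host driver are illustrative assumptions.

#include <cstdio>
#include <cstdint>
#include <cuda_runtime.h>

// Fixed non-linear codebook (values as in ggml's kvalues_iq4nl table);
// each 4-bit quant is an index into this table.
__constant__ int8_t kvalues_iq4nl[16] = {
    -127, -104, -83, -65, -49, -35, -22, -10,
       1,   13,  25,  38,  53,  69,  89, 113,
};

// Hypothetical demo kernel (not the commit's code): expand packed
// nibbles into int8 values via the codebook, the core lookup an
// MMQ-style kernel performs before its integer dot product.
__global__ void dequant_iq4nl_demo(const uint8_t *packed, int8_t *out, int n_bytes) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n_bytes) {
        uint8_t q = packed[i];
        out[2*i + 0] = kvalues_iq4nl[q & 0x0F]; // low nibble -> first value
        out[2*i + 1] = kvalues_iq4nl[q >> 4];   // high nibble -> second value
    }
}

int main() {
    const int n_bytes = 16;                     // 32 quantized values
    uint8_t h_packed[n_bytes];
    for (int i = 0; i < n_bytes; ++i)
        h_packed[i] = (uint8_t)(i | ((15 - i) << 4));

    uint8_t *d_packed; int8_t *d_out;
    cudaMalloc((void **)&d_packed, n_bytes);
    cudaMalloc((void **)&d_out, 2 * n_bytes);
    cudaMemcpy(d_packed, h_packed, n_bytes, cudaMemcpyHostToDevice);

    dequant_iq4nl_demo<<<1, n_bytes>>>(d_packed, d_out, n_bytes);

    int8_t h_out[2 * n_bytes];
    cudaMemcpy(h_out, d_out, 2 * n_bytes, cudaMemcpyDeviceToHost);
    for (int i = 0; i < 2 * n_bytes; ++i) printf("%d ", h_out[i]);
    printf("\n");

    cudaFree(d_packed);
    cudaFree(d_out);
    return 0;
}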