git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

author	Johannes Gäßler <redacted>
	Mon, 1 Jul 2024 18:39:06 +0000 (20:39 +0200)
committer	GitHub <redacted>
	Mon, 1 Jul 2024 18:39:06 +0000 (20:39 +0200)
commit	cb5fad4c6c2cbef92e9b8b63449e1cb7664e4846
tree	462520fd21f3ce9142b61a4b1e700fea577d58f1	tree
parent	dae57a1ebc1c9bd5693ab999e19d77c5506ae559	commit \| diff

CUDA: refactor and optimize IQ MMVQ (#8215)

* CUDA: refactor and optimize IQ MMVQ

* uint -> uint32_t

* __dp4a -> ggml_cuda_dp4a

* remove MIN_CC_DP4A checks

* change default

* try CI fix

Packaging of ggml-org/llama.cpp

RSS Atom

ggml/src/ggml-common.h		diff \| blob \| history
ggml/src/ggml-cuda.cu		diff \| blob \| history
ggml/src/ggml-cuda/common.cuh		diff \| blob \| history
ggml/src/ggml-cuda/fattn-common.cuh		diff \| blob \| history
ggml/src/ggml-cuda/mmvq.cu		diff \| blob \| history
ggml/src/ggml-cuda/vecdotq.cuh		diff \| blob \| history
ggml/src/ggml-sycl/mmvq.cpp		diff \| blob \| history
ggml/src/ggml-sycl/vecdotq.hpp		diff \| blob \| history