git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

author	Georgi Gerganov <redacted>
	Sat, 22 Apr 2023 07:55:35 +0000 (10:55 +0300)
committer	GitHub <redacted>
	Sat, 22 Apr 2023 07:55:35 +0000 (10:55 +0300)
commit	955ef9a5d53d8f911fe00580ac9bd0caa56430af
tree	d60f9ac6b426c8f3e59992691d7686c2d7ff89db	tree
parent	c5aa5e577741d0359ad26ec50b9e21a74c65d911	commit \| diff

ggml : alternative Q4_3 implementation using modified Q8_0 (#1109)

* ggml : prefer vzip to vuzp

This way we always use the same type of instruction across all quantizations

* ggml : alternative Q4_3 implementation using modified Q8_0

* ggml : fix Q4_3 scalar imlpementation

* ggml : slight improvement of Q4_3 - no need for loop unrolling

* ggml : fix AVX paths for Q8_0 quantization

ggml.c

diff | blob | history

Packaging of ggml-org/llama.cpp

RSS Atom