git.djapps.eu Git - pkg/ggml/sources/ggml/commit


author	Srihari-mcw <redacted>
	Thu, 20 Mar 2025 11:35:34 +0000 (17:05 +0530)
committer	Georgi Gerganov <redacted>
	Thu, 27 Mar 2025 07:35:24 +0000 (09:35 +0200)
commit	fcbd7b1a7455bce59333f474c7c48cc9a4dacd5d
tree	a5214b8acfd2415e7fc06bc3f9e097a5d3c74fcd
parent	807726590102f2df4e93ef8f5e100d28f757994b

ggml : block interleaving support for Q4_K quantization for x86 AVX2 architecture (llama/12332)

* Add block interleaving support for Q4_K quantization

* Remove whitespace and fix CI/CD issues

* Update pointer of bsums from int16_t to const int16_t

* Add vector version of quantize_q8_K_4x8 function

* Update code formatting based on review comments
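
For readers unfamiliar with the term, block interleaving generally means storing matching pieces of several rows next to each other so that a SIMD kernel can work on multiple rows per load. The sketch below illustrates only that general idea. The helper name `interleave_rows_4`, the choice of four rows, and the 8-byte chunk size are assumptions suggested by the `4x8` suffix of `quantize_q8_K_4x8`; none of this is taken from the commit's code, and real Q4_K blocks also carry scale and minimum data that is not modeled here.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper (not part of the ggml API): interleave four equally
// sized rows in chunks of `chunk` bytes, so that chunk k of rows 0..3 is
// stored contiguously, followed by chunk k+1 of rows 0..3, and so on.
std::vector<uint8_t> interleave_rows_4(const std::array<std::vector<uint8_t>, 4> & rows,
                                       std::size_t chunk = 8) {
    const std::size_t row_len = rows[0].size();
    for (const auto & r : rows) {
        assert(r.size() == row_len);              // all rows must have the same length
    }
    assert(chunk > 0 && row_len % chunk == 0);    // rows must split into whole chunks

    std::vector<uint8_t> out;
    out.reserve(4 * row_len);
    for (std::size_t off = 0; off < row_len; off += chunk) {
        for (const auto & r : rows) {
            // Append bytes [off, off + chunk) of this row.
            out.insert(out.end(), r.begin() + off, r.begin() + off + chunk);
        }
    }
    return out;
}
```

With four 16-byte rows, the output holds bytes 0..7 of rows 0, 1, 2, 3 in turn, then bytes 8..15 of the same rows, so a kernel reading the buffer sequentially sees the same position of all four rows side by side.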

src/ggml-cpu/ggml-cpu-aarch64.cpp


Packaging of ggml-org/ggml
