git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	compilade <redacted>
	Thu, 8 Aug 2024 17:33:09 +0000 (13:33 -0400)
committer	GitHub <redacted>
	Thu, 8 Aug 2024 17:33:09 +0000 (13:33 -0400)
commit	3a14e00366399040a139c67dd5951177a8cb5695
tree	600c6b5efabfbec3787fe4f83c8db30d0004dd85	tree
parent	afd27f01fe832ece3d07ef03b7d34a9e80c4a895	commit \| diff

gguf-py : simplify support for quant types (#8838)

* gguf-py : use classes for quants

* convert_hf : simplify internal quantization type selection

* gguf-py : fix flake8 lint

* gguf-py : fix BF16 numpy view type

* gguf-py : remove LlamaFileTypeMap

Too specific to 'llama.cpp', and would be a maintenance burden
to keep up to date.

* gguf-py : add generic quantize and dequantize functions

The quant classes no longer need to be known,
only the target or the source type,
for 'quantize' and 'dequantize', respectively.

convert_hf_to_gguf.py		diff \| blob \| history
gguf-py/gguf/constants.py		diff \| blob \| history
gguf-py/gguf/lazy.py		diff \| blob \| history
gguf-py/gguf/quants.py		diff \| blob \| history