]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
gguf-py : simplify support for quant types (#8838)
authorcompilade <redacted>
Thu, 8 Aug 2024 17:33:09 +0000 (13:33 -0400)
committerGitHub <redacted>
Thu, 8 Aug 2024 17:33:09 +0000 (13:33 -0400)
commit3a14e00366399040a139c67dd5951177a8cb5695
tree600c6b5efabfbec3787fe4f83c8db30d0004dd85
parentafd27f01fe832ece3d07ef03b7d34a9e80c4a895
gguf-py : simplify support for quant types (#8838)

* gguf-py : use classes for quants

* convert_hf : simplify internal quantization type selection

* gguf-py : fix flake8 lint

* gguf-py : fix BF16 numpy view type

* gguf-py : remove LlamaFileTypeMap

Too specific to 'llama.cpp', and would be a maintenance burden
to keep up to date.

* gguf-py : add generic quantize and dequantize functions

The quant classes no longer need to be known,
only the target or the source type,
for 'quantize' and 'dequantize', respectively.
convert_hf_to_gguf.py
gguf-py/gguf/constants.py
gguf-py/gguf/lazy.py
gguf-py/gguf/quants.py