git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	Pierrick Hymbert <redacted>
	Fri, 26 Apr 2024 18:06:33 +0000 (20:06 +0200)
committer	GitHub <redacted>
	Fri, 26 Apr 2024 18:06:33 +0000 (20:06 +0200)
commit	0c4d489e29e53589bf13a801fe7c94b7b546d8f6
tree	fc83fade919050b3a9471dd892d8aef438c39aaf	tree
parent	017e6999b5184234370b22a2f868e1be911e8d88	commit \| diff

quantize: add imatrix and dataset metadata in GGUF (#6658)

* imatrix: save the dataset file used in the output file

* llama: support kv overrides type string string

* common: factorize KV Overrides parsing between common and server

* quantize: add imatrix n entries and dataset KV metadata
quantize: factorize KV Overrides parsing between common
#6656

* llama: remove kv override str_value initialization as it does not compile on some toolchain

* quantize: add imatrix m_last_call as `quantize.imatrix.chunks_count`

* quantize: add imatrix filename in KV

* llama: add llama_model_kv_override_free

* common: add llama_model_kv_override_free
common: free kv override if used after model loading

* llama: finally move the string KV override value to the stack

* llama : minor

* no need to add a NUL to the std::vector, std::string can be initialized from a pair of iterators.

Co-authored-by: slaren <redacted>
* kv override: ensure string termination

---------

Co-authored-by: Georgi Gerganov <redacted>
Co-authored-by: slaren <redacted>

Makefile		diff \| blob \| history
common/common.cpp		diff \| blob \| history
common/common.h		diff \| blob \| history
examples/imatrix/imatrix.cpp		diff \| blob \| history
examples/quantize/CMakeLists.txt		diff \| blob \| history
examples/quantize/quantize.cpp		diff \| blob \| history
examples/server/server.cpp		diff \| blob \| history
llama.cpp		diff \| blob \| history
llama.h		diff \| blob \| history