git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	ddh0 <redacted>
	Fri, 20 Feb 2026 08:20:16 +0000 (02:20 -0600)
committer	GitHub <redacted>
	Fri, 20 Feb 2026 08:20:16 +0000 (09:20 +0100)
commit	492bc319782b1f13f302911f4c73437382cc8bb9
tree	1f675c604a3cae9f46b299a5e854309ee07cac30
parent	77d6ae4ac89bb879ada3989a748276dfe4553674

quantize : add --dry-run option (#19526)

* clean slate for branch

* use 6 characters for tensor dims

* add --dry-run to llama-quantize

* use 6 characters for tensor dims (cont.)

* no need to re-calculate ggml_nbytes for tensor

* fix indent

* show model and quant BPW when quant completes

* add example to --help

* new function `tensor_requires_imatrix`, add courtesy warning about imatrix

* missing __func__, move imatrix flag set

* logic error

* fixup tensor_requires_imatrix

* add missing `GGML_TYPE`s

* simplify and rename `tensor_type_requires_imatrix`

* simplify for style

* add back Q2_K edge case for imatrix

* guard ftype imatrix warning

* comment ref #12557

* remove per @compilade

* remove unused `params` parameter

* move `bool dry_run` per GG

* move `bool dry_run` per GG

* Update src/llama-quant.cpp

Co-authored-by: Sigbjørn Skjæret <redacted>

* Update src/llama-quant.cpp

Co-authored-by: Sigbjørn Skjæret <redacted>

* Update src/llama-quant.cpp

Co-authored-by: Sigbjørn Skjæret <redacted>

---------

Co-authored-by: Sigbjørn Skjæret <redacted>
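
For readers unfamiliar with the BPW (bits per weight) figure mentioned in the message above, here is a minimal sketch of how such a figure is conventionally computed: total bytes times 8, divided by the number of weights. The helper name `bits_per_weight` is hypothetical and is not taken from this commit; the actual code in src/llama-quant.cpp may compute or format the value differently.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (NOT from the commit): bits per weight for a tensor
// or for a whole model.
//   n_bytes    : total size of the weight data in bytes
//   n_elements : total number of weights stored in those bytes
double bits_per_weight(std::uint64_t n_bytes, std::uint64_t n_elements) {
    // BPW is undefined when there are no weights.
    assert(n_elements > 0);
    return 8.0 * static_cast<double>(n_bytes) / static_cast<double>(n_elements);
}
```

As a sanity check under this definition, an F16 tensor stores 2 bytes per weight and so comes out at 16 BPW, while a Q4_0 block packs 32 weights into 18 bytes, giving 4.5 BPW.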

include/llama.h
src/llama-impl.cpp
src/llama-quant.cpp
tools/quantize/quantize.cpp