]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
mtmd: refactor preprocessing + support max/min pixels (#16878)
authorXuan-Son Nguyen <redacted>
Sat, 1 Nov 2025 14:51:36 +0000 (15:51 +0100)
committerGitHub <redacted>
Sat, 1 Nov 2025 14:51:36 +0000 (15:51 +0100)
commitcf659bbb8ef9eb048e5153a27cf787fd83c05560
tree215f09ecdcc68c670ae672f608b306a79e1b98f3
parentd8b860a219c2415faac8cc0e50b48b4aa11e3b64
mtmd: refactor preprocessing + support max/min pixels (#16878)

* mtmd: refactor preprocessing + support max/min pixels

* fix mlp type

* implement mix/max pixels

* improve hparams

* better image preproc for qwen

* fix

* fix out of bound composite

* fix (2)

* fix token calculation

* get_merge_kernel_size()

* fix llama4 and lfm2

* gonna fix them all

* use simple resize for qwen

* qwen: increase min tokens

* no resize if dst size == src size

* restore to initial min/max tokens value for qwen
tools/mtmd/clip-impl.h
tools/mtmd/clip.cpp