git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	JJJYmmm <redacted>
	Thu, 30 Oct 2025 15:19:14 +0000 (23:19 +0800)
committer	GitHub <redacted>
	Thu, 30 Oct 2025 15:19:14 +0000 (16:19 +0100)
commit	d261223d24e97f2df50220e4a5b7f0adb69bba81
tree	2f4e3204c844895f75af5d0b2b2039f71c8be427	tree
parent	dcca0d3ab840ebe9b2ccd4719033d408eeb758d7	commit \| diff

model: add support for qwen3vl series (#16780)

* support qwen3vl series.

Co-authored-by: Thireus ☠ <redacted>
Co-authored-by: yairpatch <redacted>
Co-authored-by: LETS-BEE <redacted>
* bugfix: fix the arch check for qwen3vl-moe.

* use build_ffn

* optimize deepstack structure

* optimize deepstack feature saving

* Revert "optimize deepstack feature saving" for temporal fix

This reverts commit f321b9fdf13e59527408152e73b1071e19a87e71.

* code clean

* use fused qkv in clip

* clean up / rm is_deepstack_layers for simplification

* add test model

* move test model to "big" section

* fix imrope check

* remove trailing whitespace

* fix rope fail

* metal : add imrope support

* add imrope support for sycl

* vulkan: add imrope w/o check

* fix vulkan

* webgpu: add imrope w/o check

* Update gguf-py/gguf/tensor_mapping.py

Co-authored-by: Sigbjørn Skjæret <redacted>
* fix tensor mapping

---------

Co-authored-by: Thireus ☠ <redacted>
Co-authored-by: yairpatch <redacted>
Co-authored-by: LETS-BEE <redacted>
Co-authored-by: Xuan Son Nguyen <redacted>
Co-authored-by: Georgi Gerganov <redacted>
Co-authored-by: Sigbjørn Skjæret <redacted>

convert_hf_to_gguf.py		diff \| blob \| history
ggml/include/ggml.h		diff \| blob \| history
ggml/src/ggml-cpu/ops.cpp		diff \| blob \| history
ggml/src/ggml-cuda/rope.cu		diff \| blob \| history
ggml/src/ggml-metal/ggml-metal-device.cpp		diff \| blob \| history
ggml/src/ggml-metal/ggml-metal-impl.h		diff \| blob \| history
ggml/src/ggml-metal/ggml-metal.metal		diff \| blob \| history
ggml/src/ggml-sycl/rope.cpp		diff \| blob \| history
ggml/src/ggml-vulkan/ggml-vulkan.cpp		diff \| blob \| history
ggml/src/ggml-vulkan/vulkan-shaders/rope_head.glsl		diff \| blob \| history
ggml/src/ggml-vulkan/vulkan-shaders/rope_multi.comp		diff \| blob \| history
ggml/src/ggml-webgpu/wgsl-shaders/rope.tmpl.wgsl		diff \| blob \| history
gguf-py/gguf/constants.py		diff \| blob \| history
gguf-py/gguf/gguf_writer.py		diff \| blob \| history
gguf-py/gguf/tensor_mapping.py		diff \| blob \| history
include/llama.h		diff \| blob \| history
src/llama-arch.cpp		diff \| blob \| history
src/llama-arch.h		diff \| blob \| history
src/llama-hparams.cpp		diff \| blob \| history
src/llama-hparams.h		diff \| blob \| history
src/llama-kv-cache.cpp		diff \| blob \| history
src/llama-model.cpp		diff \| blob \| history
tests/test-backend-ops.cpp		diff \| blob \| history
tests/test-rope.cpp		diff \| blob \| history
tools/mtmd/clip-impl.h		diff \| blob \| history
tools/mtmd/clip.cpp		diff \| blob \| history
tools/mtmd/mtmd.cpp		diff \| blob \| history
tools/mtmd/tests.sh		diff \| blob \| history