]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
model: add support for qwen3vl series (llama/16780)
authorJJJYmmm <redacted>
Thu, 30 Oct 2025 15:19:14 +0000 (23:19 +0800)
committerGeorgi Gerganov <redacted>
Sat, 1 Nov 2025 07:41:35 +0000 (09:41 +0200)
commitfd33b4bb05d99c0784892a8f4e38d4c7a4d873e3
treec341a68a22f6c50db90f73d24f5dabd664f77859
parentd508a2bbd525edd31f5073688b93433fec900dd4
model: add support for qwen3vl series (llama/16780)

* support qwen3vl series.

Co-authored-by: Thireus ☠ <redacted>
Co-authored-by: yairpatch <redacted>
Co-authored-by: LETS-BEE <redacted>
* bugfix: fix the arch check for qwen3vl-moe.

* use build_ffn

* optimize deepstack structure

* optimize deepstack feature saving

* Revert "optimize deepstack feature saving" for temporal fix

This reverts commit f321b9fdf13e59527408152e73b1071e19a87e71.

* code clean

* use fused qkv in clip

* clean up / rm is_deepstack_layers for simplification

* add test model

* move test model to "big" section

* fix imrope check

* remove trailing whitespace

* fix rope fail

* metal : add imrope support

* add imrope support for sycl

* vulkan: add imrope w/o check

* fix vulkan

* webgpu: add imrope w/o check

* Update gguf-py/gguf/tensor_mapping.py

Co-authored-by: Sigbjørn Skjæret <redacted>
* fix tensor mapping

---------

Co-authored-by: Thireus ☠ <redacted>
Co-authored-by: yairpatch <redacted>
Co-authored-by: LETS-BEE <redacted>
Co-authored-by: Xuan Son Nguyen <redacted>
Co-authored-by: Georgi Gerganov <redacted>
Co-authored-by: Sigbjørn Skjæret <redacted>
12 files changed:
include/ggml.h
src/ggml-cpu/ops.cpp
src/ggml-cuda/rope.cu
src/ggml-metal/ggml-metal-device.cpp
src/ggml-metal/ggml-metal-impl.h
src/ggml-metal/ggml-metal.metal
src/ggml-sycl/rope.cpp
src/ggml-vulkan/ggml-vulkan.cpp
src/ggml-vulkan/vulkan-shaders/rope_head.glsl
src/ggml-vulkan/vulkan-shaders/rope_multi.comp
src/ggml-webgpu/wgsl-shaders/rope.tmpl.wgsl
tests/test-backend-ops.cpp