git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
author: Olivier Chafik <redacted>
  Sun, 25 May 2025 23:30:51 +0000 (00:30 +0100)
committer: GitHub <redacted>
  Sun, 25 May 2025 23:30:51 +0000 (00:30 +0100)
commit: e121edc4324a640be11b7e567edd39b721b0f8e4
tree: 49d2cc0f39354908b5f1f9747d8252a830f6a599
parent: 2f099b510f460374acd52742b494595e3e3442d3
`server`: add `--reasoning-budget 0` to disable thinking (incl. qwen3 w/ enable_thinking:false) (#13771)

---------

Co-authored-by: ochafik <redacted>
Co-authored-by: Xuan-Son Nguyen <redacted>
12 files changed:
common/arg.cpp
common/chat.cpp
common/chat.h
common/common.h
models/templates/Qwen-Qwen3-0.6B.jinja [new file with mode: 0644]
models/templates/README.md
tests/test-chat.cpp
tools/server/README.md
tools/server/server.cpp
tools/server/tests/unit/test_template.py
tools/server/tests/utils.py
tools/server/utils.hpp
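
The commit title names the new option, `--reasoning-budget 0`. As a rough illustration only, the sketch below assembles a server launch command that passes it. The helper `server_command` and the model path are hypothetical. The `llama-server` binary name and the `--jinja` flag are assumptions about the surrounding setup (chosen because the change touches chat templates) and are not stated on this page.

```python
import shlex


def server_command(model_path: str, disable_thinking: bool) -> list[str]:
    """Build an argv list for launching the server (hypothetical helper)."""
    # Assumed binary name and template flag; adjust to the local build.
    argv = ["llama-server", "-m", model_path, "--jinja"]
    if disable_thinking:
        # Flag and value as given in the commit title.
        argv += ["--reasoning-budget", "0"]
    return argv


if __name__ == "__main__":
    # The model path is a placeholder, not a file referenced by the commit.
    print(shlex.join(server_command("models/qwen3.gguf", disable_thinking=True)))
```

Building the command as a list rather than a single string avoids quoting problems if the model path contains spaces; `shlex.join` is used only to print it.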