git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	Olivier Chafik <redacted>
	Thu, 13 Feb 2025 10:05:16 +0000 (10:05 +0000)
committer	GitHub <redacted>
	Thu, 13 Feb 2025 10:05:16 +0000 (10:05 +0000)
commit	c7f460ab882065ec5696e3d51e24dbf67b539287
tree	2b3d8da6aa66358f9fa478e2b1a4e6c3ef5fdc20	tree
parent	27e8a23300e30cd6ff6107ce262acf832ca60597	commit \| diff

`server`: fix tool-call of DeepSeek R1 Qwen, return reasoning_content (Command 7RB & DeepSeek R1) unless `--reasoning-format none` (#11607)

* extract & return thoughts in reasoning_content field (unless --reasoning-format) for DeepSeek R1 & Command R7B

* tool-calls: add deepseek r1 template (models/templates/llama-cpp-deepseek-r1.jinja) + hackommodate broken official template

* tool-calls: accommodate variety of wrong tool call opening tags both R1 Qwen 32B and 7B distills like to spit out

* server/oai: ensure content is null when there are tool calls, and reasoning_content appears before content for readability

* tool-calls: add DeepSeek R1 Qwen distills to server/README.md & server tests

Co-authored-by: Georgi Gerganov <redacted>
---------

Co-authored-by: Georgi Gerganov <redacted>

common/arg.cpp		diff \| blob \| history
common/chat.cpp		diff \| blob \| history
common/chat.hpp		diff \| blob \| history
common/common.h		diff \| blob \| history
common/sampling.cpp		diff \| blob \| history
examples/server/README.md		diff \| blob \| history
examples/server/server.cpp		diff \| blob \| history
examples/server/tests/unit/test_tool_call.py		diff \| blob \| history
examples/server/tests/utils.py		diff \| blob \| history
examples/server/utils.hpp		diff \| blob \| history
models/templates/README.md	[new file with mode: 0644]	blob
models/templates/deepseek-ai-DeepSeek-R1-Distill-Llama-8B.jinja		diff \| blob \| history
models/templates/deepseek-ai-DeepSeek-R1-Distill-Qwen-32B.jinja		diff \| blob \| history
models/templates/llama-cpp-deepseek-r1.jinja	[new file with mode: 0644]	blob
scripts/get_chat_template.py	[changed mode: 0644->0755]	diff \| blob \| history
src/llama-grammar.cpp		diff \| blob \| history
tests/test-chat.cpp		diff \| blob \| history