ik_llama.cpp/common/chat.cpp at e722f0bb736dfee84a11408bb28f2f70ebc86259

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-05-24 06:35:28 +00:00

Leo Zhang cb58a561f0 fix: Kimi-K2 parser ignores enable_thinking=false, response goes to reasoning_content (#1686)

When `enable_thinking` is false (e.g. via `chat_template_kwargs: {thinking: false}`),
the Kimi-K2 parser was still extracting reasoning tokens and routing all output to
`reasoning_content` instead of `content`.

The fix aligns with how other thinking models (Qwen3, etc.) handle this: check both
`reasoning_format != NONE` and `enable_thinking` before enabling reasoning extraction.
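
A minimal sketch of the gating described above, not the actual code in `common/chat.cpp`. The types `ReasoningFormat`, `ParseOptions`, and `ParsedMessage`, the helpers `should_extract_reasoning` and `parse_output`, and the `<think>`/`</think>` delimiters are all hypothetical stand-ins chosen for illustration; only the two-part condition itself comes from the commit message.

```cpp
#include <string>

// Hypothetical stand-in for the parser's reasoning format setting.
enum class ReasoningFormat { None, Auto };

// Hypothetical options struct; the field names mirror the commit message.
struct ParseOptions {
    ReasoningFormat reasoning_format = ReasoningFormat::Auto;
    bool enable_thinking = true;
};

struct ParsedMessage {
    std::string reasoning_content;
    std::string content;
};

// Reasoning extraction is enabled only when BOTH conditions hold.
inline bool should_extract_reasoning(const ParseOptions & opts) {
    return opts.reasoning_format != ReasoningFormat::None && opts.enable_thinking;
}

// Splits raw model output into reasoning and content.
// The tag strings are assumed for this sketch.
inline ParsedMessage parse_output(const std::string & raw, const ParseOptions & opts) {
    static const std::string open_tag  = "<think>";
    static const std::string close_tag = "</think>";

    ParsedMessage msg;
    if (!should_extract_reasoning(opts)) {
        // Thinking disabled: everything is ordinary content.
        msg.content = raw;
        return msg;
    }

    const std::string::size_type start = raw.find(open_tag);
    if (start == std::string::npos) {
        msg.content = raw;
        return msg;
    }
    const std::string::size_type body = start + open_tag.size();
    const std::string::size_type end  = raw.find(close_tag, body);
    if (end == std::string::npos) {
        // Unterminated block: leave the output untouched in this sketch.
        msg.content = raw;
        return msg;
    }

    msg.reasoning_content = raw.substr(body, end - body);
    msg.content = raw.substr(0, start) + raw.substr(end + close_tag.size());
    return msg;
}
```

With `enable_thinking` set to false, `parse_output` returns the whole string in `content` and leaves `reasoning_content` empty, which is the behavior the fix restores.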

Fixes #1685

2026-04-24 17:37:29 +02:00

107 KiB
