Files
ik_llama.cpp/common/chat.cpp
Leo Zhang cb58a561f0 fix: Kimi-K2 parser ignores enable_thinking=false, response goes to reasoning_content (#1686)
When `enable_thinking` is false (e.g. via `chat_template_kwargs: {thinking: false}`),
the Kimi-K2 parser was still extracting reasoning tokens and routing all output to
`reasoning_content` instead of `content`.

The fix aligns with how other thinking models (Qwen3, etc.) handle this: check both
`reasoning_format != NONE` and `enable_thinking` before enabling reasoning extraction.

Fixes #1685
2026-04-24 17:37:29 +02:00

107 KiB