mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-05-24 06:35:28 +00:00
When `enable_thinking` is false (e.g. via `chat_template_kwargs: {thinking: false}`),
the Kimi-K2 parser was still extracting reasoning tokens and routing all output to
`reasoning_content` instead of `content`.
The fix aligns with how other thinking models (Qwen3, etc.) handle this: check both
`reasoning_format != NONE` and `enable_thinking` before enabling reasoning extraction.
Fixes #1685
107 KiB
107 KiB