mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-03-15 00:07:36 +00:00
This assertion can trigger during prefill as the MLA/KV tensors grow, e.g. Kimi K2 with n_ctx >= 32768.