Mirror of https://github.com/ikawrakow/ik_llama.cpp.git, synced 2026-05-12 17:05:57 +00:00.
* wip: separate llama_context for MTP with graph reuse
* wip: fix KV cache desync with separate MTP context
* refactor: remove dead MTP logic code, encapsulate KV mirroring
* mtp-context: derive args directly from the main model's context
* mtp: fix KV cache positions
* clean up small comments
* minor refactor for context shift
163 KiB