ik_llama.cpp/common
firecoperana bacb8fb79f Server: Handle context shift better to reduce prompt processing time (#973)
* Handle context shift better to reduce pp

Add context-shift args

Add back ga_n in context shift

* optimize discard function and bring back n_keep = -1

---------

Co-authored-by: firecoperana <firecoperana>
2025-11-19 16:04:48 +01:00