Files
ik_llama.cpp/common
firecoperana bb358223cd server: cache prompt to host memory (#954)
* server : host-memory prompt caching

change similarity calculation and prompt save conditions

Remove unneeded token limit

rename variable

Separate prompt save and load logic

change default values

change log

remove truncate prompt logic

* add description

* bug fixes

* remove token limit in init

---------

Co-authored-by: firecoperana <firecoperana>
2025-11-14 18:40:13 +02:00
..
2024-07-27 07:55:01 +02:00
2025-09-01 08:38:49 +03:00
2024-07-27 07:55:01 +02:00
2023-11-13 14:16:23 +02:00