Files
ik_llama.cpp/llama.cpp
Georgi Gerganov 110487aa7b llama : pad KV cache size (#4280)
* llama : pad KV cache size to 32

* metal : try to improve batched decoding
2023-12-03 10:58:16 +02:00

372 KiB