Files
ik_llama.cpp/ggml
Iwan Kawrakow 2afe2e1d41 Slightly faster FA for bf16 KV cache
~2-3% sort of thing. Sadly, when we go beyond 8k tokens, the
advantage kind of goes away.
2025-01-13 17:47:47 +02:00
..
2024-07-27 07:55:01 +02:00
2025-01-13 17:47:47 +02:00
2024-07-27 07:55:01 +02:00
2024-10-04 14:43:26 +03:00