ik_llama.cpp/ggml
Iwan Kawrakow eeea8af04f iq5_ks
63.8 t/s -> 166 t/s. iq5_ks_r4 is at 107.4 t/s.
But: iq5_ks_r4 TG performance is quite a bit better:
21.7 t/s vs 17.7 t/s for iq5_ks.
2025-06-22 18:49:42 +02:00