Files
ik_llama.cpp/ggml
Iwan Kawrakow 23e9033f7b iq2_kl: MMVQ
We get PP-128(L3-8B) = 162 t/s.
Which means that this is not quite as good as it should be as
(almost) same bpq q2_K is at 170 t/s.
2025-07-13 20:15:19 +03:00
..
2024-07-27 07:55:01 +02:00
2025-07-13 20:15:19 +03:00
2025-07-13 20:15:19 +03:00
2024-07-27 07:55:01 +02:00