Files
ik_llama.cpp/ggml
Iwan Kawrakow eeaa65d5b7 iq2_k: Metal dot product finally works
It is slow: 45.4 t/s for 7B model vs 50 t/s for iq2_xs,
or 63.3 t/s for q2_K_S.
2024-07-31 08:01:45 +02:00
..
2024-07-27 07:55:01 +02:00
2024-07-30 16:11:25 +03:00
2024-07-27 07:55:01 +02:00
2024-07-27 07:55:01 +02:00