Files
ik_llama.cpp/ggml
Iwan Kawrakow 57df5ccdd7 iq2_k: Metal dot product finally works
It is slow: 45.4 t/s for 7B model vs 50 t/s for iq2_xs,
or 63.3 t/s for q2_K_S.
2024-08-01 09:38:06 +02:00