Files
ik_llama.cpp/ggml
Iwan Kawrakow eeeca319dd iq2_kt: Metal GEMV
Performance is actually quite decent: 52 t/s on my M2-Max for LlaMA-3.1-8B
2025-05-30 11:39:59 +03:00
..
2024-07-27 07:55:01 +02:00
2025-05-30 11:39:59 +03:00
2024-07-27 07:55:01 +02:00