ik_llama.cpp

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-02-24 15:14:10 +00:00

Files

Iwan Kawrakow eeeca319dd iq2_kt: Metal GEMV

Performance is actually quite decent: 52 t/s on my M2-Max for LlaMA-3.1-8B

2025-05-30 11:39:59 +03:00

2024-07-27 07:55:01 +02:00

2025-05-23 09:17:52 +03:00

iq2_kt: Metal GEMV

2025-05-30 11:39:59 +03:00

.gitignore

2024-07-27 07:55:01 +02:00

CMakeLists.txt

2025-05-17 11:21:58 +03:00