Files
ik_llama.cpp/include
Iwan Kawrakow 93c6e295ee q2_k_r4: Zen4
PP-512(LLaMA-3.1-8B) = 256 t/s
2024-12-11 16:28:56 +02:00
..
2024-12-11 16:28:56 +02:00