Files
ik_llama.cpp/ggml
Iwan Kawrakow ee8f966202 q4_0_r8 (Zen4) - slightly better
282 t/s for a pure q4_0 L3-8B quantization.
2025-01-27 09:19:13 +02:00
..
2024-07-27 07:55:01 +02:00
2025-01-27 09:19:13 +02:00
2024-07-27 07:55:01 +02:00
2024-10-04 14:43:26 +03:00