ik_llama.cpp/ggml
Kawrakow 9c1eea6048 iq3_k: AVX2 iqk_mul_mat
We get PP-512 = 196 t/s for LLaMA-3.1-8B on the Ryzen-5975WX.
2024-08-01 09:38:06 +02:00