Files
ik_llama.cpp/ggml
Iwan Kawrakow 8178075f84 iq2_tn: small NEON improvement
For TriLM-3.9B we now get PP-512 = 206.6 t/s and TG-128 = 76.4 t/s.
2024-08-06 12:08:22 +02:00