mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-02-28 17:14:17 +00:00
PP-512 becomes 834 t/s and TG-128 now saturates to the same performance as iq2_bn for 4 threads.
PP-512 becomes 834 t/s and TG-128 now saturates to the same performance as iq2_bn for 4 threads.