mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-02-25 23:54:10 +00:00
For iq4_kt this results in a massive prompt-processing (PP) improvement: PP512 goes from ~42 t/s to 128 t/s, roughly a 3x speedup.
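
As a rough sanity check on the quoted PP512 figures, the sketch below computes the speedup ratio and the wall-clock time to process a 512-token prompt at each rate. The ~42 t/s and 128 t/s values are taken from the text (the first is approximate); the helper names `speedup` and `prompt_seconds` are illustrative only and are not part of ik_llama.cpp.

```python
def speedup(before_tps: float, after_tps: float) -> float:
    """Ratio of new throughput to old throughput (tokens per second)."""
    return after_tps / before_tps


def prompt_seconds(n_tokens: int, tps: float) -> float:
    """Time in seconds to process n_tokens at a constant rate of tps tokens/s."""
    return n_tokens / tps


# Figures quoted in the text; the "before" value is approximate (~42 t/s).
PP_BEFORE = 42.0
PP_AFTER = 128.0

print(f"speedup: {speedup(PP_BEFORE, PP_AFTER):.2f}x")
print(
    f"512-token prompt: {prompt_seconds(512, PP_BEFORE):.1f} s"
    f" -> {prompt_seconds(512, PP_AFTER):.1f} s"
)
```

Because the baseline is only given approximately, the resulting ratio should be read as "about 3x" rather than as an exact measurement.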