mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-02-25 15:44:10 +00:00
We get 225.7 t/s for L3-8B. In comparison q8_0 without run-tinme-repacking is at 169 t/s.
We get 225.7 t/s for L3-8B. In comparison q8_0 without run-tinme-repacking is at 169 t/s.