mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-01-26 17:20:01 +00:00
* Use bperm trick for iq3_ks - 5% PP performance gain * Use bperm trick for iq3_k -> 5% PP performance gain * Use bperm trick for iq3_k -> 8% PP performance gain * Use bperm trick for iq3_k_r4 gemv -> ~5% faster * Use bperm trick for iq3_k gemv -> ~3% faster * Use bperm trick for iq3_k gemv -> 4.5% gain --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>