Files
ik_llama.cpp/iqk_mul_mat.cpp
Kawrakow 766975ecfa Bitnet(2.25 bpw): NEON
We get PP-512 = 192 t/s, TG-128 = 72 t/s
2024-06-22 12:02:52 +03:00

184 KiB