Files
ik_llama.cpp/ggml
Iwan Kawrakow c85d7bef55 iq2_bn: improve performance on NEON
We now get TG-128 = 100 t/s for Bitnet-3B-1.58b!
2024-09-09 06:46:25 +02:00
..
2024-07-27 07:55:01 +02:00
2024-09-09 06:46:25 +02:00
2024-07-27 07:55:01 +02:00