Files
ik_llama.cpp/ggml
Iwan Kawrakow e9bb1a54ee iq2_bn: improve AVX2 implementation
We now get PP-512 = 753 t/s up from 680 t/s.
2024-09-09 12:38:36 +03:00
..
2024-07-27 07:55:01 +02:00
2024-09-09 12:38:36 +03:00
2024-07-27 07:55:01 +02:00