Files
ik_llama.cpp/ggml
Kawrakow 3805c84686 Improve Bitnet PP on Metal (#108)
iq1_bn goes from 702 t/s to 716 t/s
iq2_bn goes from 714 t/s to 743 t/s

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
2024-10-26 15:13:45 +02:00
..
2024-07-27 07:55:01 +02:00
2024-10-26 15:13:45 +02:00
2024-07-27 07:55:01 +02:00
2024-10-04 14:43:26 +03:00