Files
ik_llama.cpp/iqk-quantize.cpp
Iwan Kawrakow f6863cfa1b bitnet: add 2 bpw quantization
The scalar dot product already chieves 37 t/s for TG!
2024-06-22 12:02:51 +03:00

15 KiB