Files
ik_llama.cpp/llama.cpp
Kawrakow 318899c8b7 bitnet: add 2 bpw quantization
The scalar dot product already chieves 37 t/s for TG!
2024-06-22 12:02:51 +03:00

778 KiB