ik_llama.cpp/iqk-quantize.cpp
Kawrakow b0967ffa79 bitnet: fix scalar dot product
I had forgotten to adjust for the change to q8_K64.
On the M2 I'm getting 10.8 t/s with the scalar version!
2024-06-22 12:02:51 +03:00

17 KiB