ik_llama.cpp/iqk-quantize.cpp at e6d8441397971ef924a8f7cee3fe8f204f6917f2

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-03-10 05:50:08 +00:00

Kawrakow b0967ffa79 bitnet: fix scalar dot product

I had forgotten to adjust for the change to q8_K64.
On the M2 I'm getting 10.8 t/s with the scalar version!

2024-06-22 12:02:51 +03:00
