ik_llama.cpp/iqk-quantize.cpp at 7b3cb2b96c5e3acbde859472047b47568818f1b5

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-03-13 15:30:03 +00:00

Files

Kawrakow b0967ffa79 bitnet: fix scalar dot product

I had forgotten to adjust for the change to q8_K64.
On the M2 I'm getting 10.8 t/s with the scalar version!

2024-06-22 12:02:51 +03:00

View Raw