Files
ik_llama.cpp/ggml
Iwan Kawrakow 9802c771b8 iq3_k: fix Metal dot product
I was accessing the scales as 4-byte aligned, but iq3_k is
not 4-byte aligned. Instead of throwing an error (as it happens
on CUDA when one makes this mistake), Metal silently accepts
and we get garbage.
2024-10-14 10:15:25 +03:00
..
2024-07-27 07:55:01 +02:00
2024-10-14 10:15:25 +03:00
2024-07-27 07:55:01 +02:00
2024-10-04 14:43:26 +03:00