Files
ik_llama.cpp/ggml
Iwan Kawrakow 7cbe979ee0 iq4_kss: somewhat faster Metal dot product
45.75 t/s -> 48.75 t/s.
Still 22% slower than q4_0
2024-10-16 14:14:00 +03:00
..
2024-07-27 07:55:01 +02:00
2024-10-16 14:14:00 +03:00
2024-07-27 07:55:01 +02:00
2024-10-04 14:43:26 +03:00