Files
ik_llama.cpp/ggml
Kawrakow fa668c7dcb iq6_k: Metal
About 4% slower than Q6_K for PP-512, but 10% faster for TG-128.
Someone has screwed up Q6_K TG performance on Metal? With the
continuous "improvements" in ggml I wouldn't be surprised.
Need to look into it later.
2024-08-09 16:00:31 +02:00