ik_llama.cpp/ggml
Iwan Kawrakow 23b7da78d1 Metal: speed up mul_mat_id
For the Granite-1B MoE model, PP-512 goes from
156 t/s to 890 t/s, roughly a 5.7x speedup.
2024-10-31 09:47:24 +01:00