ik_llama.cpp/ggml
Iwan Kawrakow f2be982fd8 New iq2_kt: Metal - very slow.
It seems Apple Silicon cannot quickly add four 8-bit ints,
or I don't know how to do it: I didn't find anything for this
in the Metal Shading Language Specification.
As a result, performance is quite a bit worse than with the original trellis.
2025-06-18 15:35:54 +03:00
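
For readers unfamiliar with the operation, one plausible reading of "add 4 8-bit ints" is summing the four bytes packed into a single 32-bit word. The commit does not show its code, so the following is only a portable C++ sketch under that assumption. The names sum_packed_u8x4 and sum_packed_u8x4_mul are hypothetical and do not come from ik_llama.cpp or Metal.

```cpp
#include <cstdint>

// Hypothetical helper (not from the commit): sum the four unsigned bytes
// packed into one 32-bit word using shifts and masks.
inline uint32_t sum_packed_u8x4(uint32_t w) {
    return (w & 0xffu) + ((w >> 8) & 0xffu) + ((w >> 16) & 0xffu) + (w >> 24);
}

// Hypothetical variant: a single multiply accumulates all four bytes into
// the top byte. This is only correct when the four bytes sum to at most 255
// (for example, when every byte is at most 63), because otherwise carries
// between byte lanes corrupt the result.
inline uint32_t sum_packed_u8x4_mul(uint32_t w) {
    return (w * 0x01010101u) >> 24;
}
```

Without a hardware instruction that does this in one step, the shift-and-mask form costs several integer operations per word, which is consistent with the slowdown the commit message describes.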