ik_llama.cpp

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-03-09 13:30:17 +00:00

Files

Iwan Kawrakow a0ba58e9b9 iq2_kt and iq3_kt work with new int trellis

Much slower than the fp16 based trellis. I guess, Apple doesn't
have int8_t SIMD on the M2-Max GPU.

2025-06-20 10:47:22 +03:00

2024-07-27 07:55:01 +02:00

2025-06-08 17:27:00 +03:00

2025-06-20 10:47:22 +03:00

.gitignore

2024-07-27 07:55:01 +02:00

CMakeLists.txt

2025-06-12 19:25:11 +03:00