ik_llama.cpp/ggml
Iwan Kawrakow fb6a0d0184 iq1_s_r4: MMQ on CUDA
Requires Turing or better (will fall back to dequantize+cuBLAS on older cards).
2025-06-04 15:11:17 +03:00
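
The fallback rule in the commit message can be sketched as a compute-capability check. This is a minimal illustration and not the code from this commit: the names `ComputeCapability`, `is_turing_or_newer`, and `pick_iq1_s_r4_path` are hypothetical, and the only outside fact assumed is that Turing GPUs report CUDA compute capability 7.5.

```cpp
#include <string>

// Hypothetical names for illustration only; not taken from ik_llama.cpp.
struct ComputeCapability {
    int major;
    int minor;
};

// Turing GPUs report CUDA compute capability 7.5.
constexpr ComputeCapability kTuring{7, 5};

// True when the device is Turing or a newer architecture.
constexpr bool is_turing_or_newer(ComputeCapability cc) {
    return cc.major > kTuring.major ||
           (cc.major == kTuring.major && cc.minor >= kTuring.minor);
}

// Chooses the matrix-multiplication path for iq1_s_r4 as the commit message
// describes it: MMQ on Turing or better, dequantize + cuBLAS otherwise.
std::string pick_iq1_s_r4_path(ComputeCapability cc) {
    return is_turing_or_newer(cc) ? "mmq" : "dequantize+cublas";
}
```

In a real CUDA program the major and minor values would typically come from `cudaGetDeviceProperties`. Here they are passed in directly so the sketch compiles without the CUDA toolkit.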