Files
ik_llama.cpp/ggml
Iwan Kawrakow 1be0a9e0d7 iq4_kt: go to 4.0 bpw
15 bits per group of 4, plus 8 bit scales ifor blocks of 32.
This gives a slightly better PPL than iq4_kss.
2024-11-21 08:16:41 +02:00
..
2024-07-27 07:55:01 +02:00
2024-11-21 08:16:41 +02:00
2024-07-27 07:55:01 +02:00
2024-10-04 14:43:26 +03:00