Files
ik_llama.cpp/ggml
Kawrakow 4819257ce6 Quantization improvements (#295)
* Better make_qx_quants

Tested with q4_0 and q3_K (pure, imatrix), and the improvement is
quite significant.

* Sae for iq4_nl, iq4_xs

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
2025-03-29 08:09:52 +01:00
..
2024-07-27 07:55:01 +02:00
2025-03-29 08:09:52 +01:00
2024-07-27 07:55:01 +02:00