mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-02-24 07:04:11 +00:00
At least according to RMSE, this is significantly better than q2_K while using only 1/16 more bits per weight.
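For readers unfamiliar with the metric, here is a minimal sketch of how an RMSE comparison between two quantization settings could be run. It is an illustration only: `toy_quantize_roundtrip` is a hypothetical block-wise uniform quantizer made up for this example, not the q2_K scheme or anything taken from this repository, and the random Gaussian weights stand in for real model tensors.

```python
import math
import random


def rmse(original, reconstructed):
    """Root-mean-square error between two equal-length sequences."""
    if len(original) != len(reconstructed):
        raise ValueError("sequences must have the same length")
    total = sum((a - b) ** 2 for a, b in zip(original, reconstructed))
    return math.sqrt(total / len(original))


def toy_quantize_roundtrip(weights, bits, block_size=16):
    """Hypothetical block-wise uniform quantizer (NOT the q2_K scheme).

    Each block is mapped to integer codes in [0, 2**bits - 1] using the
    block's own min and max, then mapped straight back to floats.
    """
    levels = (1 << bits) - 1
    out = []
    for start in range(0, len(weights), block_size):
        block = weights[start:start + block_size]
        lo, hi = min(block), max(block)
        # Guard against a constant block, where hi == lo.
        scale = (hi - lo) / levels if hi > lo else 1.0
        for w in block:
            q = round((w - lo) / scale)  # integer code
            out.append(lo + q * scale)   # dequantized value
    return out


# Stand-in data: real comparisons would use actual model tensors.
random.seed(0)
weights = [random.gauss(0.0, 1.0) for _ in range(4096)]

err_2bit = rmse(weights, toy_quantize_roundtrip(weights, bits=2))
err_3bit = rmse(weights, toy_quantize_roundtrip(weights, bits=3))
print(f"toy 2-bit RMSE: {err_2bit:.4f}")
print(f"toy 3-bit RMSE: {err_3bit:.4f}")
```

A lower RMSE means the dequantized weights sit closer to the originals. As the hedge in the commit message suggests, it is only a proxy and does not by itself guarantee better model output quality.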