ik_llama.cpp

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-02-24 07:04:11 +00:00

Files

Iwan Kawrakow b7c986e4ff Experiments for 2.6875 bpw quants

At least according to RMSE, this is significantly better than
q2_K, while using only 1/16 of a bit more per weight.
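
The arithmetic behind the commit message can be sketched as follows. This is a minimal illustration, not code from the repository: `bits_per_weight` and `rmse` are hypothetical helper names. The 84-byte, 256-weight block for q2_K is assumed here because it yields 2.625 bpw, and the 86-byte block is just one layout that would give 2.6875 bpw; the commit itself does not state the layout of the experimental format.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Bits per weight of a block format: (bytes per block * 8) / weights per block.
// Hypothetical helper, not part of ik_llama.cpp.
constexpr double bits_per_weight(std::size_t block_bytes, std::size_t block_weights) {
    return 8.0 * static_cast<double>(block_bytes) / static_cast<double>(block_weights);
}

// Root-mean-square error between original weights and their dequantized values.
// Hypothetical helper; both vectors must be non-empty and of equal length.
inline double rmse(const std::vector<float>& original, const std::vector<float>& dequantized) {
    assert(!original.empty() && original.size() == dequantized.size());
    double sum_sq = 0.0;
    for (std::size_t i = 0; i < original.size(); ++i) {
        const double diff = static_cast<double>(original[i]) - static_cast<double>(dequantized[i]);
        sum_sq += diff * diff;
    }
    return std::sqrt(sum_sq / static_cast<double>(original.size()));
}
```

Under these assumptions, `bits_per_weight(86, 256) - bits_per_weight(84, 256)` is 0.0625, i.e. 1/16 of a bit per weight, which matches the difference quoted in the commit message. Comparing two formats then amounts to computing `rmse` against the same original weights for each and checking which value is lower.
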

2025-07-13 20:15:19 +03:00

CMakeLists.txt

2025-05-23 09:17:52 +03:00

quantize-stats.cpp

2025-07-13 20:15:19 +03:00