Files
ik_llama.cpp/include
Iwan Kawrakow c828bd3c57 iq3_k: Basics
Quantize/dequantize, CUDA dequantize.
PPL of LLaMA-3.1-8B is better than iq3_s and iq3_m.
2024-07-30 16:11:25 +03:00
..
2024-07-30 16:11:25 +03:00