ik_llama.cpp/include
Iwan Kawrakow b159b2b113 iq4_kss: CUDA dequantize works
So we can run perplexity. Sadly, the result does not look good
on the bpw vs quantization error plot.
2024-10-16 14:14:00 +03:00