Files
ik_llama.cpp/ggml
Iwan Kawrakow fe24edab76 Fixed various issues
As we don't have a way to tell if a repacked quant has been modified,
I had to remove the modification at the expense of a slight decrease
in performance. This affects q8_0_r8, q8_KV_r8, q8_k_r8 on Zen4, and
q4_0_r8 on ARM.
2025-03-20 11:52:59 +02:00
..
2024-07-27 07:55:01 +02:00
2025-03-20 11:52:59 +02:00
2024-07-27 07:55:01 +02:00