ik_llama.cpp/ggml
Iwan Kawrakow 26d677483e q4_k_r4: finally works on Zen4
I had forgotten to prevent token_embd.weight being quantized
with q4_k_r4!
2024-12-09 11:45:35 +02:00