ik_llama.cpp/ggml
Iwan Kawrakow ef2b0066b9 On Zen4 repack fp16 models to bf16_r16 when run-time-repacking is requested
This massively improves performance. As this is opt-in, we do not worry
about possible precision loss in the f16 -> bf16 conversion.
2025-01-21 19:14:57 +02:00
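
The precision loss mentioned in the commit message comes from the formats themselves: fp16 stores 10 mantissa bits and 5 exponent bits, while bf16 stores 7 mantissa bits and 8 exponent bits, so three low mantissa bits are dropped in the conversion. The following is a minimal sketch of a scalar f16 -> bf16 conversion in C, assuming round-to-nearest-even. It is not the code from this commit: the function names are hypothetical, and the interleaved bf16_r16 layout and the Zen4 SIMD path are not shown.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: decode IEEE fp16 bits into a float (exact). */
static float fp16_bits_to_float(uint16_t h) {
    uint32_t sign = (uint32_t)(h & 0x8000u) << 16;
    uint32_t exp  = (h >> 10) & 0x1Fu;
    uint32_t man  = h & 0x3FFu;
    uint32_t bits;
    if (exp == 0) {
        if (man == 0) {
            bits = sign;                       /* signed zero */
        } else {
            /* fp16 subnormal: normalize the mantissa for fp32 */
            uint32_t shift = 0;
            while ((man & 0x400u) == 0) { man <<= 1; shift++; }
            man &= 0x3FFu;
            bits = sign | ((113u - shift) << 23) | (man << 13);
        }
    } else if (exp == 0x1Fu) {
        bits = sign | 0x7F800000u | (man << 13);   /* inf or NaN */
    } else {
        bits = sign | ((exp + 112u) << 23) | (man << 13);
    }
    float f;
    memcpy(&f, &bits, sizeof f);
    return f;
}

/* Hypothetical helper: float -> bf16 bits, round-to-nearest-even. */
static uint16_t float_to_bf16_bits(float f) {
    uint32_t bits;
    memcpy(&bits, &f, sizeof bits);
    if ((bits & 0x7FFFFFFFu) > 0x7F800000u) {
        return (uint16_t)((bits >> 16) | 0x0040u); /* keep NaN quiet */
    }
    uint32_t rounding = 0x7FFFu + ((bits >> 16) & 1u);
    return (uint16_t)((bits + rounding) >> 16);
}

/* Hypothetical helper: bf16 bits are the top 16 bits of a float. */
static float bf16_bits_to_float(uint16_t b) {
    uint32_t bits = (uint32_t)b << 16;
    float f;
    memcpy(&f, &bits, sizeof f);
    return f;
}

/* f16 -> bf16 by way of fp32; the first step is exact, the second rounds. */
static uint16_t fp16_to_bf16_bits(uint16_t h) {
    return float_to_bf16_bits(fp16_bits_to_float(h));
}
```

With these helpers, the fp16 value 0x3C01 (1 + 2^-10) maps to the bf16 value 0x3F80, which is exactly 1.0, so the smallest fp16 increment above 1.0 is lost. A value such as 0x3C10 (1 + 2^-6) fits in 7 mantissa bits and survives unchanged as 0x3F82.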