mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-04-30 11:21:56 +00:00
🔀 #174 - On Zen4 repack fp16 models to bf16_r16
| Author | ikawrakow |
|---|---|
| State | ❌ Closed |
| Created | 2025-01-21 |
| Updated | 2025-01-21 |
Description
...when run-time repacking is requested via `-rtr`.
This massively improves performance. As this is opt-in, we do not worry about possible precision loss in the f16 -> bf16 conversion.
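To make the precision remark concrete, here is a minimal scalar sketch of an fp16 -> bf16 conversion. It is not the code from this PR: the function names are hypothetical, the round-to-nearest-even step is an assumption, and the interleaved `bf16_r16` layout is not modeled at all. What it illustrates is only the format difference: fp16 carries 10 explicit mantissa bits and bf16 carries 7, so the low 3 bits are rounded away, while bf16's 8 exponent bits cover the entire fp16 range.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical helper: decode an IEEE 754 binary16 bit pattern to float.
// Every fp16 value is exactly representable in fp32.
float fp16_to_fp32(uint16_t h) {
    uint32_t sign = static_cast<uint32_t>(h & 0x8000u) << 16;
    uint32_t exp  = (h >> 10) & 0x1Fu;
    uint32_t man  = h & 0x3FFu;
    uint32_t bits;
    if (exp == 0) {
        if (man == 0) {
            bits = sign;                      // signed zero
        } else {
            // fp16 subnormal: shift until the implicit leading 1 appears
            int e = -1;
            do { man <<= 1; ++e; } while ((man & 0x400u) == 0);
            man &= 0x3FFu;
            bits = sign | (static_cast<uint32_t>(127 - 15 - e) << 23) | (man << 13);
        }
    } else if (exp == 0x1Fu) {
        bits = sign | 0x7F800000u | (man << 13);  // inf or NaN
    } else {
        bits = sign | ((exp + 127 - 15) << 23) | (man << 13);
    }
    float f;
    std::memcpy(&f, &bits, sizeof f);
    return f;
}

// Hypothetical helper: fp32 -> bf16, keeping the top 16 bits.
// Round-to-nearest-even is assumed here.
uint16_t fp32_to_bf16(float f) {
    uint32_t bits;
    std::memcpy(&bits, &f, sizeof bits);
    if ((bits & 0x7FFFFFFFu) > 0x7F800000u) {
        return static_cast<uint16_t>((bits >> 16) | 0x0040u);  // keep NaN a NaN
    }
    bits += 0x7FFFu + ((bits >> 16) & 1u);    // round to nearest, ties to even
    return static_cast<uint16_t>(bits >> 16);
}

// bf16 -> fp32 is exact: bf16 is the upper half of an fp32 bit pattern.
float bf16_to_fp32(uint16_t b) {
    uint32_t bits = static_cast<uint32_t>(b) << 16;
    float f;
    std::memcpy(&f, &bits, sizeof f);
    return f;
}

// Convenience wrapper: fp16 bits -> bf16 bits via fp32.
uint16_t fp16_to_bf16(uint16_t h) {
    return fp32_to_bf16(fp16_to_fp32(h));
}
```

For example, the fp16 pattern `0x3C01` (1 + 2^-10) maps to the same bf16 value as `0x3C00` (1.0), whereas `0x3C08` (1 + 2^-7) survives unchanged. No fp16 value overflows or underflows in bf16; only the mantissa is shortened.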