### 🔀 [#174](https://github.com/ikawrakow/ik_llama.cpp/pull/174) - On Zen4 repack fp16 models to bf16_r16

| **Author** | `ikawrakow` |
| :--- | :--- |
| **State** | ❌ **Closed** |
| **Created** | 2025-01-21 |
| **Updated** | 2025-01-21 |

---
#### Description
...when run-time-repacking is requested via `-rtr`

This massively improves performance. As this is opt-in, we do not worry about possible precision loss in the `f16 -> bf16` conversion.
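
To make the precision remark concrete: `f16` stores 10 explicit mantissa bits, while `bf16` stores only 7 (with a wider, 8-bit exponent). Converting `f16 -> bf16` therefore never overflows, but it can round away the lowest 3 mantissa bits, for a relative error of at most 2^-8. The following is a minimal scalar sketch of that conversion, assuming round-to-nearest-even. It is not the repacking code from this PR, and the helper names (`fp16_to_fp32`, `fp32_to_bf16`, `bf16_to_fp32`, `fp16_to_bf16`, `check_examples`) are hypothetical, chosen only for illustration.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>

// Decode an IEEE 754 binary16 bit pattern to float.
// This step is exact: every fp16 value is representable in fp32.
inline float fp16_to_fp32(uint16_t h) {
    const uint32_t sign = (h >> 15) & 1u;
    const uint32_t exp  = (h >> 10) & 0x1Fu;
    const uint32_t man  = h & 0x3FFu;
    float v;
    if (exp == 0) {
        v = std::ldexp(static_cast<float>(man), -24);            // zero or subnormal
    } else if (exp == 31) {
        v = man ? NAN : INFINITY;                                 // NaN or infinity
    } else {
        v = std::ldexp(static_cast<float>(man | 0x400u),
                       static_cast<int>(exp) - 25);               // normal number
    }
    return sign ? -v : v;
}

// Round an fp32 value to bf16 (round-to-nearest-even), returning the bit pattern.
// bf16 is the upper 16 bits of fp32, so only mantissa bits are dropped.
inline uint16_t fp32_to_bf16(float f) {
    uint32_t u;
    std::memcpy(&u, &f, sizeof u);
    if ((u & 0x7FFFFFFFu) > 0x7F800000u) {
        return static_cast<uint16_t>((u >> 16) | 0x0040u);        // keep NaN a quiet NaN
    }
    u += 0x7FFFu + ((u >> 16) & 1u);                              // ties go to even
    return static_cast<uint16_t>(u >> 16);
}

// Expand a bf16 bit pattern back to fp32 (exact).
inline float bf16_to_fp32(uint16_t b) {
    const uint32_t u = static_cast<uint32_t>(b) << 16;
    float f;
    std::memcpy(&f, &u, sizeof f);
    return f;
}

// The conversion discussed above, done one scalar at a time.
inline uint16_t fp16_to_bf16(uint16_t h) {
    return fp32_to_bf16(fp16_to_fp32(h));
}

// Worked examples of what survives the conversion and what does not.
inline void check_examples() {
    // 1.125 = 1 + 2^-3 needs only 3 mantissa bits: preserved exactly.
    assert(bf16_to_fp32(fp16_to_bf16(0x3C80)) == 1.125f);
    // 1 + 2^-10 (smallest fp16 step above 1.0) collapses to 1.0 in bf16.
    assert(fp16_to_fp32(0x3C01) != 1.0f);
    assert(bf16_to_fp32(fp16_to_bf16(0x3C01)) == 1.0f);
    // The largest finite fp16 value, 65504, rounds up to 65536 without overflow.
    assert(bf16_to_fp32(fp16_to_bf16(0x7BFF)) == 65536.0f);
}
```

Calling `check_examples()` from a `main` function exercises the three cases. Only values whose fp16 mantissa uses more than 7 bits change.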