Files
ik_llama.cpp/ggml
Kawrakow ada5cc1523 Fused norm (#1086)
* Adding fused_norm - same idea as fused_rms_norm

* Avoid computing the attention reduce op for cohere2

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
2025-12-24 15:22:43 +01:00
..
2024-07-27 07:55:01 +02:00
2025-12-24 15:22:43 +01:00
2025-12-24 15:22:43 +01:00
2024-07-27 07:55:01 +02:00