ik_llama.cpp/ggml
Iwan Kawrakow 2986d3c21f Fused ffn_up*unary_op(ffn_gate) for MMVQ (no bias)
We see a nearly 2% TG (token generation) speedup for Ling-mini-2.0 and
about 1% for DeepSeek-Lite.
2025-10-24 11:27:18 +03:00