ik_llama.cpp/ggml
Iwan Kawrakow 73c551aa9e Fused ffn_up*unary_op(ffn_gate) for MMVQ (no bias)
We see nearly 2% TG speedup for Ling-mini-2.0 and
about 1% for DeepSeek-Lite.
2025-10-25 10:53:11 +03:00
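
The commit title refers to computing `ffn_up * unary_op(ffn_gate)` in one fused step rather than as separate operations. The following is a minimal reference sketch of that idea in plain Python, not the code from this commit: it ignores quantization and the GPU kernel entirely, assumes SiLU as the unary op, and all function names (`silu`, `matvec`, `ffn_unfused`, `ffn_fused`) are hypothetical, chosen for illustration only.

```python
import math


def silu(x):
    # SiLU activation; assumed here as the unary op for illustration.
    return x / (1.0 + math.exp(-x))


def matvec(W, x):
    # Plain matrix-vector product: one dot product per row of W.
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def ffn_unfused(W_up, W_gate, x, unary_op=silu):
    # Three separate passes: up projection, gate projection,
    # then the element-wise combine. The two intermediate vectors
    # are fully materialized before being combined.
    up = matvec(W_up, x)
    gate = matvec(W_gate, x)
    return [u * unary_op(g) for u, g in zip(up, gate)]


def ffn_fused(W_up, W_gate, x, unary_op=silu):
    # Single pass: for each output row, accumulate both dot products
    # together and apply up * unary_op(gate) immediately, so no
    # intermediate vectors are stored.
    out = []
    for up_row, gate_row in zip(W_up, W_gate):
        up_acc = 0.0
        gate_acc = 0.0
        for wu, wg, xi in zip(up_row, gate_row, x):
            up_acc += wu * xi
            gate_acc += wg * xi
        out.append(up_acc * unary_op(gate_acc))
    return out
```

Both functions return the same values. The fused form reads the input vector once per row pair and skips the intermediate buffers, which is the kind of saving a fused kernel aims for. Any bias term is left out here, matching the "(no bias)" note in the commit title.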