Files
ik_llama.cpp/ggml
Kawrakow 6f1a69352f Fuse experts bias in top_k_moe kernel (#1170)
* GLM-4.7-Flash support

* Model type

* Make FA work for mla != 0

* Fuse bias in top_k_moe kernel if present
2026-01-20 15:38:51 +02:00
..
2024-07-27 07:55:01 +02:00
2024-07-27 07:55:01 +02:00