mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-02-08 07:20:12 +00:00
* GLM-4.7-Flash support * Model type * Make FA work for mla != 0 * Fuse bias in top_k_moe kernel if present
* GLM-4.7-Flash support * Model type * Make FA work for mla != 0 * Fuse bias in top_k_moe kernel if present