ik_llama.cpp

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-03-02 10:00:07 +00:00

Author	SHA1	Message	Date
yurko	6db8dc86ca	qwen3next: split cpu/cuda eval builds and tune PP scheduling	2026-02-06 19:28:17 -08:00
Yurko	e64b43392f	cuda: reduce qwen3next moe/ssm sync overhead and refresh eval	2026-02-06 14:46:59 +00:00
yurko	c767cfa1d3	docs: update qwen3next perf report for cuda MoE/SSM tuning	2026-02-06 13:52:54 +00:00
yurko	9fbb50481e	qwen3next: optimize broadcast sub and single-seq ssm conv	2026-02-06 12:50:43 +00:00