Commit Graph

6 Commits

Author SHA1 Message Date
Ke Bao
3aa11ca722 Remove hybrid_kvcache_ratio in server args (#16399) 2026-01-06 13:13:13 +08:00
Lianmin Zheng
bc3d2a85af [Minor] update docs (#14212) 2025-12-01 02:33:58 -08:00
Jan Bernlöhr
fcccaf9001 Add Llama4 attention backend auto-selection (#13421)
Signed-off-by: jbernloehr <jbernloehr@nvidia.com>
2025-11-25 11:54:21 -08:00
Mattheliu
c3bb348dad [Docs] fix dead links in multiple documentation pages (#12764) 2025-11-06 10:49:32 -08:00
Philip Kiely - Baseten
7f028b07c4 Fix formatting in long code blocks (#10528) 2025-09-16 12:02:05 -07:00
Lianmin Zheng
2449a0afe2 Refactor the docs (#9031) 2025-08-10 19:49:45 -07:00