16 Commits

Author SHA1 Message Date
Cao E
274581fb77 Add support for more batch sizes in cpu_graph_runner (#13881) 2026-03-19 09:50:56 -07:00
blzheng
cbea9f6909 [CPU] improve numa memory binding (#19666) Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> 2026-03-18 22:15:50 -07:00
Zaili Wang
97593c9f41 [CPU] toml file update (#17861) 2026-01-31 13:16:06 -08:00
Zaili Wang
672eb37534 [CPU][Fix CI] Solidate torch version for sgl-kernel-cpu and fix device orientation error (#17460) 2026-01-22 14:04:50 +08:00
Zaili Wang
4f73e53dcc [CPU] document updates (#14272) 2025-12-03 19:56:06 -08:00
Lianmin Zheng
bc3d2a85af [Minor] update docs (#14212) 2025-12-01 02:33:58 -08:00
Zaili Wang
cf5d27e30a [CPU] Upgrade default PT version to 2.9 (#12611) 2025-11-05 18:16:35 -08:00
Vincent Zhong
b4d2da106e [docs] upd docker files names everywhere (#12133) 2025-10-25 18:20:23 -07:00
Zaili Wang
007b849b0e [CPU] misc updates (#11906) 2025-10-22 21:10:05 -07:00
Zaili Wang
f19613e6c3 Dedicated toml files for CPU/XPU (#10734) 2025-10-10 00:44:55 -07:00
Zaili Wang
6fd4816d9f Fix sgl_kernel import failure on devices other than CUDA (#10610) 2025-09-18 11:38:02 -07:00
Zaili Wang
925dbb3218 [CPU] fix CPU backend sel. issue for Llama4 (#10511) 2025-09-16 02:57:45 -07:00
Zaili Wang
7bc5fb0d78 [CPU][doc] add torch.compile param in example commands (#10349) 2025-09-11 19:22:46 -07:00
Zaili Wang
ef959d7b85 [CPU] fix OOM when mem-fraction is not set (#9090) 2025-09-10 23:52:22 -07:00
Cao E
7577f0e40f Add graph runner support with torch compile on CPU (#7843) 2025-09-07 21:33:58 -07:00
Lianmin Zheng
2449a0afe2 Refactor the docs (#9031) 2025-08-10 19:49:45 -07:00