Commit Graph

1106 Commits

Author SHA1 Message Date
Kangyan-Zhou
b2d8cc8cf0 Fix dev Docker build OOM on ARM64 cu13 by adding docker system prune (#18947)
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-22 10:29:31 +08:00
Baizhou Zhang
c36a10aabb Tiny update pull-requests permission of release-branch-cut.yml (#19121) 2026-02-21 20:14:31 +08:00
HAI
b2573fe426 Upd: CODEOWNERS (#19055)
Co-authored-by: Liangsheng Yin <hnyls2002@gmail.com>
2026-02-21 07:51:53 +08:00
Douglas Yang
77fdb6af81 feature: docker patch workflow (#19025) 2026-02-20 15:37:40 +08:00
Liangsheng Yin
83a475e8d7 feat: add cuda core dump CI warpper (#18909) 2026-02-17 14:49:26 -08:00
ronnie_zheng
10569d04bb [diffusion] update code owner (#18495) 2026-02-17 20:36:06 +03:00
Makcum888e
2aa0db7d9c [Diffusion] [NPU] Fix CI run (#18921) 2026-02-17 16:54:19 +03:00
Douglas Yang
f1efb46bdd fix: adding performance logging for nightly diffusion (#18023) 2026-02-16 14:09:00 +08:00
Douglas Yang
2050875424 fix: unifying docker image build pipeline (#18814) 2026-02-16 14:05:55 +08:00
Douglas Yang
45715af50c fix: nightly whl dev date suffix (#18873) 2026-02-16 10:57:37 +08:00
SoluMilken
07a24f1a38 update pre-commit config (#18860) 2026-02-16 00:18:31 +08:00
Alison Shao
f7603203b0 Enable DeepGemm fast warmup in CI to prevent cold-cache timeouts (#18823) 2026-02-16 00:02:30 +08:00
Douglas Yang
4ef8ece08a feature: adding build commit to sgl kernel workflow (#18853) 2026-02-15 23:28:43 +08:00
chenxu214
4e162d4b1b change npu.dockerfile (#18835) 2026-02-15 20:43:15 +08:00
Michael
88010e9601 [AMD] Fix nightly 1-GPU test failures and bench_serving regression (#18761)
Co-authored-by: michaelzhang-ai <michaelzhang-ai@users.noreply.github.com>
2026-02-15 20:36:47 +08:00
Mohammad Miadh Angkad
b1b69ae0a9 Add CI permissions (#18847) 2026-02-15 08:24:36 +08:00
Bingxu Chen
38473f8ee0 [AMD] Fix sgl-model-gateway Build Errors in ROCm Docker Release (#18836) 2026-02-14 00:07:26 -08:00
Zehuan Li
e2eb5bf28d [DLLM] Update CODEOWNERS for diffusion LLM (#18834) 2026-02-14 15:31:42 +08:00
Alison Shao
8ef3e3d56b Fix CI concurrency collision between scheduled runs and fork PRs (#18826) 2026-02-14 10:48:31 +08:00
Kangyan-Zhou
eccf875d49 [CI] Revive 8-GPU trace upload in nightly test workflow (#18820)
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-14 08:37:08 +08:00
Mohammad Miadh Angkad
1be41e9036 [FlashInfer] Bump FlashInfer version from 0.6.2 to 0.6.3 (#18448) 2026-02-14 07:43:33 +08:00
shuwenn
bc2405e6c1 feat: support release lookup (#18450) 2026-02-13 10:47:02 +08:00
Douglas Yang
e730c728d3 fix: image version in pypi pr workflow (#18735) 2026-02-13 00:18:01 +08:00
Xiaoyu Zhang
9e9e949261 speed up sgl-kernel build (#18586) 2026-02-12 23:43:22 +08:00
Simo Lin
92c5749f41 refactor: replace local proto compilation with smg-grpc-proto package (#18682) 2026-02-12 05:29:24 -08:00
Kangyan-Zhou
f116b3a51b Make PR based docker and pypi workflow work for forked PR (#18720) 2026-02-12 21:05:17 +08:00
YC Tseng
d6f0ef677b [AMD] reset AMD image release time and reduce CI queue time (#18707) 2026-02-12 01:05:53 -08:00
Alan Kao
0305d12df2 [AMD] Enable release image build for ROCm 7.2.0 (#18698) 2026-02-11 23:16:54 -08:00
YC Tseng
20554a0a4f [AMD] rocm 7.2 image release, PR test, Nightly Test (#17799)
Co-authored-by: Alan Kao <akao@amd.com>
Co-authored-by: bingxche <Bingxu.Chen@amd.com>
Co-authored-by: Michael <13900043+michaelzhang-ai@users.noreply.github.com>
2026-02-11 21:29:25 -08:00
Ke Bao
93ede0db19 Update ci permission (#18693) 2026-02-12 13:25:43 +08:00
Liangsheng Yin
3404dda592 Try fix the max-parallel for maunally triggered test again. (#18686) 2026-02-11 20:08:25 -08:00
Liangsheng Yin
b2d7fd5c87 fix the max-parallel for /rerun-stage (#18658) 2026-02-11 19:06:51 -08:00
Liangsheng Yin
10c6bee74f List more CI runs for pr-test (#18650) 2026-02-11 18:36:45 -08:00
Jincong Chen
165aff38e1 Add CI permission for Chen-0210 (#18494) 2026-02-12 09:33:35 +08:00
Michael
d84d2063d3 [AMD] Fix Janus-Pro crash and add Kimi-K2.5 nightly test (#18269) 2026-02-10 22:33:13 -08:00
Makcum888e
49cbb469b4 [NPU] [CI] Enable run multimodal NPU CI when changes only in multimodal_gen (#18523) 2026-02-10 14:53:43 +03:00
Bingxu Chen
316f9cbb35 [AMD] add amd ci monitor (#17476)
Co-authored-by: michaelzhang-ai <michaelzhang-ai@users.noreply.github.com>
Co-authored-by: YC Tseng <yctseng@amd.com>
2026-02-09 09:04:54 -08:00
Bingxu Chen
3f3c201243 [AMD] Update aiter to v0.1.10.post2 (#18423)
Co-authored-by: kkHuang-amd <wunhuang@amd.com>
Co-authored-by: YC Tseng <yctseng@amd.com>
2026-02-08 22:08:24 -08:00
Hudson Xing
b564dcec61 fix: use --no-build-isolation for human-eval install (#18455) 2026-02-09 13:48:03 +08:00
shuwenn
f600965b0e [CI] fix: notebook ci may not working (#18417) 2026-02-07 22:26:35 -08:00
Makcum888e
00248d85c7 [diffusion] platform: support WAN/FLUX/Qwen-Image/Qwen-Image-edit on Ascend (#13662)
Co-authored-by: dhx98 <haox.dai@gmail.com>
Co-authored-by: DHX98 <haoxiand@andrew.cmu.edu>
Co-authored-by: ronnie_zheng <zl19940307@163.com>
Co-authored-by: DHX98 <DHX98@noreply.gitcode.com>
Co-authored-by: Yuhao Yang <47235274+yhyang201@users.noreply.github.com>
2026-02-08 10:45:30 +08:00
赵晨阳
1552aab741 Support execute_shell_command for env var support (#18390) 2026-02-07 12:33:29 +08:00
Baizhou Zhang
9fbec79906 Revert "[Build] Enable full kernel in aarch64 wheel" (#18385) 2026-02-07 09:19:07 +08:00
Alison Shao
bedade1ef0 Merge stage-c-test-large-4-gpu suites into partitioned suites (#18325) 2026-02-06 15:32:33 -08:00
Baizhou Zhang
f2e0048d06 Add CI permission for Shunkangz, dongjiyingdjy, samuellees (#18377) 2026-02-07 01:19:02 +08:00
zwang86
bdaf3de9b3 fix: add SGLANG_IS_IN_CI env var to release-docs workflow (#18225)
Co-authored-by: Zeyu Wang <zeyu.wang@yahooinc.com>
Co-authored-by: edwingao28 <edwingao28@users.noreply.github.com>
2026-02-04 15:49:41 -08:00
Michael
6fd878b41d [AMD] Add kimi mi35x nightly test, folder organization and several stability fixes (#17895) 2026-02-04 12:03:57 -08:00
Xiaoyu Zhang
2e9d0442e2 [diffusion] update code owner (#18247) 2026-02-04 19:12:32 +08:00
Douglas Yang
b7c1dfc602 fix: bumping nightly whl version (#18212) 2026-02-03 20:43:38 -08:00
Douglas Yang
ae004e15c9 fix: ensuring nightly whls are tagged with latest commit (#18204) 2026-02-03 15:54:41 -08:00