536 Commits

Author SHA1 Message Date
DarkSharpness
95f59c13fd [Chore] include all jit files in building packages (#17493) 2026-01-21 14:48:02 -08:00
Jacob Gordon
cda43ffa4d ci: avoids duplication of codespell config (#17519) 2026-01-21 12:02:37 -08:00
Baizhou Zhang
ea879c7739 [Minor] Correct sglang version when installing from source (#17315) 2026-01-18 19:36:16 -08:00
Baizhou Zhang
a04675892e Update flashinfer to 0.6.1 (#15551) 2026-01-17 00:48:30 +08:00
R0CKSTAR
a1dd3d48ac [diffusion] hardware: support diffusion (single GPU, 3/N) (#17105)
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
2026-01-16 17:01:09 +08:00
sglang-bot
000ad42225 chore: bump sgl-kernel version to 0.3.21 (#17075)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2026-01-15 12:41:17 +08:00
Xiaoyu Zhang
740d3c0b39 [Diffusion] Remove useless dependency in diffusion (#16967) 2026-01-13 17:25:53 +08:00
Baizhou Zhang
9fd2358cc2 Update Cutedsl version and pin cuda-python version (#16838) 2026-01-10 17:08:43 +08:00
Chang Su
16880235d1 [grpc] Auto-generate protobuf files during wheel build (#16409) 2026-01-08 09:09:54 -08:00
Liangsheng Yin
a7fd810842 Allow editable install without .git with add fallback version in pyproject.toml (#16435) 2026-01-05 11:17:20 +08:00
ishandhanani
0500fea965 fix editable install (#16241) 2025-12-31 14:34:54 -08:00
Kangyan-Zhou
9c4eb46099 Add a new branch cut GH workflow, and adopt setuptools-scm for version control (#15985) 2025-12-29 13:51:21 -08:00
Prozac614
3778c2fc6d [diffusion] CI: fix CI test case skip problem (#15874)
Co-authored-by: Mick <mickjagger19@icloud.com>
2025-12-26 19:42:20 +08:00
sglang-bot
34013d9d5a chore: bump sgl-kernel version to 0.3.20 (#15590)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-12-22 12:32:34 -08:00
Yineng Zhang
0861dca81f Revert "[misc] Upgrade cutedsl to 4.3.1 (#14857)" (#15293) 2025-12-16 16:31:32 -08:00
Baizhou Zhang
0261c4aff7 [misc] Upgrade cutedsl to 4.3.1 (#14857) 2025-12-16 12:11:56 -08:00
Lianmin Zheng
267170bf1d Clean up server args and engine startup processes (#15015) 2025-12-12 18:46:07 -08:00
DefTruth
d71baa72dc [diffusion] dependency: upgrade cache-dit for better compatibility (#14534) 2025-12-12 18:52:58 +08:00
sglang-bot
5c8bd8b51b chore: bump SGLang version to 0.5.6.post2 (#14858)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-12-11 12:29:52 -08:00
Yuhao Yang
c1bd5ee8c5 Revert transformers to 4.57.1 (#14801) 2025-12-10 11:04:36 -08:00
Lianmin Zheng
18bd8e8d6d Improve CI by trying a warmup before unit tests (#14669) 2025-12-09 15:17:59 -08:00
Binyao Jiang
6abb8051e8 Bump up diffusers to latest official release version (#14670) 2025-12-08 13:41:01 -08:00
sglang-bot
9a327bdfcf chore: bump SGLang version to 0.5.6.post1 (#14651) 2025-12-09 00:35:28 +08:00
sglang-bot
2de98010b5 chore: bump sgl-kernel version to 0.3.19 (#14649) 2025-12-08 22:53:08 +08:00
Yuhao Yang
8200fb56cb update transformers package version to 5.0.0rc0 (#14356) 2025-12-08 22:46:01 +08:00
Binyao Jiang
cf0478d602 [Glm46v] Bug fix for accuracy drop and unable to launch server (#14585)
Co-authored-by: yhyang201 <yhyang201@gmail.com>
Co-authored-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Co-authored-by: Minglei Zhu <mingleizhu1122@gmail.com>
2025-12-07 23:45:02 -08:00
sglang-bot
d2b42477c7 chore: bump sgl-kernel version to 0.3.18.post3 (#14518) 2025-12-06 13:15:16 -08:00
Mick
d881f31488 [diffusion] chore: temporarily upgrade diffusers to make Z-image compatible with Cache-DiT (#14530) 2025-12-06 12:39:37 +08:00
blahblah
66984a8b3d [diffusion] feat: support cache-dit integration (#14234)
Co-authored-by: shuxiguo <shuxiguo@meituan.com>
Co-authored-by: DefTruth <qiustudent_r@163.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
2025-12-06 00:52:22 +08:00
zyksir
fa0ca97694 [diffusion] improve: further optimize model load (#13836) 2025-12-05 10:45:20 +08:00
sglang-bot
7ae368efde chore: bump SGLang version to 0.5.6 (#14316)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-12-02 17:17:13 -08:00
Lianmin Zheng
ca52ed425f Clean up imports and move files (#14317) 2025-12-02 16:31:54 -08:00
sglang-bot
63b9300f00 chore: bump sgl-kernel version to 0.3.18.post2 (#14244) 2025-12-01 23:14:12 -08:00
strgrb
65ba5ab8b1 add cpp files for cpp_radix_tree to pyproject.toml. (#14052) 2025-11-30 13:05:04 +08:00
sglang-bot
c53e729d45 chore: bump sgl-kernel version to 0.3.18.post1 (#13951)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-11-25 18:14:28 -08:00
Fan Yin
36b1bcd242 [chore] update torch version to 2.9 (#12969)
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
2025-11-25 14:47:34 -08:00
Lzhang-hub
760c20b360 update flashinfer_cubin==0.5.3 (#13848) 2025-11-25 00:10:34 -08:00
Binyao Jiang
de430b6745 [Performance] Replace preprocess_video logic from GLM multimodal processor with transformer impl for speed up (up to 27% faster) and addressing OOM (up to 50x improvements) (#13487) 2025-11-24 18:17:13 -08:00
Zhi Yiliu
a95a38078b [Fix] Fix uvloop get_event_loop() is not suitable for 0.22.x (#13612)
Signed-off-by: lzy <tomlzy213@gmail.com>
Co-authored-by: lzy <tomlzy213@gmail.com>
2025-11-25 01:20:00 +08:00
Baizhou Zhang
04b52fa8d6 [chore]Upgrade flashinfer to 0.5.3 (#13751) 2025-11-23 23:38:36 -08:00
Yuan Luo
f56b9b42e6 [Bugfix] Add jit kernel files in packaging (#13829)
Co-authored-by: luoyuan.luo <luoyuan.luo@antgroup.com>
Co-authored-by: Xu Yongfei <xuyongfei.xyf@antgroup.com>
2025-11-24 12:32:16 +08:00
Swipe4057
d5e0346847 xgrammar up version to 0.1.27 (#13650) 2025-11-24 10:53:45 +08:00
sglang-bot
bfaf0b8607 chore: bump sgl-kernel version to 0.3.17.post2 (#13570) 2025-11-19 14:02:57 -08:00
sglang-bot
7b2fb3d47c chore: bump SGLang version to 0.5.5.post3 (#13366) 2025-11-16 17:55:38 -08:00
b8zhong
d5fa58c4dd fix nightly docker build (#13386) 2025-11-16 11:21:09 -08:00
sglang-bot
1ca205f6da chore: bump sgl-kernel version to 0.3.17.post1 (#13358) 2025-11-15 19:11:41 -08:00
Yineng Zhang
f8d3d80f63 chore: bump flashinfer v0.5.2 (#13242) 2025-11-14 02:47:09 -08:00
sglang-bot
ebaf86d441 chore: bump SGLang version to 0.5.5.post2 (#13129)
Include the critical fix https://github.com/sgl-project/sglang/pull/12915.
2025-11-12 20:35:20 +08:00
sglang-bot
303cc957e6 chore: bump SGLang version to 0.5.5.post1 (#13000) 2025-11-10 11:53:43 -08:00
sglang-bot
37c40a87a8 chore: bump sgl-kernel version to 0.3.17 (#12966) 2025-11-10 21:50:58 +08:00