DarkSharpness
|
95f59c13fd
|
[Chore] include all jit files in building packages (#17493)
|
2026-01-21 14:48:02 -08:00 |
|
Jacob Gordon
|
cda43ffa4d
|
ci: avoids duplication of codespell config (#17519)
|
2026-01-21 12:02:37 -08:00 |
|
Baizhou Zhang
|
ea879c7739
|
[Minor] Correct sglang version when installing from source (#17315)
|
2026-01-18 19:36:16 -08:00 |
|
Baizhou Zhang
|
a04675892e
|
Update flashinfer to 0.6.1 (#15551)
|
2026-01-17 00:48:30 +08:00 |
|
R0CKSTAR
|
a1dd3d48ac
|
[diffusion] hardware: support diffusion (single GPU, 3/N) (#17105)
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
|
2026-01-16 17:01:09 +08:00 |
|
sglang-bot
|
000ad42225
|
chore: bump sgl-kernel version to 0.3.21 (#17075)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
|
2026-01-15 12:41:17 +08:00 |
|
Xiaoyu Zhang
|
740d3c0b39
|
[Diffusion] Remove useless dependency in diffusion (#16967)
|
2026-01-13 17:25:53 +08:00 |
|
Baizhou Zhang
|
9fd2358cc2
|
Update Cutedsl version and pin cuda-python version (#16838)
|
2026-01-10 17:08:43 +08:00 |
|
Chang Su
|
16880235d1
|
[grpc] Auto-generate protobuf files during wheel build (#16409)
|
2026-01-08 09:09:54 -08:00 |
|
Liangsheng Yin
|
a7fd810842
|
Allow editable install without .git with add fallback version in pyproject.toml (#16435)
|
2026-01-05 11:17:20 +08:00 |
|
ishandhanani
|
0500fea965
|
fix editable install (#16241)
|
2025-12-31 14:34:54 -08:00 |
|
Kangyan-Zhou
|
9c4eb46099
|
Add a new branch cut GH workflow, and adopt setuptools-scm for version control (#15985)
|
2025-12-29 13:51:21 -08:00 |
|
Prozac614
|
3778c2fc6d
|
[diffusion] CI: fix CI test case skip problem (#15874)
Co-authored-by: Mick <mickjagger19@icloud.com>
|
2025-12-26 19:42:20 +08:00 |
|
sglang-bot
|
34013d9d5a
|
chore: bump sgl-kernel version to 0.3.20 (#15590)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
|
2025-12-22 12:32:34 -08:00 |
|
Yineng Zhang
|
0861dca81f
|
Revert "[misc] Upgrade cutedsl to 4.3.1 (#14857)" (#15293)
|
2025-12-16 16:31:32 -08:00 |
|
Baizhou Zhang
|
0261c4aff7
|
[misc] Upgrade cutedsl to 4.3.1 (#14857)
|
2025-12-16 12:11:56 -08:00 |
|
Lianmin Zheng
|
267170bf1d
|
Clean up server args and engine startup processes (#15015)
|
2025-12-12 18:46:07 -08:00 |
|
DefTruth
|
d71baa72dc
|
[diffusion] dependency: upgrade cache-dit for better compatibility (#14534)
|
2025-12-12 18:52:58 +08:00 |
|
sglang-bot
|
5c8bd8b51b
|
chore: bump SGLang version to 0.5.6.post2 (#14858)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
|
2025-12-11 12:29:52 -08:00 |
|
Yuhao Yang
|
c1bd5ee8c5
|
Revert transformers to 4.57.1 (#14801)
|
2025-12-10 11:04:36 -08:00 |
|
Lianmin Zheng
|
18bd8e8d6d
|
Improve CI by trying a warmup before unit tests (#14669)
|
2025-12-09 15:17:59 -08:00 |
|
Binyao Jiang
|
6abb8051e8
|
Bump up diffusers to latest official release version (#14670)
|
2025-12-08 13:41:01 -08:00 |
|
sglang-bot
|
9a327bdfcf
|
chore: bump SGLang version to 0.5.6.post1 (#14651)
|
2025-12-09 00:35:28 +08:00 |
|
sglang-bot
|
2de98010b5
|
chore: bump sgl-kernel version to 0.3.19 (#14649)
|
2025-12-08 22:53:08 +08:00 |
|
Yuhao Yang
|
8200fb56cb
|
update transformers package version to 5.0.0rc0 (#14356)
|
2025-12-08 22:46:01 +08:00 |
|
Binyao Jiang
|
cf0478d602
|
[Glm46v] Bug fix for accuracy drop and unable to launch server (#14585)
Co-authored-by: yhyang201 <yhyang201@gmail.com>
Co-authored-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Co-authored-by: Minglei Zhu <mingleizhu1122@gmail.com>
|
2025-12-07 23:45:02 -08:00 |
|
sglang-bot
|
d2b42477c7
|
chore: bump sgl-kernel version to 0.3.18.post3 (#14518)
|
2025-12-06 13:15:16 -08:00 |
|
Mick
|
d881f31488
|
[diffusion] chore: temporarily upgrade diffusers to make Z-image compatible with Cache-DiT (#14530)
|
2025-12-06 12:39:37 +08:00 |
|
blahblah
|
66984a8b3d
|
[diffusion] feat: support cache-dit integration (#14234)
Co-authored-by: shuxiguo <shuxiguo@meituan.com>
Co-authored-by: DefTruth <qiustudent_r@163.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
|
2025-12-06 00:52:22 +08:00 |
|
zyksir
|
fa0ca97694
|
[diffusion] improve: further optimize model load (#13836)
|
2025-12-05 10:45:20 +08:00 |
|
sglang-bot
|
7ae368efde
|
chore: bump SGLang version to 0.5.6 (#14316)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
|
2025-12-02 17:17:13 -08:00 |
|
Lianmin Zheng
|
ca52ed425f
|
Clean up imports and move files (#14317)
|
2025-12-02 16:31:54 -08:00 |
|
sglang-bot
|
63b9300f00
|
chore: bump sgl-kernel version to 0.3.18.post2 (#14244)
|
2025-12-01 23:14:12 -08:00 |
|
strgrb
|
65ba5ab8b1
|
add cpp files for cpp_radix_tree to pyproject.toml. (#14052)
|
2025-11-30 13:05:04 +08:00 |
|
sglang-bot
|
c53e729d45
|
chore: bump sgl-kernel version to 0.3.18.post1 (#13951)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
|
2025-11-25 18:14:28 -08:00 |
|
Fan Yin
|
36b1bcd242
|
[chore] update torch version to 2.9 (#12969)
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
|
2025-11-25 14:47:34 -08:00 |
|
Lzhang-hub
|
760c20b360
|
update flashinfer_cubin==0.5.3 (#13848)
|
2025-11-25 00:10:34 -08:00 |
|
Binyao Jiang
|
de430b6745
|
[Performance] Replace preprocess_video logic from GLM multimodal processor with transformer impl for speed up (up to 27% faster) and addressing OOM (up to 50x improvements) (#13487)
|
2025-11-24 18:17:13 -08:00 |
|
Zhi Yiliu
|
a95a38078b
|
[Fix] Fix uvloop get_event_loop() is not suitable for 0.22.x (#13612)
Signed-off-by: lzy <tomlzy213@gmail.com>
Co-authored-by: lzy <tomlzy213@gmail.com>
|
2025-11-25 01:20:00 +08:00 |
|
Baizhou Zhang
|
04b52fa8d6
|
[chore]Upgrade flashinfer to 0.5.3 (#13751)
|
2025-11-23 23:38:36 -08:00 |
|
Yuan Luo
|
f56b9b42e6
|
[Bugfix] Add jit kernel files in packaging (#13829)
Co-authored-by: luoyuan.luo <luoyuan.luo@antgroup.com>
Co-authored-by: Xu Yongfei <xuyongfei.xyf@antgroup.com>
|
2025-11-24 12:32:16 +08:00 |
|
Swipe4057
|
d5e0346847
|
xgrammar up version to 0.1.27 (#13650)
|
2025-11-24 10:53:45 +08:00 |
|
sglang-bot
|
bfaf0b8607
|
chore: bump sgl-kernel version to 0.3.17.post2 (#13570)
|
2025-11-19 14:02:57 -08:00 |
|
sglang-bot
|
7b2fb3d47c
|
chore: bump SGLang version to 0.5.5.post3 (#13366)
|
2025-11-16 17:55:38 -08:00 |
|
b8zhong
|
d5fa58c4dd
|
fix nightly docker build (#13386)
|
2025-11-16 11:21:09 -08:00 |
|
sglang-bot
|
1ca205f6da
|
chore: bump sgl-kernel version to 0.3.17.post1 (#13358)
|
2025-11-15 19:11:41 -08:00 |
|
Yineng Zhang
|
f8d3d80f63
|
chore: bump flashinfer v0.5.2 (#13242)
|
2025-11-14 02:47:09 -08:00 |
|
sglang-bot
|
ebaf86d441
|
chore: bump SGLang version to 0.5.5.post2 (#13129)
Include the critical fix https://github.com/sgl-project/sglang/pull/12915.
|
2025-11-12 20:35:20 +08:00 |
|
sglang-bot
|
303cc957e6
|
chore: bump SGLang version to 0.5.5.post1 (#13000)
|
2025-11-10 11:53:43 -08:00 |
|
sglang-bot
|
37c40a87a8
|
chore: bump sgl-kernel version to 0.3.17 (#12966)
|
2025-11-10 21:50:58 +08:00 |
|