Prozac614
|
57c5c343d7
|
[diffusion] model: support Hunyuan3D-2 (#18170)
Co-authored-by: yingluosanqian <yingluosanqian@gmail.com>
Co-authored-by: daiweitao <dwti614707404@163.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
|
2026-03-02 12:28:05 +08:00 |
|
DefTruth
|
78d6674c45
|
[diffusion] feat: support hybrid parallelism for diffusers backend (#19405)
|
2026-02-27 00:06:08 +08:00 |
|
Mick
|
241ee90164
|
[diffusion] chore: tiny fix pyproject.toml (#19256)
|
2026-02-25 11:57:53 +08:00 |
|
GMI Xiao Jin
|
fcfd964d7d
|
[diffusion] model: LTX-2 Support PR3 (#19151)
|
2026-02-24 16:55:28 +08:00 |
|
Mohammad Miadh Angkad
|
1be41e9036
|
[FlashInfer] Bump FlashInfer version from 0.6.2 to 0.6.3 (#18448)
|
2026-02-14 07:43:33 +08:00 |
|
Simo Lin
|
92c5749f41
|
refactor: replace local proto compilation with smg-grpc-proto package (#18682)
|
2026-02-12 05:29:24 -08:00 |
|
shaharmor98
|
c6aa1863be
|
Add Nemotron 3 Nano tests (#18119)
Signed-off-by: Shahar Mor <smor@nvidia.com>
|
2026-02-06 23:55:42 +08:00 |
|
linhaifeng
|
c1d5cc3b24
|
[Bugfix] fix a obvious logic error (#18254)
|
2026-02-04 13:59:58 -08:00 |
|
Mick
|
977096ae03
|
[diffusion] cli: introduce generic attention backend configuration in ServerArgs (#18036)
|
2026-02-02 09:47:40 +08:00 |
|
Baizhou Zhang
|
c7d53fa26a
|
Set torch url index in pyproject.toml (#16802)
|
2026-02-01 13:23:52 +08:00 |
|
Prozac614
|
3fcda00e8c
|
[CI] Fix CI timeouts by upgrading runai_model_streamer (related to #16937) (#17636)
|
2026-01-28 17:09:45 -08:00 |
|
shaharmor98
|
f6f1b6d000
|
Bump FI version (#17700)
Signed-off-by: Shahar Mor <smor@nvidia.com>
Co-authored-by: b8zhong <b8zhong@uwaterloo.ca>
|
2026-01-26 16:50:06 +08:00 |
|
Kangyan-Zhou
|
48f4340b14
|
Exclude some diffusion package for ARM in docker release (#17745)
|
2026-01-25 23:32:39 -08:00 |
|
Kangyan-Zhou
|
8d3e1ac0c8
|
Add an all type in pyproject.tml to include diffusion support (#17697)
|
2026-01-25 12:52:13 -08:00 |
|
Chi McIsaac
|
71482dd171
|
[diffusion] feat: enable passing Cache‑DiT config for diffusers backend (#16662)
Signed-off-by: Chi <chixie.mcisaac@gmail.com>
Signed-off-by: qimcis <chixie.mcisaac@gmail.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
|
2026-01-22 13:13:34 +08:00 |
|
Baizhou Zhang
|
fafa171529
|
[hotfix] Fixes on cuda 13 docker image (#17541)
Co-authored-by: iforgetmyname <iforgetmyname@users.noreply.github>
|
2026-01-22 12:29:55 +08:00 |
|
Lianmin Zheng
|
b74a57a8d9
|
[Auto Sync] Update detokenizer_manager.py, io_struct.py, mu... (20260120) (#17442)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Wangfan Fu <wangfan@x.ai>
|
2026-01-21 14:48:32 -08:00 |
|
DarkSharpness
|
95f59c13fd
|
[Chore] include all jit files in building packages (#17493)
|
2026-01-21 14:48:02 -08:00 |
|
Jacob Gordon
|
cda43ffa4d
|
ci: avoids duplication of codespell config (#17519)
|
2026-01-21 12:02:37 -08:00 |
|
Baizhou Zhang
|
ea879c7739
|
[Minor] Correct sglang version when installing from source (#17315)
|
2026-01-18 19:36:16 -08:00 |
|
Baizhou Zhang
|
a04675892e
|
Update flashinfer to 0.6.1 (#15551)
|
2026-01-17 00:48:30 +08:00 |
|
R0CKSTAR
|
a1dd3d48ac
|
[diffusion] hardware: support diffusion (single GPU, 3/N) (#17105)
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
|
2026-01-16 17:01:09 +08:00 |
|
sglang-bot
|
000ad42225
|
chore: bump sgl-kernel version to 0.3.21 (#17075)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
|
2026-01-15 12:41:17 +08:00 |
|
Xiaoyu Zhang
|
740d3c0b39
|
[Diffusion] Remove useless dependency in diffusion (#16967)
|
2026-01-13 17:25:53 +08:00 |
|
Baizhou Zhang
|
9fd2358cc2
|
Update Cutedsl version and pin cuda-python version (#16838)
|
2026-01-10 17:08:43 +08:00 |
|
Chang Su
|
16880235d1
|
[grpc] Auto-generate protobuf files during wheel build (#16409)
|
2026-01-08 09:09:54 -08:00 |
|
Liangsheng Yin
|
a7fd810842
|
Allow editable install without .git with add fallback version in pyproject.toml (#16435)
|
2026-01-05 11:17:20 +08:00 |
|
ishandhanani
|
0500fea965
|
fix editable install (#16241)
|
2025-12-31 14:34:54 -08:00 |
|
Kangyan-Zhou
|
9c4eb46099
|
Add a new branch cut GH workflow, and adopt setuptools-scm for version control (#15985)
|
2025-12-29 13:51:21 -08:00 |
|
Prozac614
|
3778c2fc6d
|
[diffusion] CI: fix CI test case skip problem (#15874)
Co-authored-by: Mick <mickjagger19@icloud.com>
|
2025-12-26 19:42:20 +08:00 |
|
sglang-bot
|
34013d9d5a
|
chore: bump sgl-kernel version to 0.3.20 (#15590)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
|
2025-12-22 12:32:34 -08:00 |
|
Yineng Zhang
|
0861dca81f
|
Revert "[misc] Upgrade cutedsl to 4.3.1 (#14857)" (#15293)
|
2025-12-16 16:31:32 -08:00 |
|
Baizhou Zhang
|
0261c4aff7
|
[misc] Upgrade cutedsl to 4.3.1 (#14857)
|
2025-12-16 12:11:56 -08:00 |
|
Lianmin Zheng
|
267170bf1d
|
Clean up server args and engine startup processes (#15015)
|
2025-12-12 18:46:07 -08:00 |
|
DefTruth
|
d71baa72dc
|
[diffusion] dependency: upgrade cache-dit for better compatibility (#14534)
|
2025-12-12 18:52:58 +08:00 |
|
sglang-bot
|
5c8bd8b51b
|
chore: bump SGLang version to 0.5.6.post2 (#14858)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
|
2025-12-11 12:29:52 -08:00 |
|
Yuhao Yang
|
c1bd5ee8c5
|
Revert transformers to 4.57.1 (#14801)
|
2025-12-10 11:04:36 -08:00 |
|
Lianmin Zheng
|
18bd8e8d6d
|
Improve CI by trying a warmup before unit tests (#14669)
|
2025-12-09 15:17:59 -08:00 |
|
Binyao Jiang
|
6abb8051e8
|
Bump up diffusers to latest official release version (#14670)
|
2025-12-08 13:41:01 -08:00 |
|
sglang-bot
|
9a327bdfcf
|
chore: bump SGLang version to 0.5.6.post1 (#14651)
|
2025-12-09 00:35:28 +08:00 |
|
sglang-bot
|
2de98010b5
|
chore: bump sgl-kernel version to 0.3.19 (#14649)
|
2025-12-08 22:53:08 +08:00 |
|
Yuhao Yang
|
8200fb56cb
|
update transformers package version to 5.0.0rc0 (#14356)
|
2025-12-08 22:46:01 +08:00 |
|
Binyao Jiang
|
cf0478d602
|
[Glm46v] Bug fix for accuracy drop and unable to launch server (#14585)
Co-authored-by: yhyang201 <yhyang201@gmail.com>
Co-authored-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Co-authored-by: Minglei Zhu <mingleizhu1122@gmail.com>
|
2025-12-07 23:45:02 -08:00 |
|
sglang-bot
|
d2b42477c7
|
chore: bump sgl-kernel version to 0.3.18.post3 (#14518)
|
2025-12-06 13:15:16 -08:00 |
|
Mick
|
d881f31488
|
[diffusion] chore: temporarily upgrade diffusers to make Z-image compatible with Cache-DiT (#14530)
|
2025-12-06 12:39:37 +08:00 |
|
blahblah
|
66984a8b3d
|
[diffusion] feat: support cache-dit integration (#14234)
Co-authored-by: shuxiguo <shuxiguo@meituan.com>
Co-authored-by: DefTruth <qiustudent_r@163.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
|
2025-12-06 00:52:22 +08:00 |
|
zyksir
|
fa0ca97694
|
[diffusion] improve: further optimize model load (#13836)
|
2025-12-05 10:45:20 +08:00 |
|
sglang-bot
|
7ae368efde
|
chore: bump SGLang version to 0.5.6 (#14316)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
|
2025-12-02 17:17:13 -08:00 |
|
Lianmin Zheng
|
ca52ed425f
|
Clean up imports and move files (#14317)
|
2025-12-02 16:31:54 -08:00 |
|
sglang-bot
|
63b9300f00
|
chore: bump sgl-kernel version to 0.3.18.post2 (#14244)
|
2025-12-01 23:14:12 -08:00 |
|