Commit Graph

503 Commits

Author SHA1 Message Date
Prozac614
57c5c343d7 [diffusion] model: support Hunyuan3D-2 (#18170)
Co-authored-by: yingluosanqian <yingluosanqian@gmail.com>
Co-authored-by: daiweitao <dwti614707404@163.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
2026-03-02 12:28:05 +08:00
DefTruth
78d6674c45 [diffusion] feat: support hybrid parallelism for diffusers backend (#19405) 2026-02-27 00:06:08 +08:00
Mick
241ee90164 [diffusion] chore: tiny fix pyproject.toml (#19256) 2026-02-25 11:57:53 +08:00
GMI Xiao Jin
fcfd964d7d [diffusion] model: LTX-2 Support PR3 (#19151) 2026-02-24 16:55:28 +08:00
Mohammad Miadh Angkad
1be41e9036 [FlashInfer] Bump FlashInfer version from 0.6.2 to 0.6.3 (#18448) 2026-02-14 07:43:33 +08:00
Simo Lin
92c5749f41 refactor: replace local proto compilation with smg-grpc-proto package (#18682) 2026-02-12 05:29:24 -08:00
shaharmor98
c6aa1863be Add Nemotron 3 Nano tests (#18119)
Signed-off-by: Shahar Mor <smor@nvidia.com>
2026-02-06 23:55:42 +08:00
linhaifeng
c1d5cc3b24 [Bugfix] fix a obvious logic error (#18254) 2026-02-04 13:59:58 -08:00
Mick
977096ae03 [diffusion] cli: introduce generic attention backend configuration in ServerArgs (#18036) 2026-02-02 09:47:40 +08:00
Baizhou Zhang
c7d53fa26a Set torch url index in pyproject.toml (#16802) 2026-02-01 13:23:52 +08:00
Prozac614
3fcda00e8c [CI] Fix CI timeouts by upgrading runai_model_streamer (related to #16937) (#17636) 2026-01-28 17:09:45 -08:00
shaharmor98
f6f1b6d000 Bump FI version (#17700)
Signed-off-by: Shahar Mor <smor@nvidia.com>
Co-authored-by: b8zhong <b8zhong@uwaterloo.ca>
2026-01-26 16:50:06 +08:00
Kangyan-Zhou
48f4340b14 Exclude some diffusion package for ARM in docker release (#17745) 2026-01-25 23:32:39 -08:00
Kangyan-Zhou
8d3e1ac0c8 Add an all type in pyproject.tml to include diffusion support (#17697) 2026-01-25 12:52:13 -08:00
Chi McIsaac
71482dd171 [diffusion] feat: enable passing Cache‑DiT config for diffusers backend (#16662)
Signed-off-by: Chi <chixie.mcisaac@gmail.com>
Signed-off-by: qimcis <chixie.mcisaac@gmail.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
2026-01-22 13:13:34 +08:00
Baizhou Zhang
fafa171529 [hotfix] Fixes on cuda 13 docker image (#17541)
Co-authored-by: iforgetmyname <iforgetmyname@users.noreply.github>
2026-01-22 12:29:55 +08:00
Lianmin Zheng
b74a57a8d9 [Auto Sync] Update detokenizer_manager.py, io_struct.py, mu... (20260120) (#17442)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Wangfan Fu <wangfan@x.ai>
2026-01-21 14:48:32 -08:00
DarkSharpness
95f59c13fd [Chore] include all jit files in building packages (#17493) 2026-01-21 14:48:02 -08:00
Jacob Gordon
cda43ffa4d ci: avoids duplication of codespell config (#17519) 2026-01-21 12:02:37 -08:00
Baizhou Zhang
ea879c7739 [Minor] Correct sglang version when installing from source (#17315) 2026-01-18 19:36:16 -08:00
Baizhou Zhang
a04675892e Update flashinfer to 0.6.1 (#15551) 2026-01-17 00:48:30 +08:00
R0CKSTAR
a1dd3d48ac [diffusion] hardware: support diffusion (single GPU, 3/N) (#17105)
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
2026-01-16 17:01:09 +08:00
sglang-bot
000ad42225 chore: bump sgl-kernel version to 0.3.21 (#17075)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2026-01-15 12:41:17 +08:00
Xiaoyu Zhang
740d3c0b39 [Diffusion] Remove useless dependency in diffusion (#16967) 2026-01-13 17:25:53 +08:00
Baizhou Zhang
9fd2358cc2 Update Cutedsl version and pin cuda-python version (#16838) 2026-01-10 17:08:43 +08:00
Chang Su
16880235d1 [grpc] Auto-generate protobuf files during wheel build (#16409) 2026-01-08 09:09:54 -08:00
Liangsheng Yin
a7fd810842 Allow editable install without .git with add fallback version in pyproject.toml (#16435) 2026-01-05 11:17:20 +08:00
ishandhanani
0500fea965 fix editable install (#16241) 2025-12-31 14:34:54 -08:00
Kangyan-Zhou
9c4eb46099 Add a new branch cut GH workflow, and adopt setuptools-scm for version control (#15985) 2025-12-29 13:51:21 -08:00
Prozac614
3778c2fc6d [diffusion] CI: fix CI test case skip problem (#15874)
Co-authored-by: Mick <mickjagger19@icloud.com>
2025-12-26 19:42:20 +08:00
sglang-bot
34013d9d5a chore: bump sgl-kernel version to 0.3.20 (#15590)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-12-22 12:32:34 -08:00
Yineng Zhang
0861dca81f Revert "[misc] Upgrade cutedsl to 4.3.1 (#14857)" (#15293) 2025-12-16 16:31:32 -08:00
Baizhou Zhang
0261c4aff7 [misc] Upgrade cutedsl to 4.3.1 (#14857) 2025-12-16 12:11:56 -08:00
Lianmin Zheng
267170bf1d Clean up server args and engine startup processes (#15015) 2025-12-12 18:46:07 -08:00
DefTruth
d71baa72dc [diffusion] dependency: upgrade cache-dit for better compatibility (#14534) 2025-12-12 18:52:58 +08:00
sglang-bot
5c8bd8b51b chore: bump SGLang version to 0.5.6.post2 (#14858)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-12-11 12:29:52 -08:00
Yuhao Yang
c1bd5ee8c5 Revert transformers to 4.57.1 (#14801) 2025-12-10 11:04:36 -08:00
Lianmin Zheng
18bd8e8d6d Improve CI by trying a warmup before unit tests (#14669) 2025-12-09 15:17:59 -08:00
Binyao Jiang
6abb8051e8 Bump up diffusers to latest official release version (#14670) 2025-12-08 13:41:01 -08:00
sglang-bot
9a327bdfcf chore: bump SGLang version to 0.5.6.post1 (#14651) 2025-12-09 00:35:28 +08:00
sglang-bot
2de98010b5 chore: bump sgl-kernel version to 0.3.19 (#14649) 2025-12-08 22:53:08 +08:00
Yuhao Yang
8200fb56cb update transformers package version to 5.0.0rc0 (#14356) 2025-12-08 22:46:01 +08:00
Binyao Jiang
cf0478d602 [Glm46v] Bug fix for accuracy drop and unable to launch server (#14585)
Co-authored-by: yhyang201 <yhyang201@gmail.com>
Co-authored-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Co-authored-by: Minglei Zhu <mingleizhu1122@gmail.com>
2025-12-07 23:45:02 -08:00
sglang-bot
d2b42477c7 chore: bump sgl-kernel version to 0.3.18.post3 (#14518) 2025-12-06 13:15:16 -08:00
Mick
d881f31488 [diffusion] chore: temporarily upgrade diffusers to make Z-image compatible with Cache-DiT (#14530) 2025-12-06 12:39:37 +08:00
blahblah
66984a8b3d [diffusion] feat: support cache-dit integration (#14234)
Co-authored-by: shuxiguo <shuxiguo@meituan.com>
Co-authored-by: DefTruth <qiustudent_r@163.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
2025-12-06 00:52:22 +08:00
zyksir
fa0ca97694 [diffusion] improve: further optimize model load (#13836) 2025-12-05 10:45:20 +08:00
sglang-bot
7ae368efde chore: bump SGLang version to 0.5.6 (#14316)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-12-02 17:17:13 -08:00
Lianmin Zheng
ca52ed425f Clean up imports and move files (#14317) 2025-12-02 16:31:54 -08:00
sglang-bot
63b9300f00 chore: bump sgl-kernel version to 0.3.18.post2 (#14244) 2025-12-01 23:14:12 -08:00