Commit Graph

62 Commits

Author SHA1 Message Date
Simo Lin
92c5749f41 refactor: replace local proto compilation with smg-grpc-proto package (#18682) 2026-02-12 05:29:24 -08:00
Mick
977096ae03 [diffusion] cli: introduce generic attention backend configuration in ServerArgs (#18036) 2026-02-02 09:47:40 +08:00
RoyWang
30adf78f82 [diffusion]: align sglang diffusion AMD pyproject_other.toml diffusion dependency with pyproject.toml (#16225)
Co-authored-by: roywang <roywang@amd.com>
2026-01-29 01:50:57 -08:00
Prozac614
3fcda00e8c [CI] Fix CI timeouts by upgrading runai_model_streamer (related to #16937) (#17636) 2026-01-28 17:09:45 -08:00
gingerXue
e9d727cb92 [MUSA][7/N] Enhance CUDA / PyNccl wrapper to support MTLink connectivity detection (#17499)
Signed-off-by: jingzhi.xue <jingzhi.xue@mthreads.com>
Co-authored-by: jingzhi.xue <jingzhi.xue@mthreads.com>
2026-01-28 11:36:30 -08:00
Makcum888e
bba6e38ff8 [NPU] Split pyproject npu from pyproject other (#17641) 2026-01-26 09:45:44 -08:00
R0CKSTAR
a77729a276 [MUSA][1/N] sglang.check_env (#16959)
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
2026-01-23 14:41:17 -08:00
Baizhou Zhang
fafa171529 [hotfix] Fixes on cuda 13 docker image (#17541)
Co-authored-by: iforgetmyname <iforgetmyname@users.noreply.github>
2026-01-22 12:29:55 +08:00
Lianmin Zheng
b74a57a8d9 [Auto Sync] Update detokenizer_manager.py, io_struct.py, mu... (20260120) (#17442)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Wangfan Fu <wangfan@x.ai>
2026-01-21 14:48:32 -08:00
DarkSharpness
95f59c13fd [Chore] include all jit files in building packages (#17493) 2026-01-21 14:48:02 -08:00
Jacob Gordon
cda43ffa4d ci: avoids duplication of codespell config (#17519) 2026-01-21 12:02:37 -08:00
R0CKSTAR
a1dd3d48ac [diffusion] hardware: support diffusion (single GPU, 3/N) (#17105)
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
2026-01-16 17:01:09 +08:00
Simo Lin
55a8dd0095 [grpc] Fix protobuf compilation in isolated build environments (#16754) 2026-01-08 13:09:01 -08:00
ishandhanani
0500fea965 fix editable install (#16241) 2025-12-31 14:34:54 -08:00
Prozac614
f253f43c9d [diffusion] CI: fix LoRA downloading issues and respect offline flag (#15813) 2025-12-30 11:39:27 +08:00
Kangyan-Zhou
9c4eb46099 Add a new branch cut GH workflow, and adopt setuptools-scm for version control (#15985) 2025-12-29 13:51:21 -08:00
Yuzhen Zhou
4bf06635fc [diffusion] multi-platform: support diffusion on amd and fix encoder loading on MI325 (#13760)
Co-authored-by: Sabre Shao <sabre.shao@amd.com>
Co-authored-by: Yusheng (Ethan) Su <yushengsu.thu@gmail.com>
Co-authored-by: Hubert Lu <Hubert.Lu@amd.com>
Co-authored-by: xsun <sunxiao04@gmail.com>
2025-12-19 15:38:46 +08:00
sglang-bot
5c8bd8b51b chore: bump SGLang version to 0.5.6.post2 (#14858)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-12-11 12:29:52 -08:00
Yuhao Yang
c1bd5ee8c5 Revert transformers to 4.57.1 (#14801) 2025-12-10 11:04:36 -08:00
sglang-bot
9a327bdfcf chore: bump SGLang version to 0.5.6.post1 (#14651) 2025-12-09 00:35:28 +08:00
Yuhao Yang
8200fb56cb update transformers package version to 5.0.0rc0 (#14356) 2025-12-08 22:46:01 +08:00
sglang-bot
7ae368efde chore: bump SGLang version to 0.5.6 (#14316)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-12-02 17:17:13 -08:00
Lianmin Zheng
bc3d2a85af [Minor] update docs (#14212) 2025-12-01 02:33:58 -08:00
strgrb
65ba5ab8b1 add cpp files for cpp_radix_tree to pyproject.toml. (#14052) 2025-11-30 13:05:04 +08:00
DarkSharpness
c1dd9a9599 [Fix] JIT kernel dependencies in other platforms (#13889) 2025-11-24 23:19:17 -08:00
Liangsheng Yin
b60e769d0e Tiny unpin uvloop for other backends (#13858) 2025-11-25 01:34:57 +08:00
Swipe4057
d5e0346847 xgrammar up version to 0.1.27 (#13650) 2025-11-24 10:53:45 +08:00
Even Zhou
589d9ad55b [NPU] chore: bump to CANN 8.3.RC1 and Pytorch 2.8.0 (#13647) 2025-11-21 17:07:08 +08:00
sglang-bot
7b2fb3d47c chore: bump SGLang version to 0.5.5.post3 (#13366) 2025-11-16 17:55:38 -08:00
sglang-bot
ebaf86d441 chore: bump SGLang version to 0.5.5.post2 (#13129)
Include the critical fix https://github.com/sgl-project/sglang/pull/12915.
2025-11-12 20:35:20 +08:00
sglang-bot
303cc957e6 chore: bump SGLang version to 0.5.5.post1 (#13000) 2025-11-10 11:53:43 -08:00
Gaurav Verma
34ba10efd6 [AMD] Update wave-lang to 3.8.2 (#12576)
Signed-off-by: xintin <gaurav.verma@amd.com>
2025-11-06 15:51:44 -08:00
R0CKSTAR
b07c5e4080 Pin uvloop to 0.21.0 (#12279)
Signed-off-by: Xiaodong Ye <yeahdongcn@gmail.com>
2025-11-07 03:33:31 +08:00
sglang-bot
0c006b8809 chore: bump SGLang version to 0.5.5 (#12739) 2025-11-07 00:46:19 +08:00
sglang-bot
09938e1f82 chore: bump SGLang version to 0.5.4.post3 (#12639) 2025-11-04 18:32:11 -08:00
sglang-bot
41c10e67fc chore: bump SGLang version to 0.5.4.post2 (#12439) 2025-10-31 17:38:50 -07:00
sglang-bot
55d75e11bd chore: bump SGLang version to 0.5.4.post1 (#12169) 2025-10-27 09:35:20 +08:00
sglang-bot
1053e1be17 chore: bump SGLang version to 0.5.4 (#12027)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-10-23 18:01:40 -07:00
Gaurav Verma
6f9b66bdda [AMD] Update wave-lang to 3.8.0 (#11878)
Signed-off-by: xintin <gaurav.verma@amd.com>
2025-10-20 23:11:09 -07:00
Lianmin Zheng
67e34c56d7 Fix install instructions and pyproject.tomls (#11781) 2025-10-18 01:08:01 -07:00
sglang-bot
85ebeecf06 chore: bump SGLang version to 0.5.3.post3 (#11693)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-10-16 13:14:55 -07:00
sglang-bot
baf277a9bf chore: bump SGLang version to 0.5.3.post2 (#11680)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-10-15 16:49:14 -07:00
Sahithi Chigurupati
e9e120ac7a fix: upgrade transformers to 4.57.1 (#11628)
Signed-off-by: Sahithi Chigurupati <chigurupati.sahithi@gmail.com>
Co-authored-by: zhyncs <me@zhyncs.com>
2025-10-14 18:35:05 -07:00
Yineng Zhang
4299aebdbb chore: update pyproject (#11420) 2025-10-10 00:56:30 -07:00
sglang-bot
758b887ad1 chore: bump SGLang version to 0.5.3.post1 (#11324) 2025-10-09 15:19:59 -07:00
Yineng Zhang
44cb060785 chore: upgrade flashinfer 0.4.0 (#11364) 2025-10-09 14:17:54 -07:00
Lifu Huang
edefab0c64 [2/2] Support MHA prefill with FlashAttention 4. (#10937)
Co-authored-by: Hieu Pham <hyhieu@gmail.com>
2025-10-08 00:54:20 -07:00
DarkSharpness
832c84fba9 [Chore] Update xgrammar 0.1.24 -> 0.1.25 (#10710) 2025-10-07 18:22:28 -07:00
Lianmin Zheng
eb30b888db Remove env var warnings for release (#11262) 2025-10-06 10:09:17 -07:00
sglang-bot
a4a3d82393 chore: bump SGLang version to 0.5.3 (#11263) 2025-10-06 20:07:02 +08:00