Bingxu Chen
|
69eb95f20c
|
[AMD] Pin peft<0.19 in pyproject_other.toml to fix ROCm CI ImportError (#23161)
Co-authored-by: HAI <hixiao@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2026-04-19 23:43:56 -07:00 |
|
YC Yen-Ching Tseng
|
f0f0148167
|
Revert "feat: Support MXFP4 quantized dense models on AMD CDNA2/CDNA3 GPUs (#19143)" (#23031)
|
2026-04-16 21:53:25 -07:00 |
|
Daifeng Li
|
2cc52d8326
|
feat: Support MXFP4 quantized dense models on AMD CDNA2/CDNA3 GPUs (#19143)
|
2026-04-16 16:51:32 -07:00 |
|
Xinyuan Tong
|
34fef07a15
|
Upgrade transformers to 5.5.3 and refactor hf_transformers_utils into subpackage (#21569)
|
2026-04-15 20:03:44 -07:00 |
|
Asish Kumar
|
39810762d2
|
fix: use describe mode for SGLang version detection (#22600)
Signed-off-by: Asish Kumar <officialasishkumar@gmail.com>
|
2026-04-13 09:45:45 -07:00 |
|
ori
|
f7a1740101
|
[MUSA][9/N] Add FA3 attention backend support through MATE (MUSA AI Tensor Engine) (#22051)
Co-authored-by: zhiguo.qin <zhiguo.qin@mthreads.com>
|
2026-04-10 14:18:39 -07:00 |
|
Chang Min Bark
|
a757c1e3fb
|
[Apple Silicon] [MLX] Add mlx and mlx-lm dependencies (#22162)
Co-authored-by: R0CKSTAR <yeahdongcn@gmail.com>
|
2026-04-07 11:36:43 +08:00 |
|
Kangyan-Zhou
|
93109cc89b
|
[Fix] Fix setuptools-scm version resolution for rc tags (#22165)
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
|
2026-04-05 16:55:32 -07:00 |
|
Baizhou Zhang
|
efa7b2d5d3
|
Revert "[MUSA][9/N] Add FA3 attention backend support through MATE (MUSA AI Tensor Engine)" (#22002)
|
2026-04-02 20:42:13 -07:00 |
|
ori
|
939cf398a9
|
[MUSA][9/N] Add FA3 attention backend support through MATE (MUSA AI Tensor Engine) (#17985)
Co-authored-by: R0CKSTAR <xiaodong.ye@mthreads.com>
|
2026-04-02 15:04:31 -07:00 |
|
R0CKSTAR
|
ca3286d2d5
|
[diffusion] hardware: support FA3 attention backend on MUSA (attn backend, 14/N) (#18648)
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
|
2026-04-01 10:49:34 -07:00 |
|
jacky.cheng
|
c86f6c2831
|
[AMD] Add peft>=0.18.0 to diffusion_hip deps for transformers 5.x compat for AMD diffusion model (#21442)
Co-authored-by: HaiShaw <hixiao@gmail.com>
|
2026-03-28 20:28:05 -07:00 |
|
Anant Sharma
|
f289d173aa
|
[Deps] Bump xgrammar to 0.1.32 (#21032)
|
2026-03-26 01:22:37 -07:00 |
|
R0CKSTAR
|
a305964159
|
[MLX] Add native MLX execution backend for Apple Silicon Mac (#20342)
Signed-off-by: Xiaodong Ye <yeahdongcn@gmail.com>
|
2026-03-26 00:09:17 -07:00 |
|
R0CKSTAR
|
7553b7dcb0
|
chore: extract diffusion_common in python/pyproject_other.toml (#20803)
Signed-off-by: Xiaodong Ye <yeahdongcn@gmail.com>
|
2026-03-19 10:39:16 +08:00 |
|
Xinyuan Tong
|
d1e95af282
|
Upgrade transformers==5.3.0 (#17784)
Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
Co-authored-by: Kangyan-Zhou <zky314343421@gmail.com>
Co-authored-by: Alison Shao <alisonshao@mac.lan>
Co-authored-by: Mick <mickjagger19@icloud.com>
|
2026-03-18 13:50:43 -07:00 |
|
Simo Lin
|
654fc02cf1
|
[gRPC] Extract gRPC servicer into standalone package (#20478)
Signed-off-by: Simo Lin <linsimo.mark@gmail.com>
|
2026-03-13 09:13:29 -07:00 |
|
Yuhao Yang
|
a57a44739f
|
[diffusion] deps: upgrade diffusers from 0.36.0 to 0.37.0 (#20318)
|
2026-03-12 19:17:28 +08:00 |
|
R0CKSTAR
|
db97f193b7
|
[diffusion][llm] macOS support (#19549)
Signed-off-by: Xiaodong Ye <yeahdongcn@gmail.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
|
2026-03-10 13:11:07 -07:00 |
|
Xinyuan Tong
|
4a757990a1
|
[VLM] Replace decord with torchcodec for video decoding (#20055)
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: BakerBunker <17872844+BakerBunker@users.noreply.github.com>
|
2026-03-09 19:23:49 +08:00 |
|
Jasonzhang517
|
d939e26585
|
[model gateway][0/N] router EPD support: add encoder grpc server backend support (#16552)
Co-authored-by: Zongyao Chen <ZongYao.Chen@linux.alibaba.com>
Co-authored-by: Zongyao Chen <solar1s@163.com>
|
2026-03-03 19:38:15 +08:00 |
|
Prozac614
|
57c5c343d7
|
[diffusion] model: support Hunyuan3D-2 (#18170)
Co-authored-by: yingluosanqian <yingluosanqian@gmail.com>
Co-authored-by: daiweitao <dwti614707404@163.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
|
2026-03-02 12:28:05 +08:00 |
|
GMI Xiao Jin
|
fcfd964d7d
|
[diffusion] model: LTX-2 Support PR3 (#19151)
|
2026-02-24 16:55:28 +08:00 |
|
Simo Lin
|
92c5749f41
|
refactor: replace local proto compilation with smg-grpc-proto package (#18682)
|
2026-02-12 05:29:24 -08:00 |
|
Mick
|
977096ae03
|
[diffusion] cli: introduce generic attention backend configuration in ServerArgs (#18036)
|
2026-02-02 09:47:40 +08:00 |
|
RoyWang
|
30adf78f82
|
[diffusion]: align sglang diffusion AMD pyproject_other.toml diffusion dependency with pyproject.toml (#16225)
Co-authored-by: roywang <roywang@amd.com>
|
2026-01-29 01:50:57 -08:00 |
|
Prozac614
|
3fcda00e8c
|
[CI] Fix CI timeouts by upgrading runai_model_streamer (related to #16937) (#17636)
|
2026-01-28 17:09:45 -08:00 |
|
gingerXue
|
e9d727cb92
|
[MUSA][7/N] Enhance CUDA / PyNccl wrapper to support MTLink connectivity detection (#17499)
Signed-off-by: jingzhi.xue <jingzhi.xue@mthreads.com>
Co-authored-by: jingzhi.xue <jingzhi.xue@mthreads.com>
|
2026-01-28 11:36:30 -08:00 |
|
Makcum888e
|
bba6e38ff8
|
[NPU] Split pyproject npu from pyproject other (#17641)
|
2026-01-26 09:45:44 -08:00 |
|
R0CKSTAR
|
a77729a276
|
[MUSA][1/N] sglang.check_env (#16959)
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
|
2026-01-23 14:41:17 -08:00 |
|
Baizhou Zhang
|
fafa171529
|
[hotfix] Fixes on cuda 13 docker image (#17541)
Co-authored-by: iforgetmyname <iforgetmyname@users.noreply.github>
|
2026-01-22 12:29:55 +08:00 |
|
Lianmin Zheng
|
b74a57a8d9
|
[Auto Sync] Update detokenizer_manager.py, io_struct.py, mu... (20260120) (#17442)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Wangfan Fu <wangfan@x.ai>
|
2026-01-21 14:48:32 -08:00 |
|
DarkSharpness
|
95f59c13fd
|
[Chore] include all jit files in building packages (#17493)
|
2026-01-21 14:48:02 -08:00 |
|
Jacob Gordon
|
cda43ffa4d
|
ci: avoids duplication of codespell config (#17519)
|
2026-01-21 12:02:37 -08:00 |
|
R0CKSTAR
|
a1dd3d48ac
|
[diffusion] hardware: support diffusion (single GPU, 3/N) (#17105)
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
|
2026-01-16 17:01:09 +08:00 |
|
Simo Lin
|
55a8dd0095
|
[grpc] Fix protobuf compilation in isolated build environments (#16754)
|
2026-01-08 13:09:01 -08:00 |
|
ishandhanani
|
0500fea965
|
fix editable install (#16241)
|
2025-12-31 14:34:54 -08:00 |
|
Prozac614
|
f253f43c9d
|
[diffusion] CI: fix LoRA downloading issues and respect offline flag (#15813)
|
2025-12-30 11:39:27 +08:00 |
|
Kangyan-Zhou
|
9c4eb46099
|
Add a new branch cut GH workflow, and adopt setuptools-scm for version control (#15985)
|
2025-12-29 13:51:21 -08:00 |
|
Yuzhen Zhou
|
4bf06635fc
|
[diffusion] multi-platform: support diffusion on amd and fix encoder loading on MI325 (#13760)
Co-authored-by: Sabre Shao <sabre.shao@amd.com>
Co-authored-by: Yusheng (Ethan) Su <yushengsu.thu@gmail.com>
Co-authored-by: Hubert Lu <Hubert.Lu@amd.com>
Co-authored-by: xsun <sunxiao04@gmail.com>
|
2025-12-19 15:38:46 +08:00 |
|
sglang-bot
|
5c8bd8b51b
|
chore: bump SGLang version to 0.5.6.post2 (#14858)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
|
2025-12-11 12:29:52 -08:00 |
|
Yuhao Yang
|
c1bd5ee8c5
|
Revert transformers to 4.57.1 (#14801)
|
2025-12-10 11:04:36 -08:00 |
|
sglang-bot
|
9a327bdfcf
|
chore: bump SGLang version to 0.5.6.post1 (#14651)
|
2025-12-09 00:35:28 +08:00 |
|
Yuhao Yang
|
8200fb56cb
|
update transformers package version to 5.0.0rc0 (#14356)
|
2025-12-08 22:46:01 +08:00 |
|
sglang-bot
|
7ae368efde
|
chore: bump SGLang version to 0.5.6 (#14316)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
|
2025-12-02 17:17:13 -08:00 |
|
Lianmin Zheng
|
bc3d2a85af
|
[Minor] update docs (#14212)
|
2025-12-01 02:33:58 -08:00 |
|
strgrb
|
65ba5ab8b1
|
add cpp files for cpp_radix_tree to pyproject.toml. (#14052)
|
2025-11-30 13:05:04 +08:00 |
|
DarkSharpness
|
c1dd9a9599
|
[Fix] JIT kernel dependencies in other platforms (#13889)
|
2025-11-24 23:19:17 -08:00 |
|
Liangsheng Yin
|
b60e769d0e
|
Tiny unpin uvloop for other backends (#13858)
|
2025-11-25 01:34:57 +08:00 |
|
Swipe4057
|
d5e0346847
|
xgrammar up version to 0.1.27 (#13650)
|
2025-11-24 10:53:45 +08:00 |
|