Commit Graph

456 Commits

Author SHA1 Message Date
mispa-ms
d8d9d32b29 [docker] Fix stray backslash dropping sgl-model-gateway COPY (#23097)
Signed-off-by: misunp <misunp@nvidia.com>
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-20 13:44:05 -07:00
ishandhanani
6f6843c582 [Docker] Move Rust toolchain install to torch_deps stage (#23278) 2026-04-20 13:13:10 -07:00
Alex Nails
332ec5e5ee [release] install rust toolchain in main dockerfile (#23014) 2026-04-20 09:50:08 -07:00
ybyang
271c177443 [NPU]chore(docker): use editable install for sglang in npu.Dockerfile (#23040) 2026-04-17 17:08:39 +08:00
jhchouuu
1412e287bf [AMD][MoRI] bump MoRI to v1.1.0 (#22870) 2026-04-16 00:11:55 -07:00
Vladimir (Vova) Vagaytsev
5ef67cee16 [AMD] Fix aiter import failure in ROCm Docker images (#22363) 2026-04-15 18:40:05 -07:00
Alexis MacAskill
e15401ee0e Add runai-model-streamer into Python packages installed in Dockerfile and fix NotADirectoryError Docker regression (#22537) 2026-04-14 16:25:41 -07:00
Mohammad Miadh Angkad
90ef8ce54d [Docker] Remove flashinfer cache copy (#22653) 2026-04-13 09:48:22 -07:00
Thomas Wang
4a746ea462 [AMD] Remove aiter hotfixes in Dockerfile covered by aiter v0.1.12.post1 (#22657) 2026-04-13 00:01:37 -07:00
Polisetty V R K Jyothendra Varma
7d2c11970c [Intel GPU] Upgrade pytorch xpu version to 2.11 (#21908)
Signed-off-by: P V R K Jyothendra Varma <polisetty.v.r.k.jyothendra.varma@intel.com>
Co-authored-by: Ma Mingfei <mingfei.ma@intel.com>
2026-04-13 13:16:24 +08:00
Mohammad Miadh Angkad
701a0e0c25 [CI/Docker] Clean up redundant flashinfer cubin downloads (#22491) 2026-04-12 12:30:41 -07:00
Bingxu Chen
213027951a [AMD] Upgrade Aiter (#22264) 2026-04-10 18:40:43 -07:00
ishandhanani
aa103eab8d [Docker] Optimize Dockerfile for BuildKit layer caching (#22160) 2026-04-09 15:34:57 -07:00
Kangyan-Zhou
9d905efa2c [Docker] Fix Trivy CVEs, cubin download 403s, and kernels command order (#22322)
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-09 12:26:22 -07:00
sglang-bot
df3275bd6c chore: bump flashinfer version to 0.6.7.post3 (#22382)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2026-04-08 14:49:45 -07:00
Rain Jiang
1a8eb890f6 Kernels community fa3 (#20796) 2026-04-07 12:48:44 -07:00
sglang-bot
46bf19cdab chore: bump flashinfer version to 0.6.7.post2 (#22097)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2026-04-04 02:16:25 -07:00
sglang-bot
84118acf50 chore: bump sglang-kernel version to 0.4.1 (#22009)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2026-04-03 13:58:35 -07:00
Duyi-Wang
ac593fed90 [AMD][Dockerfile] Support build-arg AITER_COMMIT for rocm.Dockerfile (#21949) 2026-04-03 01:54:28 -07:00
monkeyLoveding
658a2813d8 [NPU] Update CI Dependency (#21578) 2026-04-03 16:22:11 +08:00
Thomas Wang
7431db7392 [AMD] Enable FP8 KV cache and FP8 attention kernel for NSA on MI300/MI355 with TileLang backend (#21511) 2026-04-03 00:58:23 -07:00
sglang-bot
ca3ba05a7a chore: bump flashinfer version to 0.6.7 (#21422)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
2026-03-31 21:18:16 -07:00
Kangyan-Zhou
ea6b22fb85 Fix CVEs in Docker image: pillow, linux-libc-dev, and broken sgl-model-gateway build (#21789)
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-31 20:07:15 -07:00
jhchouuu
4b8456e266 [AMD][MoRI] bump MoRI to v0.1.0 (#21673) 2026-03-30 14:44:11 -07:00
Lianmin Zheng
27ac831a84 docs: improve CI and testing documentation (#21202)
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-23 10:48:50 -07:00
Rain Jiang
cb1e63aba4 bump fa4 to official released fa4 pkg (#20303) 2026-03-17 17:22:56 -07:00
Xiaoyu Zhang
15097c5c3b Release sglang kernel 0.4.0 (#20440)
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
2026-03-16 20:34:58 +08:00
sglang-bot
93afe15b43 chore: bump flashinfer version to 0.6.6 (#20480)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2026-03-14 13:05:10 -07:00
Rain Jiang
ab4b863546 fix ci by removing nvidia-cutlass-dsl-libs-base and force reinstall n… (#20380) 2026-03-11 13:37:33 -07:00
Polisetty V R K Jyothendra Varma
b2dd104ade [Intel GPU] Upgrade pytorch xpu version to 2.10 (#20254)
Signed-off-by: P V R K Jyothendra Varma <polisetty.v.r.k.jyothendra.varma@intel.com>
2026-03-10 18:47:25 -07:00
Bingxu Chen
2e7682414b [AMD] Fix Aiter Prebuild When Releasing ROCm720 Image (#20195) 2026-03-09 21:02:38 -07:00
kk
f016738f4c fix syntax error: "&&" unexpected (#20093) 2026-03-07 02:06:21 -08:00
kk
bd108a5971 Add workaround for aiter triton gemm config issue (#20090)
Co-authored-by: HAI <hixiao@gmail.com>
2026-03-07 01:21:31 -08:00
Thomas Wang
550506894a [AMD] Upgrade aiter version (#19936)
Co-authored-by: HaiShaw <hixiao@gmail.com>
2026-03-06 02:31:04 -08:00
Hubert Lu
05f68e1230 [AMD] Fix the hipDeviceGetName issue in ROCm based docker images (#19440)
Co-authored-by: bingxche <Bingxu.Chen@amd.com>
2026-03-03 11:42:48 -08:00
Mohammad Miadh Angkad
6822941514 [FlashInfer] Bump FlashInfer version from 0.6.3 to 0.6.4 (#19005) 2026-03-02 16:12:09 -08:00
Duyi-Wang
8240a87306 [AMD] MORI-EP support for EP4. (#19578) 2026-02-28 13:13:46 -08:00
Alan Kao
9b2fbf7e6a [AMD] Merge Dockerfiles for ROCm (#19203)
Co-authored-by: bingxche <Bingxu.Chen@amd.com>
2026-02-27 00:53:51 -08:00
Michael
f230967e65 [AMD] Fix ROCm Docker builds, update apache-tvm-ffi (#19359) 2026-02-26 10:16:28 +08:00
Hubert Lu
8bd644765f [AMD] Enable ROCm kvcache JIT path and add AMD CI coverage. (#18992)
Co-authored-by: Cursor <cursoragent@cursor.com>
2026-02-25 14:15:05 +08:00
Simo Lin
edba96b98a fix(docker): migrate ROCm Dockerfiles from setuptools-rust to maturin (#19210)
Signed-off-by: Simo Lin <linsimo.mark@gmail.com>
2026-02-23 19:00:05 -08:00
HAI
6a999dbdf8 [AMD] ENV flags tuning and cleanup (#19176) 2026-02-22 22:40:00 -08:00
Clint
3224836d8b Update rocm7.2 Dockerfile to install amdsmi for QuickReduce Initialization (#19091) 2026-02-22 21:32:14 -08:00
HAI
0215d47007 [AMD] ROCm7.2: Add /sgl-workspace/aiter to PYTHONPATH (#18972) 2026-02-18 02:21:39 -08:00
Duyi-Wang
5ddc84e33e [AMD] MORI-EP inter kernel type switch (#18437)
Co-authored-by: HAI <hixiao@gmail.com>
2026-02-15 20:59:39 -08:00
chenxu214
4e162d4b1b change npu.dockerfile (#18835) 2026-02-15 20:43:15 +08:00
Mohammad Miadh Angkad
1be41e9036 [FlashInfer] Bump FlashInfer version from 0.6.2 to 0.6.3 (#18448) 2026-02-14 07:43:33 +08:00
HAI
f4417475b8 Build ROCm7.2 Image with latest AITER v0.1.10.post3 (#18741) 2026-02-12 14:30:13 -08:00
Thomas Wang
e20e6c28b9 [AMD] Fix accuracy issue when running TP4 dsv3 model with mtp (#18607)
Co-authored-by: YC Tseng <yctseng@amd.com>
Co-authored-by: kkHuang-amd <wunhuang@amd.com>
2026-02-12 01:13:16 -08:00
YC Tseng
20554a0a4f [AMD] rocm 7.2 image release, PR test, Nightly Test (#17799)
Co-authored-by: Alan Kao <akao@amd.com>
Co-authored-by: bingxche <Bingxu.Chen@amd.com>
Co-authored-by: Michael <13900043+michaelzhang-ai@users.noreply.github.com>
2026-02-11 21:29:25 -08:00