mispa-ms
|
d8d9d32b29
|
[docker] Fix stray backslash dropping sgl-model-gateway COPY (#23097)
Signed-off-by: misunp <misunp@nvidia.com>
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
|
2026-04-20 13:44:05 -07:00 |
|
ishandhanani
|
6f6843c582
|
[Docker] Move Rust toolchain install to torch_deps stage (#23278)
|
2026-04-20 13:13:10 -07:00 |
|
Alex Nails
|
332ec5e5ee
|
[release] install rust toolchain in main dockerfile (#23014)
|
2026-04-20 09:50:08 -07:00 |
|
ybyang
|
271c177443
|
[NPU]chore(docker): use editable install for sglang in npu.Dockerfile (#23040)
|
2026-04-17 17:08:39 +08:00 |
|
jhchouuu
|
1412e287bf
|
[AMD][MoRI] bump MoRI to v1.1.0 (#22870)
|
2026-04-16 00:11:55 -07:00 |
|
Vladimir (Vova) Vagaytsev
|
5ef67cee16
|
[AMD] Fix aiter import failure in ROCm Docker images (#22363)
|
2026-04-15 18:40:05 -07:00 |
|
Alexis MacAskill
|
e15401ee0e
|
Add runai-model-streamer into Python packages installed in Dockerfile and fix NotADirectoryError Docker regression (#22537)
|
2026-04-14 16:25:41 -07:00 |
|
Mohammad Miadh Angkad
|
90ef8ce54d
|
[Docker] Remove flashinfer cache copy (#22653)
|
2026-04-13 09:48:22 -07:00 |
|
Thomas Wang
|
4a746ea462
|
[AMD] Remove aiter hotfixes in Dockerfile covered by aiter v0.1.12.post1 (#22657)
|
2026-04-13 00:01:37 -07:00 |
|
Polisetty V R K Jyothendra Varma
|
7d2c11970c
|
[Intel GPU] Upgrade pytorch xpu version to 2.11 (#21908)
Signed-off-by: P V R K Jyothendra Varma <polisetty.v.r.k.jyothendra.varma@intel.com>
Co-authored-by: Ma Mingfei <mingfei.ma@intel.com>
|
2026-04-13 13:16:24 +08:00 |
|
Mohammad Miadh Angkad
|
701a0e0c25
|
[CI/Docker] Clean up redundant flashinfer cubin downloads (#22491)
|
2026-04-12 12:30:41 -07:00 |
|
Bingxu Chen
|
213027951a
|
[AMD] Upgrade Aiter (#22264)
|
2026-04-10 18:40:43 -07:00 |
|
ishandhanani
|
aa103eab8d
|
[Docker] Optimize Dockerfile for BuildKit layer caching (#22160)
|
2026-04-09 15:34:57 -07:00 |
|
Kangyan-Zhou
|
9d905efa2c
|
[Docker] Fix Trivy CVEs, cubin download 403s, and kernels command order (#22322)
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
|
2026-04-09 12:26:22 -07:00 |
|
sglang-bot
|
df3275bd6c
|
chore: bump flashinfer version to 0.6.7.post3 (#22382)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
|
2026-04-08 14:49:45 -07:00 |
|
Rain Jiang
|
1a8eb890f6
|
Kernels community fa3 (#20796)
|
2026-04-07 12:48:44 -07:00 |
|
sglang-bot
|
46bf19cdab
|
chore: bump flashinfer version to 0.6.7.post2 (#22097)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
|
2026-04-04 02:16:25 -07:00 |
|
sglang-bot
|
84118acf50
|
chore: bump sglang-kernel version to 0.4.1 (#22009)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
|
2026-04-03 13:58:35 -07:00 |
|
Duyi-Wang
|
ac593fed90
|
[AMD][Dockerfile] Support build-arg AITER_COMMIT for rocm.Dockerfile (#21949)
|
2026-04-03 01:54:28 -07:00 |
|
monkeyLoveding
|
658a2813d8
|
[NPU] Update CI Dependency (#21578)
|
2026-04-03 16:22:11 +08:00 |
|
Thomas Wang
|
7431db7392
|
[AMD] Enable FP8 KV cache and FP8 attention kernel for NSA on MI300/MI355 with TileLang backend (#21511)
|
2026-04-03 00:58:23 -07:00 |
|
sglang-bot
|
ca3ba05a7a
|
chore: bump flashinfer version to 0.6.7 (#21422)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
|
2026-03-31 21:18:16 -07:00 |
|
Kangyan-Zhou
|
ea6b22fb85
|
Fix CVEs in Docker image: pillow, linux-libc-dev, and broken sgl-model-gateway build (#21789)
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
|
2026-03-31 20:07:15 -07:00 |
|
jhchouuu
|
4b8456e266
|
[AMD][MoRI] bump MoRI to v0.1.0 (#21673)
|
2026-03-30 14:44:11 -07:00 |
|
Lianmin Zheng
|
27ac831a84
|
docs: improve CI and testing documentation (#21202)
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
|
2026-03-23 10:48:50 -07:00 |
|
Rain Jiang
|
cb1e63aba4
|
bump fa4 to official released fa4 pkg (#20303)
|
2026-03-17 17:22:56 -07:00 |
|
Xiaoyu Zhang
|
15097c5c3b
|
Release sglang kernel 0.4.0 (#20440)
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
|
2026-03-16 20:34:58 +08:00 |
|
sglang-bot
|
93afe15b43
|
chore: bump flashinfer version to 0.6.6 (#20480)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
|
2026-03-14 13:05:10 -07:00 |
|
Rain Jiang
|
ab4b863546
|
fix ci by removing nvidia-cutlass-dsl-libs-base and force reinstall n… (#20380)
|
2026-03-11 13:37:33 -07:00 |
|
Polisetty V R K Jyothendra Varma
|
b2dd104ade
|
[Intel GPU] Upgrade pytorch xpu version to 2.10 (#20254)
Signed-off-by: P V R K Jyothendra Varma <polisetty.v.r.k.jyothendra.varma@intel.com>
|
2026-03-10 18:47:25 -07:00 |
|
Bingxu Chen
|
2e7682414b
|
[AMD] Fix Aiter Prebuild When Releasing ROCm720 Image (#20195)
|
2026-03-09 21:02:38 -07:00 |
|
kk
|
f016738f4c
|
fix syntax error: "&&" unexpected (#20093)
|
2026-03-07 02:06:21 -08:00 |
|
kk
|
bd108a5971
|
Add workaround for aiter triton gemm config issue (#20090)
Co-authored-by: HAI <hixiao@gmail.com>
|
2026-03-07 01:21:31 -08:00 |
|
Thomas Wang
|
550506894a
|
[AMD] Upgrade aiter version (#19936)
Co-authored-by: HaiShaw <hixiao@gmail.com>
|
2026-03-06 02:31:04 -08:00 |
|
Hubert Lu
|
05f68e1230
|
[AMD] Fix the hipDeviceGetName issue in ROCm based docker images (#19440)
Co-authored-by: bingxche <Bingxu.Chen@amd.com>
|
2026-03-03 11:42:48 -08:00 |
|
Mohammad Miadh Angkad
|
6822941514
|
[FlashInfer] Bump FlashInfer version from 0.6.3 to 0.6.4 (#19005)
|
2026-03-02 16:12:09 -08:00 |
|
Duyi-Wang
|
8240a87306
|
[AMD] MORI-EP support for EP4. (#19578)
|
2026-02-28 13:13:46 -08:00 |
|
Alan Kao
|
9b2fbf7e6a
|
[AMD] Merge Dockerfiles for ROCm (#19203)
Co-authored-by: bingxche <Bingxu.Chen@amd.com>
|
2026-02-27 00:53:51 -08:00 |
|
Michael
|
f230967e65
|
[AMD] Fix ROCm Docker builds, update apache-tvm-ffi (#19359)
|
2026-02-26 10:16:28 +08:00 |
|
Hubert Lu
|
8bd644765f
|
[AMD] Enable ROCm kvcache JIT path and add AMD CI coverage. (#18992)
Co-authored-by: Cursor <cursoragent@cursor.com>
|
2026-02-25 14:15:05 +08:00 |
|
Simo Lin
|
edba96b98a
|
fix(docker): migrate ROCm Dockerfiles from setuptools-rust to maturin (#19210)
Signed-off-by: Simo Lin <linsimo.mark@gmail.com>
|
2026-02-23 19:00:05 -08:00 |
|
HAI
|
6a999dbdf8
|
[AMD] ENV flags tuning and cleanup (#19176)
|
2026-02-22 22:40:00 -08:00 |
|
Clint
|
3224836d8b
|
Update rocm7.2 Dockerfile to install amdsmi for QuickReduce Initialization (#19091)
|
2026-02-22 21:32:14 -08:00 |
|
HAI
|
0215d47007
|
[AMD] ROCm7.2: Add /sgl-workspace/aiter to PYTHONPATH (#18972)
|
2026-02-18 02:21:39 -08:00 |
|
Duyi-Wang
|
5ddc84e33e
|
[AMD] MORI-EP inter kernel type switch (#18437)
Co-authored-by: HAI <hixiao@gmail.com>
|
2026-02-15 20:59:39 -08:00 |
|
chenxu214
|
4e162d4b1b
|
change npu.dockerfile (#18835)
|
2026-02-15 20:43:15 +08:00 |
|
Mohammad Miadh Angkad
|
1be41e9036
|
[FlashInfer] Bump FlashInfer version from 0.6.2 to 0.6.3 (#18448)
|
2026-02-14 07:43:33 +08:00 |
|
HAI
|
f4417475b8
|
Build ROCm7.2 Image with latest AITER v0.1.10.post3 (#18741)
|
2026-02-12 14:30:13 -08:00 |
|
Thomas Wang
|
e20e6c28b9
|
[AMD] Fix accuracy issue when running TP4 dsv3 model with mtp (#18607)
Co-authored-by: YC Tseng <yctseng@amd.com>
Co-authored-by: kkHuang-amd <wunhuang@amd.com>
|
2026-02-12 01:13:16 -08:00 |
|
YC Tseng
|
20554a0a4f
|
[AMD] rocm 7.2 image release, PR test, Nightly Test (#17799)
Co-authored-by: Alan Kao <akao@amd.com>
Co-authored-by: bingxche <Bingxu.Chen@amd.com>
Co-authored-by: Michael <13900043+michaelzhang-ai@users.noreply.github.com>
|
2026-02-11 21:29:25 -08:00 |
|