HAI
|
934b36693c
|
Reasoning models fix docs (#18963)
|
2026-02-17 23:05:55 -08:00 |
|
rinbaro
|
de6a03260f
|
[docs] fix misspellings & typos (#18276)
|
2026-02-05 03:35:29 +00:00 |
|
Xiaoyu Zhang
|
c08b54a575
|
[JIT kernel] Update jit_kernel cache and develop doc (#17842)
|
2026-01-28 15:09:47 +08:00 |
|
Hubert Lu
|
93423ff780
|
[AMD] Deprecate ROCm 6.3 artifacts and standardize gfx942 on ROCm 7 (#17785)
|
2026-01-27 15:58:49 -08:00 |
|
zijiexia
|
9f8b79f16f
|
[Docs] Fix formatting in Evaluating New Models with SGLang (#17376)
|
2026-01-19 18:22:30 -08:00 |
|
zijiexia
|
79ddc34c1c
|
[Docs] Add new model evaluation docs (#17043)
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
Co-authored-by: 赵晨阳 <zhaochen20@outlook.com>
|
2026-01-19 16:35:03 -08:00 |
|
Liangsheng Yin
|
77d3566555
|
Tiny fix wording about CI preemption. (#16773)
|
2026-01-09 11:08:17 +08:00 |
|
DarkSharpness
|
291f11ae39
|
[Minor] Enhance JIT kernel and add dev docs (#14570)
|
2025-12-23 22:34:59 +08:00 |
|
Baizhou Zhang
|
42fcf5438f
|
Revert "tiny remove deprecated endpoint call" (#14533)
|
2025-12-05 23:48:54 -08:00 |
|
Alison Shao
|
e41664ba1a
|
[Docs] Add /rerun-stage command to contribution guide (#14521)
|
2025-12-05 15:46:47 -08:00 |
|
b8zhong
|
ec7b2c16d9
|
tiny remove deprecated endpoint call (#13607)
|
2025-12-05 09:54:49 -08:00 |
|
sglang-bot
|
b5d3998508
|
Rename secrets.WHL_TOKEN -> secrets.GH_PAT_FOR_WHL_RELEASE (#14421)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
|
2025-12-04 18:24:54 -08:00 |
|
Lianmin Zheng
|
8a7b1b8301
|
[Docs] Update CI docs (#14260)
|
2025-12-01 18:15:03 -08:00 |
|
Liangsheng Yin
|
19729f723e
|
[CI] Align metric units for CI rate limit (#13633)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2025-11-20 16:25:57 +08:00 |
|
Liangsheng Yin
|
109f27ba3a
|
[CI] update pr-gate to be compatible with new slash triggering manager. (#13522)
|
2025-11-19 00:49:13 +08:00 |
|
sglang-bot
|
c1a30aa765
|
Add /tag-and-rerun-ci (#13521)
|
2025-11-18 06:53:53 -08:00 |
|
Lianmin Zheng
|
2e1dbdb258
|
Update docs (#13519)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
|
2025-11-18 06:24:58 -08:00 |
|
Lianmin Zheng
|
6d025fd35b
|
Trigger CI retry with edit (#13516)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
|
2025-11-18 05:42:31 -08:00 |
|
Lianmin Zheng
|
63807079b9
|
Add docs on trigger ci (#13513)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
|
2025-11-18 05:23:05 -08:00 |
|
Lianmin Zheng
|
e2d6746808
|
Add .github/CI_PERMISSIONS.json to define the CI permissions (#13509)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
|
2025-11-18 04:00:15 -08:00 |
|
Liangsheng Yin
|
4e41edcb9c
|
[CI] remove auto-labeling run-ci label. (#13486)
|
2025-11-18 14:59:46 +08:00 |
|
Ying Sheng
|
15bc1f5cd7
|
Update .github/MAINTAINER.md (#13398)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
|
2025-11-16 21:32:24 -08:00 |
|
Lianmin Zheng
|
7e626d12b7
|
Update docs (#13391)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
|
2025-11-16 19:36:33 -08:00 |
|
kyleliang-nv
|
597d416070
|
[feature] Add layerwise NVTX support (#11870)
|
2025-11-15 19:20:56 -08:00 |
|
Kangyan-Zhou
|
6a3b9fd00f
|
Update setup_github_runner.md
|
2025-11-02 20:44:09 -08:00 |
|
Kangyan-Zhou
|
ceb105a780
|
Update setup_github_runner.md
|
2025-10-25 09:22:00 -07:00 |
|
Qiaolin Yu
|
547003bdd0
|
fix command line usage of profiling (#11793)
|
2025-10-18 12:54:36 +08:00 |
|
Lianmin Zheng
|
b9a54e0968
|
[minor] sync code on python/sglang/test/test_deterministic.py and improve ci tests (#11777)
Co-authored-by: Stefan He <hebiaobuaa@gmail.com>
Co-authored-by: Byron Hsu <byronhsu1230@gmail.com>
|
2025-10-17 14:25:22 -07:00 |
|
Xiaoyu Zhang
|
88a6f9dab5
|
bench_serving support PD Disaggregation (#11542)
|
2025-10-13 19:43:26 -07:00 |
|
Neelabh Sinha
|
aaf7af1b17
|
[FEATURE] Add Profile Trace Merger for Distributed Traces (#11413)
|
2025-10-14 09:20:17 +08:00 |
|
Kevin Xiang Li
|
e3bb7f5ae6
|
benchmark: enhance configurable multimodal benchmarking in bench_serving (#9812)
Co-authored-by: Xiang (Kevin) Li <lik@nvidia.com>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
|
2025-10-08 01:31:36 -07:00 |
|
Lianmin Zheng
|
b1f0fc1c0b
|
Add CI timeout guidelines (#10829)
|
2025-09-23 22:08:02 -07:00 |
|
Lianmin Zheng
|
50dc0c1e9c
|
Run tests based on labels (#10456)
|
2025-09-15 00:29:20 -07:00 |
|
Teng Ma
|
a02071a12c
|
[Bench] feat: mooncake trace integration (#9839)
Signed-off-by: Xuchun Shang <xuchun.shang@linux.alibaba.com>
Signed-off-by: Teng Ma <sima.mt@alibaba-inc.com>
Co-authored-by: Xuchun Shang <xuchun.shang@linux.alibaba.com>
|
2025-09-09 02:50:54 +08:00 |
|
yhyang201
|
a85363c199
|
[docs] Instructions for bench_serving.py (#9071)
Co-authored-by: Mick <mickjagger19@icloud.com>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
Co-authored-by: zhaochenyang20 <zhaochen20@outlook.com>
Co-authored-by: zhaochenyang20 <zhaochenyang20@gmail.com>
Co-authored-by: Yineng Zhang <me@zhyncs.com>
|
2025-08-26 18:30:57 -07:00 |
|
Lianmin Zheng
|
1ec9769753
|
[Docs] Update contribution guide (#9383)
|
2025-08-19 23:37:45 -07:00 |
|
Lianmin Zheng
|
ecc9f3e47a
|
[Minor] Fix the style of sgl-kernel (#9332)
|
2025-08-18 23:45:00 -07:00 |
|
Lianmin Zheng
|
c480a3f6ea
|
Minor style fixes for sgl-kernel (#9289)
|
2025-08-18 09:38:35 -07:00 |
|
Yineng Zhang
|
fab0f6e77d
|
chore: bump v0.5.0rc2 (#9203)
|
2025-08-14 16:11:16 -07:00 |
|
Lianmin Zheng
|
9e426466af
|
Clean up allocators (#9134)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2025-08-13 13:56:04 -07:00 |
|
Lianmin Zheng
|
2e8e7e353b
|
Improve docs and developer guide (#9044)
|
2025-08-10 21:05:18 -07:00 |
|
Lianmin Zheng
|
2449a0afe2
|
Refactor the docs (#9031)
|
2025-08-10 19:49:45 -07:00 |
|