57 Commits

Author SHA1 Message Date
ybyang
41258f874d [PD]feat(bench): add --fake-prefill flag for decode-only stress testing (#22973) 2026-04-16 13:57:55 -07:00
Zaire
71377deda7 [Docs] fix profiling endpoint (#22982)
Signed-off-by: Zaire404 <3147879462@qq.com>
2026-04-16 12:51:39 -04:00
David Cheung
ed427e1299 Migrate all callers from /get_server_info to /server_info (#21463) 2026-04-01 21:17:50 -07:00
Baizhou Zhang
5b19c9a05d [Doc] Update tips for developer new-comers (#21659)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2026-03-29 22:40:36 -07:00
Lianmin Zheng
83997080a6 docs: flesh out MAINTAINER.md oncall lists and link GitHub profiles (#21575) 2026-03-27 17:39:16 -07:00
zwang86
5fc5c18bed fix(security): replace unsafe pickle.loads with SafeUnpickler for CVE-2026-3989 (#20904) 2026-03-27 00:43:41 -07:00
Jiaxin(Jackson) Deng
c4db64c16b Add Lychee Doc Links Check to Local and CI (#19742)
Co-authored-by: Zijie Xia <zijie_xia@icloud.com>
Co-authored-by: Zijie Xia <zijiexia@users.noreply.github.com>
Co-authored-by: zijiexia <37504505+zijiexia@users.noreply.github.com>
2026-03-24 13:48:26 -07:00
Lianmin Zheng
2d7a262ca3 ci: rename 1/2-gpu-runner labels to 1/2-gpu-h100 (#21008) 2026-03-20 06:04:15 -07:00
Xiaoyu Zhang
20a23e3173 [SKILL] Refine kernel authoring docs and validate add-jit-kernel / add-sgl-kernel end to end with Codex (#20867) 2026-03-18 23:00:33 +08:00
Ke Bao
c42da50289 Update test guide to contribution guide (#20805) 2026-03-18 13:25:16 +08:00
Xiaoyu Zhang
15097c5c3b Release sglang kernel 0.4.0 (#20440)
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
2026-03-16 20:34:58 +08:00
xingsy97
f8d4eb7022 [Docs] Add docstrings to JIT kernel include headers (#19770) 2026-03-07 20:48:00 +08:00
Alison Shao
2c856c6d27 Allow PR authors to use /rerun-failed-ci on their own PRs (#19496)
Co-authored-by: Alison Shao <alisonshao@MacBook-Pro-D2W773R9CD.local>
2026-02-27 10:14:57 -08:00
Julian Huang
a55f658835 [Misc] Normalize --host parameter to use plain hostname without scheme (#19309)
Co-authored-by: 墨楼 <huangzhilin.hzl@antgroup.com>
Co-authored-by: Liangsheng Yin <lsyincs@gmail.com>
Co-authored-by: Liangsheng Yin <hnyls2002@gmail.com>
2026-02-25 00:37:24 -08:00
nvjullin
3fe93b5493 Updated benchmark guide (#19243) 2026-02-24 21:11:17 -08:00
HAI
934b36693c Reasoning models fix docs (#18963) 2026-02-17 23:05:55 -08:00
rinbaro
de6a03260f [docs] fix misspellings & typos (#18276) 2026-02-05 03:35:29 +00:00
Xiaoyu Zhang
c08b54a575 [JIT kernel] Update jit_kernel cache and develop doc (#17842) 2026-01-28 15:09:47 +08:00
Hubert Lu
93423ff780 [AMD] Deprecate ROCm 6.3 artifacts and standardize gfx942 on ROCm 7 (#17785) 2026-01-27 15:58:49 -08:00
zijiexia
9f8b79f16f [Docs] Fix formatting in Evaluating New Models with SGLang (#17376) 2026-01-19 18:22:30 -08:00
zijiexia
79ddc34c1c [Docs] Add new model evaluation docs (#17043)
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
Co-authored-by: 赵晨阳 <zhaochen20@outlook.com>
2026-01-19 16:35:03 -08:00
Liangsheng Yin
77d3566555 Tiny fix wording about CI preemption. (#16773) 2026-01-09 11:08:17 +08:00
DarkSharpness
291f11ae39 [Minor] Enhance JIT kernel and add dev docs (#14570) 2025-12-23 22:34:59 +08:00
Baizhou Zhang
42fcf5438f Revert "tiny remove deprecated endpoint call" (#14533) 2025-12-05 23:48:54 -08:00
Alison Shao
e41664ba1a [Docs] Add /rerun-stage command to contribution guide (#14521) 2025-12-05 15:46:47 -08:00
b8zhong
ec7b2c16d9 tiny remove deprecated endpoint call (#13607) 2025-12-05 09:54:49 -08:00
sglang-bot
b5d3998508 Rename secrets.WHL_TOKEN -> secrets.GH_PAT_FOR_WHL_RELEASE (#14421)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
2025-12-04 18:24:54 -08:00
Lianmin Zheng
8a7b1b8301 [Docs] Update CI docs (#14260) 2025-12-01 18:15:03 -08:00
Liangsheng Yin
19729f723e [CI] Align metric units for CI rate limit (#13633)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-11-20 16:25:57 +08:00
Liangsheng Yin
109f27ba3a [CI] update pr-gate to be compatible with new slash triggering manner. (#13522) 2025-11-19 00:49:13 +08:00
sglang-bot
c1a30aa765 Add /tag-and-rerun-ci (#13521) 2025-11-18 06:53:53 -08:00
Lianmin Zheng
2e1dbdb258 Update docs (#13519)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-11-18 06:24:58 -08:00
Lianmin Zheng
6d025fd35b Trigger CI retry with edit (#13516)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-11-18 05:42:31 -08:00
Lianmin Zheng
63807079b9 Add docs on trigger ci (#13513)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-11-18 05:23:05 -08:00
Lianmin Zheng
e2d6746808 Add .github/CI_PERMISSIONS.json to define the CI permissions (#13509)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-11-18 04:00:15 -08:00
Liangsheng Yin
4e41edcb9c [CI] remove auto-labeling run-ci label. (#13486) 2025-11-18 14:59:46 +08:00
Ying Sheng
15bc1f5cd7 Update .github/MAINTAINER.md (#13398)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-11-16 21:32:24 -08:00
Lianmin Zheng
7e626d12b7 Update docs (#13391)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-11-16 19:36:33 -08:00
kyleliang-nv
597d416070 [feature] Add layerwise NVTX support (#11870) 2025-11-15 19:20:56 -08:00
Kangyan-Zhou
6a3b9fd00f Update setup_github_runner.md 2025-11-02 20:44:09 -08:00
Kangyan-Zhou
ceb105a780 Update setup_github_runner.md 2025-10-25 09:22:00 -07:00
Qiaolin Yu
547003bdd0 fix command line usage of profiling (#11793) 2025-10-18 12:54:36 +08:00
Lianmin Zheng
b9a54e0968 [minor] sync code on python/sglang/test/test_deterministic.py and improve ci tests (#11777)
Co-authored-by: Stefan He <hebiaobuaa@gmail.com>
Co-authored-by: Byron Hsu <byronhsu1230@gmail.com>
2025-10-17 14:25:22 -07:00
Xiaoyu Zhang
88a6f9dab5 bench_serving support PD Disaggregation (#11542) 2025-10-13 19:43:26 -07:00
Neelabh Sinha
aaf7af1b17 [FEATURE] Add Profile Trace Merger for Distributed Traces (#11413) 2025-10-14 09:20:17 +08:00
Kevin Xiang Li
e3bb7f5ae6 benchmark: enhance configurable multimodal benchmarking in bench_serving (#9812)
Co-authored-by: Xiang (Kevin) Li <lik@nvidia.com>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
2025-10-08 01:31:36 -07:00
Lianmin Zheng
b1f0fc1c0b Add CI timeout guidelines (#10829) 2025-09-23 22:08:02 -07:00
Lianmin Zheng
50dc0c1e9c Run tests based on labels (#10456) 2025-09-15 00:29:20 -07:00
Teng Ma
a02071a12c [Bench] feat: mooncake trace integration (#9839)
Signed-off-by: Xuchun Shang <xuchun.shang@linux.alibaba.com>
Signed-off-by: Teng Ma <sima.mt@alibaba-inc.com>
Co-authored-by: Xuchun Shang <xuchun.shang@linux.alibaba.com>
2025-09-09 02:50:54 +08:00
yhyang201
a85363c199 [docs] Instructions for bench_serving.py (#9071)
Co-authored-by: Mick <mickjagger19@icloud.com>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
Co-authored-by: zhaochenyang20 <zhaochen20@outlook.com>
Co-authored-by: zhaochenyang20 <zhaochenyang20@gmail.com>
Co-authored-by: Yineng Zhang <me@zhyncs.com>
2025-08-26 18:30:57 -07:00