ybyang
|
41258f874d
|
[PD]feat(bench): add --fake-prefill flag for decode-only stress testing (#22973)
|
2026-04-16 13:57:55 -07:00 |
|
Zaire
|
71377deda7
|
[Docs] fix profiling endpoint (#22982)
Signed-off-by: Zaire404 <3147879462@qq.com>
|
2026-04-16 12:51:39 -04:00 |
|
David Cheung
|
ed427e1299
|
Migrate all callers from /get_server_info to /server_info (#21463)
|
2026-04-01 21:17:50 -07:00 |
|
Baizhou Zhang
|
5b19c9a05d
|
[Doc] Update tips for developer new-comers (#21659)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2026-03-29 22:40:36 -07:00 |
|
Lianmin Zheng
|
83997080a6
|
docs: flesh out MAINTAINER.md oncall lists and link GitHub profiles (#21575)
|
2026-03-27 17:39:16 -07:00 |
|
zwang86
|
5fc5c18bed
|
fix(security): replace unsafe pickle.loads with SafeUnpickler for CVE-2026-3989 (#20904)
|
2026-03-27 00:43:41 -07:00 |
|
Jiaxin(Jackson) Deng
|
c4db64c16b
|
Add Lychee Doc Links Check to Local and CI (#19742)
Co-authored-by: Zijie Xia <zijie_xia@icloud.com>
Co-authored-by: Zijie Xia <zijiexia@users.noreply.github.com>
Co-authored-by: zijiexia <37504505+zijiexia@users.noreply.github.com>
|
2026-03-24 13:48:26 -07:00 |
|
Lianmin Zheng
|
2d7a262ca3
|
ci: rename 1/2-gpu-runner labels to 1/2-gpu-h100 (#21008)
|
2026-03-20 06:04:15 -07:00 |
|
Xiaoyu Zhang
|
20a23e3173
|
[SKILL] Refine kernel authoring docs and validate add-jit-kernel / add-sgl-kernel end to end with Codex (#20867)
|
2026-03-18 23:00:33 +08:00 |
|
Ke Bao
|
c42da50289
|
Update test guide to contribution guide (#20805)
|
2026-03-18 13:25:16 +08:00 |
|
Xiaoyu Zhang
|
15097c5c3b
|
Release sglang kernel 0.4.0 (#20440)
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
|
2026-03-16 20:34:58 +08:00 |
|
xingsy97
|
f8d4eb7022
|
[Docs] Add docstrings to JIT kernel include headers (#19770)
|
2026-03-07 20:48:00 +08:00 |
|
Alison Shao
|
2c856c6d27
|
Allow PR authors to use /rerun-failed-ci on their own PRs (#19496)
Co-authored-by: Alison Shao <alisonshao@MacBook-Pro-D2W773R9CD.local>
|
2026-02-27 10:14:57 -08:00 |
|
Julian Huang
|
a55f658835
|
[Misc] Normalize --host parameter to use plain hostname without scheme (#19309)
Co-authored-by: 墨楼 <huangzhilin.hzl@antgroup.com>
Co-authored-by: Liangsheng Yin <lsyincs@gmail.com>
Co-authored-by: Liangsheng Yin <hnyls2002@gmail.com>
|
2026-02-25 00:37:24 -08:00 |
|
nvjullin
|
3fe93b5493
|
Updated benchmark guide (#19243)
|
2026-02-24 21:11:17 -08:00 |
|
HAI
|
934b36693c
|
Reasoning models fix docs (#18963)
|
2026-02-17 23:05:55 -08:00 |
|
rinbaro
|
de6a03260f
|
[docs] fix misspellings & typos (#18276)
|
2026-02-05 03:35:29 +00:00 |
|
Xiaoyu Zhang
|
c08b54a575
|
[JIT kernel] Update jit_kernel cache and develop doc (#17842)
|
2026-01-28 15:09:47 +08:00 |
|
Hubert Lu
|
93423ff780
|
[AMD] Deprecate ROCm 6.3 artifacts and standardize gfx942 on ROCm 7 (#17785)
|
2026-01-27 15:58:49 -08:00 |
|
zijiexia
|
9f8b79f16f
|
[Docs] Fix formatting in Evaluating New Models with SGLang (#17376)
|
2026-01-19 18:22:30 -08:00 |
|
zijiexia
|
79ddc34c1c
|
[Docs] Add new model evaluation docs (#17043)
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
Co-authored-by: 赵晨阳 <zhaochen20@outlook.com>
|
2026-01-19 16:35:03 -08:00 |
|
Liangsheng Yin
|
77d3566555
|
Tiny fix wording about CI preemption. (#16773)
|
2026-01-09 11:08:17 +08:00 |
|
DarkSharpness
|
291f11ae39
|
[Minor] Enhance JIT kernel and add dev docs (#14570)
|
2025-12-23 22:34:59 +08:00 |
|
Baizhou Zhang
|
42fcf5438f
|
Revert "tiny remove deprecated endpoint call" (#14533)
|
2025-12-05 23:48:54 -08:00 |
|
Alison Shao
|
e41664ba1a
|
[Docs] Add /rerun-stage command to contribution guide (#14521)
|
2025-12-05 15:46:47 -08:00 |
|
b8zhong
|
ec7b2c16d9
|
tiny remove deprecated endpoint call (#13607)
|
2025-12-05 09:54:49 -08:00 |
|
sglang-bot
|
b5d3998508
|
Rename secrets.WHL_TOKEN -> secrets.GH_PAT_FOR_WHL_RELEASE (#14421)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
|
2025-12-04 18:24:54 -08:00 |
|
Lianmin Zheng
|
8a7b1b8301
|
[Docs] Update CI docs (#14260)
|
2025-12-01 18:15:03 -08:00 |
|
Liangsheng Yin
|
19729f723e
|
[CI] Align metric units for CI rate limit (#13633)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2025-11-20 16:25:57 +08:00 |
|
Liangsheng Yin
|
109f27ba3a
|
[CI] update pr-gate to be compatible with new slash triggering mananer. (#13522)
|
2025-11-19 00:49:13 +08:00 |
|
sglang-bot
|
c1a30aa765
|
Add /tag-and-rerun-ci (#13521)
|
2025-11-18 06:53:53 -08:00 |
|
Lianmin Zheng
|
2e1dbdb258
|
Update docs (#13519)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
|
2025-11-18 06:24:58 -08:00 |
|
Lianmin Zheng
|
6d025fd35b
|
Trigger CI retry with edit (#13516)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
|
2025-11-18 05:42:31 -08:00 |
|
Lianmin Zheng
|
63807079b9
|
Add docs on trigger ci (#13513)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
|
2025-11-18 05:23:05 -08:00 |
|
Lianmin Zheng
|
e2d6746808
|
Add .github/CI_PERMISSIONS.json to define the CI permissions (#13509)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
|
2025-11-18 04:00:15 -08:00 |
|
Liangsheng Yin
|
4e41edcb9c
|
[CI] remove auto-labeling run-ci label. (#13486)
|
2025-11-18 14:59:46 +08:00 |
|
Ying Sheng
|
15bc1f5cd7
|
Update .github/MAINTAINER.md (#13398)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
|
2025-11-16 21:32:24 -08:00 |
|
Lianmin Zheng
|
7e626d12b7
|
Update docs (#13391)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
|
2025-11-16 19:36:33 -08:00 |
|
kyleliang-nv
|
597d416070
|
[feature] Add layerwise NVTX support (#11870)
|
2025-11-15 19:20:56 -08:00 |
|
Kangyan-Zhou
|
6a3b9fd00f
|
Update setup_github_runner.md
|
2025-11-02 20:44:09 -08:00 |
|
Kangyan-Zhou
|
ceb105a780
|
Update setup_github_runner.md
|
2025-10-25 09:22:00 -07:00 |
|
Qiaolin Yu
|
547003bdd0
|
fix command line usage of profiling (#11793)
|
2025-10-18 12:54:36 +08:00 |
|
Lianmin Zheng
|
b9a54e0968
|
[minor] sync code on python/sglang/test/test_deterministic.py and improve ci tests (#11777)
Co-authored-by: Stefan He <hebiaobuaa@gmail.com>
Co-authored-by: Byron Hsu <byronhsu1230@gmail.com>
|
2025-10-17 14:25:22 -07:00 |
|
Xiaoyu Zhang
|
88a6f9dab5
|
bench_serving support PD Disaggregation (#11542)
|
2025-10-13 19:43:26 -07:00 |
|
Neelabh Sinha
|
aaf7af1b17
|
[FEATURE] Add Profile Trace Merger for Distributed Traces (#11413)
|
2025-10-14 09:20:17 +08:00 |
|
Kevin Xiang Li
|
e3bb7f5ae6
|
benchmark: enhance configurable multimodal benchmarking in bench_serving (#9812)
Co-authored-by: Xiang (Kevin) Li <lik@nvidia.com>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
|
2025-10-08 01:31:36 -07:00 |
|
Lianmin Zheng
|
b1f0fc1c0b
|
Add CI timeout guidelines (#10829)
|
2025-09-23 22:08:02 -07:00 |
|
Lianmin Zheng
|
50dc0c1e9c
|
Run tests based on labels (#10456)
|
2025-09-15 00:29:20 -07:00 |
|
Teng Ma
|
a02071a12c
|
[Bench] feat: mooncake trace integration (#9839)
Signed-off-by: Xuchun Shang <xuchun.shang@linux.alibaba.com>
Signed-off-by: Teng Ma <sima.mt@alibaba-inc.com>
Co-authored-by: Xuchun Shang <xuchun.shang@linux.alibaba.com>
|
2025-09-09 02:50:54 +08:00 |
|
yhyang201
|
a85363c199
|
[docs] Instructions for bench_serving.py (#9071)
Co-authored-by: Mick <mickjagger19@icloud.com>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
Co-authored-by: zhaochenyang20 <zhaochen20@outlook.com>
Co-authored-by: zhaochenyang20 <zhaochenyang20@gmail.com>
Co-authored-by: Yineng Zhang <me@zhyncs.com>
|
2025-08-26 18:30:57 -07:00 |
|