Commit Graph

42 Commits

Author SHA1 Message Date
HAI
934b36693c Reasoning models fix docs (#18963) 2026-02-17 23:05:55 -08:00
rinbaro
de6a03260f [docs] fix misspellings & typos (#18276) 2026-02-05 03:35:29 +00:00
Xiaoyu Zhang
c08b54a575 [JIT kernel] Update jit_kernel cache and develop doc (#17842) 2026-01-28 15:09:47 +08:00
Hubert Lu
93423ff780 [AMD] Deprecate ROCm 6.3 artifacts and standardize gfx942 on ROCm 7 (#17785) 2026-01-27 15:58:49 -08:00
zijiexia
9f8b79f16f [Docs] Fix formatting in Evaluating New Models with SGLang (#17376) 2026-01-19 18:22:30 -08:00
zijiexia
79ddc34c1c [Docs] Add new model evaluation docs (#17043)
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
Co-authored-by: 赵晨阳 <zhaochen20@outlook.com>
2026-01-19 16:35:03 -08:00
Liangsheng Yin
77d3566555 Tiny fix wording about CI preemption. (#16773) 2026-01-09 11:08:17 +08:00
DarkSharpness
291f11ae39 [Minor] Enhance JIT kernel and add dev docs (#14570) 2025-12-23 22:34:59 +08:00
Baizhou Zhang
42fcf5438f Revert "tiny remove deprecated endpoint call" (#14533) 2025-12-05 23:48:54 -08:00
Alison Shao
e41664ba1a [Docs] Add /rerun-stage command to contribution guide (#14521) 2025-12-05 15:46:47 -08:00
b8zhong
ec7b2c16d9 tiny remove deprecated endpoint call (#13607) 2025-12-05 09:54:49 -08:00
sglang-bot
b5d3998508 Rename secrets.WHL_TOKEN -> secrets.GH_PAT_FOR_WHL_RELEASE (#14421)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
2025-12-04 18:24:54 -08:00
Lianmin Zheng
8a7b1b8301 [Docs] Update CI docs (#14260) 2025-12-01 18:15:03 -08:00
Liangsheng Yin
19729f723e [CI] Align metric units for CI rate limit (#13633)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-11-20 16:25:57 +08:00
Liangsheng Yin
109f27ba3a [CI] update pr-gate to be compatible with new slash triggering mananer. (#13522) 2025-11-19 00:49:13 +08:00
sglang-bot
c1a30aa765 Add /tag-and-rerun-ci (#13521) 2025-11-18 06:53:53 -08:00
Lianmin Zheng
2e1dbdb258 Update docs (#13519)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-11-18 06:24:58 -08:00
Lianmin Zheng
6d025fd35b Trigger CI retry with edit (#13516)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-11-18 05:42:31 -08:00
Lianmin Zheng
63807079b9 Add docs on trigger ci (#13513)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-11-18 05:23:05 -08:00
Lianmin Zheng
e2d6746808 Add .github/CI_PERMISSIONS.json to define the CI permissions (#13509)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-11-18 04:00:15 -08:00
Liangsheng Yin
4e41edcb9c [CI] remove auto-labeling run-ci label. (#13486) 2025-11-18 14:59:46 +08:00
Ying Sheng
15bc1f5cd7 Update .github/MAINTAINER.md (#13398)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-11-16 21:32:24 -08:00
Lianmin Zheng
7e626d12b7 Update docs (#13391)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-11-16 19:36:33 -08:00
kyleliang-nv
597d416070 [feature] Add layerwise NVTX support (#11870) 2025-11-15 19:20:56 -08:00
Kangyan-Zhou
6a3b9fd00f Update setup_github_runner.md 2025-11-02 20:44:09 -08:00
Kangyan-Zhou
ceb105a780 Update setup_github_runner.md 2025-10-25 09:22:00 -07:00
Qiaolin Yu
547003bdd0 fix command line usage of profiling (#11793) 2025-10-18 12:54:36 +08:00
Lianmin Zheng
b9a54e0968 [minor] sync code on python/sglang/test/test_deterministic.py and improve ci tests (#11777)
Co-authored-by: Stefan He <hebiaobuaa@gmail.com>
Co-authored-by: Byron Hsu <byronhsu1230@gmail.com>
2025-10-17 14:25:22 -07:00
Xiaoyu Zhang
88a6f9dab5 bench_serving support PD Disaggregation (#11542) 2025-10-13 19:43:26 -07:00
Neelabh Sinha
aaf7af1b17 [FEATURE] Add Profile Trace Merger for Distributed Traces (#11413) 2025-10-14 09:20:17 +08:00
Kevin Xiang Li
e3bb7f5ae6 benchmark: enhance configurable multimodal benchmarking in bench_serving (#9812)
Co-authored-by: Xiang (Kevin) Li <lik@nvidia.com>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
2025-10-08 01:31:36 -07:00
Lianmin Zheng
b1f0fc1c0b Add CI timeout guidelines (#10829) 2025-09-23 22:08:02 -07:00
Lianmin Zheng
50dc0c1e9c Run tests based on labels (#10456) 2025-09-15 00:29:20 -07:00
Teng Ma
a02071a12c [Bench] feat: mooncake trace integration (#9839)
Signed-off-by: Xuchun Shang <xuchun.shang@linux.alibaba.com>
Signed-off-by: Teng Ma <sima.mt@alibaba-inc.com>
Co-authored-by: Xuchun Shang <xuchun.shang@linux.alibaba.com>
2025-09-09 02:50:54 +08:00
yhyang201
a85363c199 [docs] Instructions for bench_serving.py (#9071)
Co-authored-by: Mick <mickjagger19@icloud.com>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
Co-authored-by: zhaochenyang20 <zhaochen20@outlook.com>
Co-authored-by: zhaochenyang20 <zhaochenyang20@gmail.com>
Co-authored-by: Yineng Zhang <me@zhyncs.com>
2025-08-26 18:30:57 -07:00
Lianmin Zheng
1ec9769753 [Docs] Update contribution guide (#9383) 2025-08-19 23:37:45 -07:00
Lianmin Zheng
ecc9f3e47a [Minor] Fix the style of sgl-kernel (#9332) 2025-08-18 23:45:00 -07:00
Lianmin Zheng
c480a3f6ea Minor style fixes for sgl-kernel (#9289) 2025-08-18 09:38:35 -07:00
Yineng Zhang
fab0f6e77d chore: bump v0.5.0rc2 (#9203) 2025-08-14 16:11:16 -07:00
Lianmin Zheng
9e426466af Clean up allocators (#9134)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-08-13 13:56:04 -07:00
Lianmin Zheng
2e8e7e353b Improve docs and developer guide (#9044) 2025-08-10 21:05:18 -07:00
Lianmin Zheng
2449a0afe2 Refactor the docs (#9031) 2025-08-10 19:49:45 -07:00