Commit Graph

51 Commits

Author SHA1 Message Date
Po-Han Huang (NVIDIA)
ada52e5972 [Docs] Move ptxas sm_103a workaround into For CUDA 13 section (#22852)
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-14 22:30:21 -07:00
Артем Савкин
27071e0a43 [NPU] Update quantization&CI documentation (#21100)
Co-authored-by: Tamir Baydasov <41994229+TamirBaydasov@users.noreply.github.com>
2026-03-28 21:42:21 +03:00
Xiaoyu Zhang
15097c5c3b Release sglang kernel 0.4.0 (#20440)
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
2026-03-16 20:34:58 +08:00
Mohammad Miadh Angkad
1b76eb9361 [Doc] Update version references and add automation (#18409) 2026-03-04 09:51:46 -08:00
Mohammad Miadh Angkad
c47c2f9466 [Doc] Update CUDA 13 install guide to install torch first (#18404)
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
2026-02-07 18:04:37 +08:00
Baizhou Zhang
c7d53fa26a Set torch url index in pyproject.toml (#16802) 2026-02-01 13:23:52 +08:00
Lianmin Zheng
56e6652d1d Lazy import torchao (#17626) 2026-01-22 22:04:51 -08:00
Lianmin Zheng
b74a57a8d9 [Auto Sync] Update detokenizer_manager.py, io_struct.py, mu... (20260120) (#17442)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Wangfan Fu <wangfan@x.ai>
2026-01-21 14:48:32 -08:00
Mohammad Miadh Angkad
088758c1c1 [Tiny] Improve docs (#17264) 2026-01-18 14:57:01 +08:00
Mohammad Miadh Angkad
c771933dc5 [Doc] Tiny docs update for CUDA 13 (#17200) 2026-01-16 20:53:36 +08:00
Baizhou Zhang
8b99af9af8 [Doc] Tiny update Cuda 13 environment instructions (#17174) 2026-01-16 06:12:26 +08:00
Yi Zhong
7dde3438e2 Show how to use cu13 image with B300 (#17170)
Signed-off-by: vincentzed <207368749+vincentzed@users.noreply.github.com>
2026-01-15 15:25:20 -05:00
Baizhou Zhang
94fc26aad8 [Doc]Update note for Cuda 13 container usage (#16805) 2026-01-10 14:03:19 +08:00
Lianmin Zheng
d56d14e566 [Docs] Improve docs for install on gb200 (#16760) 2026-01-09 03:07:13 -08:00
Sirut Buasai
cceb5e6aa5 add AWS SGLang DLC to docs (#16686) 2026-01-08 20:06:31 -08:00
sglang-bot
5c8bd8b51b chore: bump SGLang version to 0.5.6.post2 (#14858)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-12-11 12:29:52 -08:00
sglang-bot
9a327bdfcf chore: bump SGLang version to 0.5.6.post1 (#14651) 2025-12-09 00:35:28 +08:00
ishandhanani
498ea41ca6 dockerfile: add runtime stage + ubuntu 24.04 (#13861) 2025-12-05 00:28:36 -08:00
sglang-bot
7ae368efde chore: bump SGLang version to 0.5.6 (#14316)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-12-02 17:17:13 -08:00
Sirut Buasai
a63f433b6f extend sagemaker.Dockerfile serve script to allow all sglang serve flags (#13173) 2025-11-17 13:14:17 -08:00
Lianmin Zheng
7e626d12b7 Update docs (#13391)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-11-16 19:36:33 -08:00
sglang-bot
7b2fb3d47c chore: bump SGLang version to 0.5.5.post3 (#13366) 2025-11-16 17:55:38 -08:00
sglang-bot
ebaf86d441 chore: bump SGLang version to 0.5.5.post2 (#13129)
Include the critical fix https://github.com/sgl-project/sglang/pull/12915.
2025-11-12 20:35:20 +08:00
sglang-bot
303cc957e6 chore: bump SGLang version to 0.5.5.post1 (#13000) 2025-11-10 11:53:43 -08:00
sglang-bot
0c006b8809 chore: bump SGLang version to 0.5.5 (#12739) 2025-11-07 00:46:19 +08:00
sglang-bot
09938e1f82 chore: bump SGLang version to 0.5.4.post3 (#12639) 2025-11-04 18:32:11 -08:00
sglang-bot
41c10e67fc chore: bump SGLang version to 0.5.4.post2 (#12439) 2025-10-31 17:38:50 -07:00
ishandhanani
285a8e6986 docker: add CUDA13 support in dockerfile and update GDRCopy/NVSHMEM for blackwell support (#11517)
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
2025-10-27 22:00:54 -07:00
sglang-bot
55d75e11bd chore: bump SGLang version to 0.5.4.post1 (#12169) 2025-10-27 09:35:20 +08:00
Lianmin Zheng
4caca1ba04 Clean up server args & Add CI scripts (#12124) 2025-10-25 11:53:57 -07:00
sglang-bot
1053e1be17 chore: bump SGLang version to 0.5.4 (#12027)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-10-23 18:01:40 -07:00
Lianmin Zheng
67e34c56d7 Fix install instructions and pyproject.tomls (#11781) 2025-10-18 01:08:01 -07:00
sglang-bot
85ebeecf06 chore: bump SGLang version to 0.5.3.post3 (#11693)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-10-16 13:14:55 -07:00
sglang-bot
baf277a9bf chore: bump SGLang version to 0.5.3.post2 (#11680)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-10-15 16:49:14 -07:00
Yineng Zhang
ab9187a20b docs: update sglang installation guide (#11659) 2025-10-15 00:35:48 -07:00
Lianmin Zheng
d314bf6010 Update install.md (#11631) 2025-10-14 14:34:46 -07:00
sglang-bot
758b887ad1 chore: bump SGLang version to 0.5.3.post1 (#11324) 2025-10-09 15:19:59 -07:00
sglang-bot
a4a3d82393 chore: bump SGLang version to 0.5.3 (#11263) 2025-10-06 20:07:02 +08:00
sglang-bot
0b13cbb7c9 chore: bump SGLang version to 0.5.3rc2 (#11259)
Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com>
2025-10-06 01:12:10 -07:00
Lianmin Zheng
d645ae90a3 Rename runner labels (#11228) 2025-10-05 18:05:41 -07:00
Yineng Zhang
86a32bb5cd chore: bump v0.5.3rc0 (#10468) 2025-09-15 03:55:18 -07:00
Yineng Zhang
b0d25e72c4 chore: bump v0.5.2 (#10221) 2025-09-11 16:09:20 -07:00
Yineng Zhang
fa9c82d339 chore: bump v0.5.2rc2 (#10050) 2025-09-04 20:07:27 -07:00
Yineng Zhang
18f91eb639 chore: bump v0.5.2rc1 (#9920) 2025-09-02 04:43:34 -07:00
Yineng Zhang
16e56ea693 chore: bump v0.5.2rc0 (#9862) 2025-09-01 03:07:36 -07:00
Yineng Zhang
bc80dc4ce0 chore: bump v0.5.1.post3 (#9716) 2025-08-27 15:42:42 -07:00
Yineng Zhang
e3e97a120b chore: bump v0.5.1.post2 (#9592) 2025-08-25 03:45:09 -07:00
Yineng Zhang
e0ab167db0 chore: bump v0.5.1.post1 (#9558) 2025-08-24 01:14:17 -07:00
Lianmin Zheng
97a38ee85b Release 0.5.1 (#9533) 2025-08-23 07:09:26 -07:00
Yineng Zhang
fab0f6e77d chore: bump v0.5.0rc2 (#9203) 2025-08-14 16:11:16 -07:00