---
title: Advanced Features
description: Advanced configuration, optimization, and deployment features for SGLang.
---

- [Server Arguments](./server_arguments)
- [Hyperparameter Tuning](./hyperparameter_tuning)
- [Attention Backend](./attention_backend)
- [Speculative Decoding](./speculative_decoding)
- [Structured Outputs](./structured_outputs)
- [Quantization](./quantization)
- [Expert Parallelism](./expert_parallelism)
- [LoRA](./lora)
- [PD Disaggregation](./pd_disaggregation)
- [Pipeline Parallelism](./pipeline_parallelism)
- [HiCache](./hicache_best_practices)
- [Observability](./observability)
- [And more…](./server_arguments)