Logo
Explore Help
Register Sign In
kvcache-ai/sglang
1
0
Fork 0
You've already forked sglang
mirror of https://github.com/kvcache-ai/sglang.git synced 2026-07-01 04:08:10 +00:00
Code Issues Packages Projects Releases Wiki Activity
11,824 Commits 157 Branches 0 Tags
cfd49e233ceb94de3f702f70aef06e299836aede
Commit Graph

10 Commits

Author SHA1 Message Date
Xiaoyu Zhang
615d6c93b2 [codex] Add flashinfer TRTLLM backend for diffusion NVFP4 (#22717) 2026-04-18 09:06:28 +08:00
Xiaoyu Zhang
695ab705cb [diffusion] quant: update modelopt quantization docs and CI coverage (#22772) 2026-04-15 21:30:28 +08:00
Xiaoyu Zhang
f97c608caa [diffusion] quant: add FLUX.1-dev modelopt nvfp4 support (#22672) 2026-04-14 15:00:59 +08:00
Mick
bf022e177c Revert "[Diffusion] Add FLUX.1-dev ModelOpt NVFP4 support (#22574)" (#22649) 2026-04-13 11:17:32 +08:00
Xiaoyu Zhang
37fc47c645 diffusion: fix layerwise offload for ModelOpt quantized DiTs (#22594) 2026-04-13 08:01:54 +08:00
Xiaoyu Zhang
03a1a7b81c [Diffusion] Add FLUX.1-dev ModelOpt NVFP4 support (#22574) 2026-04-13 07:57:41 +08:00
Xiaoyu Zhang
1ff51555f2 [Diffusion] modelopt diffusion fp8 support for flux1/flux2 and wan2.2 (#22365) 2026-04-10 20:56:57 +08:00
Артем Савкин
27071e0a43 [NPU] Update quantization&CI documentation (#21100)
Co-authored-by: Tamir Baydasov <41994229+TamirBaydasov@users.noreply.github.com>
2026-03-28 21:42:21 +03:00
Mick
6425df5c8a [diffusion] doc: consolidate documentation (#21373) 2026-03-25 16:01:32 +08:00
Mick
6cc5717e8a [diffusion] doc: update quantization.md (#21356) 2026-03-25 14:48:38 +08:00
Powered by Gitea Version: 1.25.4 Page: 341ms Template: 8ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API