Commit Graph

40 Commits

Author SHA1 Message Date
Mick
0d94c3366a [diffusion] feat: introduce ltx-2-two-stage device manager (#22869) 2026-04-18 11:04:33 +08:00
Xiaoyu Zhang
615d6c93b2 [codex] Add flashinfer TRTLLM backend for diffusion NVFP4 (#22717) 2026-04-18 09:06:28 +08:00
Mick
0b2058853d [diffusion] doc: update doc (#23052) 2026-04-17 16:23:46 +08:00
Yuhao Yang
9da998a882 [diffusion] feat: disaggregated diffusion (#21701) 2026-04-16 23:51:32 +08:00
Xiaoyu Zhang
695ab705cb [diffusion] quant: update modelopt quantization docs and CI coverage (#22772) 2026-04-15 21:30:28 +08:00
Xiaoyu Zhang
f97c608caa [diffusion] quant: add FLUX.1-dev modelopt nvfp4 support (#22672) 2026-04-14 15:00:59 +08:00
Mick
bf022e177c Revert "[Diffusion] Add FLUX.1-dev ModelOpt NVFP4 support (#22574)" (#22649) 2026-04-13 11:17:32 +08:00
Xiaoyu Zhang
37fc47c645 diffusion: fix layerwise offload for ModelOpt quantized DiTs (#22594) 2026-04-13 08:01:54 +08:00
Xiaoyu Zhang
03a1a7b81c [Diffusion] Add FLUX.1-dev ModelOpt NVFP4 support (#22574) 2026-04-13 07:57:41 +08:00
Mick
495ef8ec64 [diffusion] model: support LTX2.3 two stage (#22182) 2026-04-12 22:15:57 +08:00
Xiaoyu Zhang
1ff51555f2 [Diffusion] modelopt diffusion fp8 support for flux1/flux2 and wan2.2 (#22365) 2026-04-10 20:56:57 +08:00
Mick
9709192ce9 [diffusion] feat: support FLUX.2-small-decoder (#22414) 2026-04-09 15:53:14 +08:00
Mick
82c41a2d9e [diffusion] model: support LTX2.3 (#22111) 2026-04-06 12:26:30 +08:00
Prozac614
db3d4f4b76 [diffusion] model: support two stage pipeline of LTX-2 (#20707)
Co-authored-by: daiweitao <dwti614707404@163.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
Co-authored-by: GMI Xiao Jin <xiao.j@gmicloud.ai>
2026-04-04 09:37:28 +08:00
yuefeng Wu
a20d12ae96 [diffusion][doc]: add ring sp performance benchmark page (#20998)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2026-03-30 20:26:05 +03:00
Makcum888e
f4b0e9c64a [diffusion] [NPU] support ring attention on NPU with FA (#21383) 2026-03-30 20:10:55 +03:00
Mick
b76730701b [diffusion] feat: enhance overlay mechanism (#21648) 2026-03-30 19:45:34 +08:00
Артем Савкин
27071e0a43 [NPU] Update quantization&CI documentation (#21100)
Co-authored-by: Tamir Baydasov <41994229+TamirBaydasov@users.noreply.github.com>
2026-03-28 21:42:21 +03:00
Mick
fc9de157f9 [diffusion] feat: support overlay model materialization (#21600) 2026-03-28 23:02:38 +08:00
Mick
6425df5c8a [diffusion] doc: consolidate documentation (#21373) 2026-03-25 16:01:32 +08:00
Mick
6cc5717e8a [diffusion] doc: update quantization.md (#21356) 2026-03-25 14:48:38 +08:00
Jiaxin(Jackson) Deng
c4db64c16b Add Lychee Doc Links Check to Local and CI (#19742)
Co-authored-by: Zijie Xia <zijie_xia@icloud.com>
Co-authored-by: Zijie Xia <zijiexia@users.noreply.github.com>
Co-authored-by: zijiexia <37504505+zijiexia@users.noreply.github.com>
2026-03-24 13:48:26 -07:00
Xiaoyu Zhang
766d225fcc Add SGLang CUDA crash API logging inspired by FlashInfer (#20910) 2026-03-22 16:39:40 +08:00
DefTruth
025691cd9e [diffusion] chore: bump up cache-dit & support quant for diffusers backend (#20361) 2026-03-17 12:51:31 +08:00
R0CKSTAR
dae5c6cadf [diffusion] doc: add Moore Threads as a supported vendor (#20146)
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
2026-03-11 10:15:15 -07:00
Aleksi Vesanto
c8bbe5010a [diffusion] feat: add AITER Sage attention backend (#20178) 2026-03-11 12:17:45 +08:00
R0CKSTAR
db97f193b7 [diffusion][llm] macOS support (#19549)
Signed-off-by: Xiaodong Ye <yeahdongcn@gmail.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
2026-03-10 13:11:07 -07:00
Yuhao Yang
57f28fda90 [diffusion] chore: add diffusion new model skill (#19605) 2026-03-09 09:45:23 +08:00
Junhao Liu
7662b8b919 [diffusion] feat: implement upscaling (#19723) 2026-03-09 02:06:40 +08:00
DefTruth
e9b5706545 [diffusion] feat: support torch compile for diffusers backend (#19673) 2026-03-04 14:08:45 +08:00
Mick
d098c8dab0 [diffusion] add .claude and update contributing with attitude towards vibe-pr (#19511) 2026-03-01 14:41:55 +08:00
Junhao Liu
53c767d224 [diffusion] Postprocess: implement frame interpolation using RIFE (#19384) 2026-02-28 14:13:20 +08:00
joesun
ca5f2e2ed1 [diffusion] fix: Support default response_format=url in /v1/images/generations to avoid 400 errors when response_format is omitted (#19360)
Co-authored-by: Makcum888e <79456407+Makcum888e@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2026-02-27 19:47:38 +08:00
DefTruth
78d6674c45 [diffusion] feat: support hybrid parallelism for diffusers backend (#19405) 2026-02-27 00:06:08 +08:00
Mick
6503f94211 [diffusion] feat: support passing component path via server args (#19108) 2026-02-21 21:22:47 +08:00
Mick
b89ca65789 [diffusion] refactor: reduce redundancy and improve stage api (#19060) 2026-02-21 16:35:47 +08:00
Makcum888e
14c95d255c [Diffusion] [NPU] [Doc] Add NPU documentation for sglang-diffusion (#18894)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2026-02-17 10:12:20 +03:00
Xinwei Qiang
356e338607 [diffusion] feat: support SparseVideoGen2 attention backend (#17507)
Co-authored-by: Mick <mickjagger19@icloud.com>
2026-02-13 16:20:46 +08:00
HuangJi
f4d80f9d42 [diffusion] feat: allows quality adjustment of generated images/videos (#17937) 2026-02-13 11:56:20 +08:00
qianyue76
f06ab17a73 [diffusion] docs: consolidate diffusion documentation into docs (#18095)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: JiaxinD <djx2048@gmail.com>
2026-02-11 16:55:07 -08:00