Junlin Wu
|
80a6014243
|
✨ [diffusion][npu][quant] Add MXFP8 quantization support for Wan2.2 Diffusion on Ascend NPU (#20922)
Co-authored-by: ronnie_zheng <zl19940307@163.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
|
2026-05-07 21:30:56 +03:00 |
|
Xiaoyu Zhang
|
8c703f215e
|
Add HunyuanVideo ModelOpt FP8 diffusion support (#23199)
|
2026-05-05 19:27:28 +08:00 |
|
Mick
|
2f7d99b7f7
|
[diffusion] cli: support component attention backend overrides (#24320)
|
2026-05-05 08:39:27 +08:00 |
|
Xiaoyu Zhang
|
f2d1390909
|
[Diffusion] Add Qwen Image ModelOpt FP8 support (#23155)
Co-authored-by: Mick <mickjagger19@icloud.com>
|
2026-05-04 00:24:22 +08:00 |
|
Xiaoyu Zhang
|
589f90b368
|
[diffusion] chore: use lmsys as org for modelopt checkpoints (#23924)
|
2026-05-02 17:18:58 +08:00 |
|
Mick
|
cd1fa7506a
|
[diffusion] model: support LTX2.3 high quality pipeline (#23366)
|
2026-04-24 14:18:20 +08:00 |
|
Mick
|
0d94c3366a
|
[diffusion] feat: introduce ltx-2-two-stage device manager (#22869)
|
2026-04-18 11:04:33 +08:00 |
|
Xiaoyu Zhang
|
615d6c93b2
|
[codex] Add flashinfer TRTLLM backend for diffusion NVFP4 (#22717)
|
2026-04-18 09:06:28 +08:00 |
|
Mick
|
0b2058853d
|
[diffusion] doc: update doc (#23052)
|
2026-04-17 16:23:46 +08:00 |
|
Yuhao Yang
|
9da998a882
|
[diffusion] feat: disaggregated diffusion (#21701)
|
2026-04-16 23:51:32 +08:00 |
|
Xiaoyu Zhang
|
695ab705cb
|
[diffusion] quant: update modelopt quantization docs and CI coverage (#22772)
|
2026-04-15 21:30:28 +08:00 |
|
Xiaoyu Zhang
|
f97c608caa
|
[diffusion] quant: add FLUX.1-dev modelopt nvfp4 support (#22672)
|
2026-04-14 15:00:59 +08:00 |
|
Mick
|
bf022e177c
|
Revert "[Diffusion] Add FLUX.1-dev ModelOpt NVFP4 support (#22574)" (#22649)
|
2026-04-13 11:17:32 +08:00 |
|
Xiaoyu Zhang
|
37fc47c645
|
diffusion: fix layerwise offload for ModelOpt quantized DiTs (#22594)
|
2026-04-13 08:01:54 +08:00 |
|
Xiaoyu Zhang
|
03a1a7b81c
|
[Diffusion] Add FLUX.1-dev ModelOpt NVFP4 support (#22574)
|
2026-04-13 07:57:41 +08:00 |
|
Mick
|
495ef8ec64
|
[diffusion] model: support LTX2.3 two stage (#22182)
|
2026-04-12 22:15:57 +08:00 |
|
Xiaoyu Zhang
|
1ff51555f2
|
[Diffusion] modelopt diffusion fp8 support for flux1/flux2 and wan2.2 (#22365)
|
2026-04-10 20:56:57 +08:00 |
|
Mick
|
9709192ce9
|
[diffusion] feat: support FLUX.2-small-decoder (#22414)
|
2026-04-09 15:53:14 +08:00 |
|
Mick
|
82c41a2d9e
|
[diffusion] model: support LTX2.3 (#22111)
|
2026-04-06 12:26:30 +08:00 |
|
Prozac614
|
db3d4f4b76
|
[diffusion] model: support two stage pipeline of LTX-2 (#20707)
Co-authored-by: daiweitao <dwti614707404@163.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
Co-authored-by: GMI Xiao Jin <xiao.j@gmicloud.ai>
|
2026-04-04 09:37:28 +08:00 |
|
yuefeng Wu
|
a20d12ae96
|
[diffusion][doc]: add ring sp performance benchmark page (#20998)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2026-03-30 20:26:05 +03:00 |
|
Makcum888e
|
f4b0e9c64a
|
[diffusion] [NPU] support ring attention on NPU with FA (#21383)
|
2026-03-30 20:10:55 +03:00 |
|
Mick
|
b76730701b
|
[diffusion] feat: enhance overlay mechanism (#21648)
|
2026-03-30 19:45:34 +08:00 |
|
Артем Савкин
|
27071e0a43
|
[NPU] Update quantization&CI documentation (#21100)
Co-authored-by: Tamir Baydasov <41994229+TamirBaydasov@users.noreply.github.com>
|
2026-03-28 21:42:21 +03:00 |
|
Mick
|
fc9de157f9
|
[diffusion] feat: support overlay model materialization (#21600)
|
2026-03-28 23:02:38 +08:00 |
|
Mick
|
6425df5c8a
|
[diffusion] doc: consolidate documentation (#21373)
|
2026-03-25 16:01:32 +08:00 |
|
Mick
|
6cc5717e8a
|
[diffusion] doc: update quantization.md (#21356)
|
2026-03-25 14:48:38 +08:00 |
|
Jiaxin(Jackson) Deng
|
c4db64c16b
|
Add Lychee Doc Links Check to Local and CI (#19742)
Co-authored-by: Zijie Xia <zijie_xia@icloud.com>
Co-authored-by: Zijie Xia <zijiexia@users.noreply.github.com>
Co-authored-by: zijiexia <37504505+zijiexia@users.noreply.github.com>
|
2026-03-24 13:48:26 -07:00 |
|
Xiaoyu Zhang
|
766d225fcc
|
Add SGLang CUDA crash API logging inspired by FlashInfer (#20910)
|
2026-03-22 16:39:40 +08:00 |
|
DefTruth
|
025691cd9e
|
[diffusion] chore: bump up cache-dit & support quant for diffusers backend (#20361)
|
2026-03-17 12:51:31 +08:00 |
|
R0CKSTAR
|
dae5c6cadf
|
[diffusion] doc: add Moore Threads as a supported vendor (#20146)
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
|
2026-03-11 10:15:15 -07:00 |
|
Aleksi Vesanto
|
c8bbe5010a
|
[diffusion] feat: add AITER Sage attention backend (#20178)
|
2026-03-11 12:17:45 +08:00 |
|
R0CKSTAR
|
db97f193b7
|
[diffusion][llm] macOS support (#19549)
Signed-off-by: Xiaodong Ye <yeahdongcn@gmail.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
|
2026-03-10 13:11:07 -07:00 |
|
Yuhao Yang
|
57f28fda90
|
[diffusion] chore: add diffusion new model skill (#19605)
|
2026-03-09 09:45:23 +08:00 |
|
Junhao Liu
|
7662b8b919
|
[diffusion] feat: implement upscaling (#19723)
|
2026-03-09 02:06:40 +08:00 |
|
DefTruth
|
e9b5706545
|
[diffusion] feat: support torch compile for diffusers backend (#19673)
|
2026-03-04 14:08:45 +08:00 |
|
Mick
|
d098c8dab0
|
[diffusion] add .claude and update contributing with attitude towards vibe-pr (#19511)
|
2026-03-01 14:41:55 +08:00 |
|
Junhao Liu
|
53c767d224
|
[diffusion] Postprocess: implement frame interpolation using RIFE (#19384)
|
2026-02-28 14:13:20 +08:00 |
|
joesun
|
ca5f2e2ed1
|
[diffusion] fix: Support default response_format=url in /v1/images/generations to avoid 400 errors when response_format is omitted (#19360)
Co-authored-by: Makcum888e <79456407+Makcum888e@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2026-02-27 19:47:38 +08:00 |
|
DefTruth
|
78d6674c45
|
[diffusion] feat: support hybrid parallelism for diffusers backend (#19405)
|
2026-02-27 00:06:08 +08:00 |
|
Mick
|
6503f94211
|
[diffusion] feat: support passing component path via server args (#19108)
|
2026-02-21 21:22:47 +08:00 |
|
Mick
|
b89ca65789
|
[diffusion] refactor: reduce redundancy and improve stage api (#19060)
|
2026-02-21 16:35:47 +08:00 |
|
Makcum888e
|
14c95d255c
|
[Diffusion] [NPU] [Doc] Add NPU documentation for sglang-diffusion (#18894)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2026-02-17 10:12:20 +03:00 |
|
Xinwei Qiang
|
356e338607
|
[diffusion] feat: support SparseVideoGen2 attention backend (#17507)
Co-authored-by: Mick <mickjagger19@icloud.com>
|
2026-02-13 16:20:46 +08:00 |
|
HuangJi
|
f4d80f9d42
|
[diffusion] feat: allows quality adjustment of generated images/videos (#17937)
|
2026-02-13 11:56:20 +08:00 |
|
qianyue76
|
f06ab17a73
|
[diffusion] docs: consolidate diffusion documentation into docs (#18095)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: JiaxinD <djx2048@gmail.com>
|
2026-02-11 16:55:07 -08:00 |
|