Commit Graph

14 Commits

Author SHA1 Message Date
Junlin Wu
80a6014243 [diffusion][npu][quant] Add MXFP8 quantization support for Wan2.2 Diffusion on Ascend NPU (#20922)
Co-authored-by: ronnie_zheng <zl19940307@163.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2026-05-07 21:30:56 +03:00
Xiaoyu Zhang
8c703f215e Add HunyuanVideo ModelOpt FP8 diffusion support (#23199) 2026-05-05 19:27:28 +08:00
Xiaoyu Zhang
f2d1390909 [Diffusion] Add Qwen Image ModelOpt FP8 support (#23155)
Co-authored-by: Mick <mickjagger19@icloud.com>
2026-05-04 00:24:22 +08:00
Xiaoyu Zhang
589f90b368 [diffusion] chore: use lmsys as org for modelopt checkpoints (#23924) 2026-05-02 17:18:58 +08:00
Xiaoyu Zhang
615d6c93b2 [codex] Add flashinfer TRTLLM backend for diffusion NVFP4 (#22717) 2026-04-18 09:06:28 +08:00
Xiaoyu Zhang
695ab705cb [diffusion] quant: update modelopt quantization docs and CI coverage (#22772) 2026-04-15 21:30:28 +08:00
Xiaoyu Zhang
f97c608caa [diffusion] quant: add FLUX.1-dev modelopt nvfp4 support (#22672) 2026-04-14 15:00:59 +08:00
Mick
bf022e177c Revert "[Diffusion] Add FLUX.1-dev ModelOpt NVFP4 support (#22574)" (#22649) 2026-04-13 11:17:32 +08:00
Xiaoyu Zhang
37fc47c645 diffusion: fix layerwise offload for ModelOpt quantized DiTs (#22594) 2026-04-13 08:01:54 +08:00
Xiaoyu Zhang
03a1a7b81c [Diffusion] Add FLUX.1-dev ModelOpt NVFP4 support (#22574) 2026-04-13 07:57:41 +08:00
Xiaoyu Zhang
1ff51555f2 [Diffusion] modelopt diffusion fp8 support for flux1/flux2 and wan2.2 (#22365) 2026-04-10 20:56:57 +08:00
Артем Савкин
27071e0a43 [NPU] Update quantization&CI documentation (#21100)
Co-authored-by: Tamir Baydasov <41994229+TamirBaydasov@users.noreply.github.com>
2026-03-28 21:42:21 +03:00
Mick
6425df5c8a [diffusion] doc: consolidate documentation (#21373) 2026-03-25 16:01:32 +08:00
Mick
6cc5717e8a [diffusion] doc: update quantization.md (#21356) 2026-03-25 14:48:38 +08:00