| Author | Commit | Message | Date |
|---|---|---|---|
| Junlin Wu | 80a6014243 | ✨ [diffusion][npu][quant] Add MXFP8 quantization support for Wan2.2 Diffusion on Ascend NPU (#20922); Co-authored-by: ronnie_zheng <zl19940307@163.com>; Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> | 2026-05-07 21:30:56 +03:00 |
| Xiaoyu Zhang | 8c703f215e | Add HunyuanVideo ModelOpt FP8 diffusion support (#23199) | 2026-05-05 19:27:28 +08:00 |
| Xiaoyu Zhang | f2d1390909 | [Diffusion] Add Qwen Image ModelOpt FP8 support (#23155); Co-authored-by: Mick <mickjagger19@icloud.com> | 2026-05-04 00:24:22 +08:00 |
| Xiaoyu Zhang | 589f90b368 | [diffusion] chore: use lmsys as org for modelopt checkpoints (#23924) | 2026-05-02 17:18:58 +08:00 |
| Xiaoyu Zhang | 615d6c93b2 | [codex] Add flashinfer TRTLLM backend for diffusion NVFP4 (#22717) | 2026-04-18 09:06:28 +08:00 |
| Xiaoyu Zhang | 695ab705cb | [diffusion] quant: update modelopt quantization docs and CI coverage (#22772) | 2026-04-15 21:30:28 +08:00 |
| Xiaoyu Zhang | f97c608caa | [diffusion] quant: add FLUX.1-dev modelopt nvfp4 support (#22672) | 2026-04-14 15:00:59 +08:00 |
| Mick | bf022e177c | Revert "[Diffusion] Add FLUX.1-dev ModelOpt NVFP4 support (#22574)" (#22649) | 2026-04-13 11:17:32 +08:00 |
| Xiaoyu Zhang | 37fc47c645 | diffusion: fix layerwise offload for ModelOpt quantized DiTs (#22594) | 2026-04-13 08:01:54 +08:00 |
| Xiaoyu Zhang | 03a1a7b81c | [Diffusion] Add FLUX.1-dev ModelOpt NVFP4 support (#22574) | 2026-04-13 07:57:41 +08:00 |
| Xiaoyu Zhang | 1ff51555f2 | [Diffusion] modelopt diffusion fp8 support for flux1/flux2 and wan2.2 (#22365) | 2026-04-10 20:56:57 +08:00 |
| Артем Савкин | 27071e0a43 | [NPU] Update quantization&CI documentation (#21100); Co-authored-by: Tamir Baydasov <41994229+TamirBaydasov@users.noreply.github.com> | 2026-03-28 21:42:21 +03:00 |
| Mick | 6425df5c8a | [diffusion] doc: consolidate documentation (#21373) | 2026-03-25 16:01:32 +08:00 |
| Mick | 6cc5717e8a | [diffusion] doc: update quantization.md (#21356) | 2026-03-25 14:48:38 +08:00 |