ai-toolkit

mirror of https://github.com/ostris/ai-toolkit.git synced 2026-05-12 00:42:07 +00:00

Files

M. Hofer f213e3b1e5 Fix FLUX2 Klein load-time VRAM spikes on low-memory GPUs. (#726)

Keep the transformer and Qwen text encoder off CUDA during initial load/quantization in low-VRAM mode so model startup avoids full-model OOM before offloading and quantization can take effect.

Co-authored-by: Cursor <cursoragent@cursor.com>
Co-authored-by: Jaret Burkett <jaretburkett@gmail.com>

2026-04-01 09:36:55 -06:00
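
The idea in the commit message above can be sketched roughly as follows. This is a minimal illustration, not the code from this commit: the function name `initial_load_device` and its parameters `low_vram` and `cuda_available` are hypothetical. It assumes that "low-VRAM mode" is a boolean setting and that keeping a module "off CUDA" means materializing and quantizing it on the CPU first.

```python
def initial_load_device(low_vram: bool, cuda_available: bool) -> str:
    """Pick the device on which a large module is first loaded and quantized.

    Hypothetical helper for illustration only; the names are assumptions
    and do not come from the ai-toolkit source.
    """
    # Without a GPU there is nothing to choose: everything stays on the CPU.
    if not cuda_available:
        return "cpu"
    # In low-VRAM mode, keep the full-precision weights off the GPU so the
    # unquantized model never has to fit in VRAM all at once. Quantization
    # and offloading can then run before anything is moved to CUDA.
    if low_vram:
        return "cpu"
    # With enough VRAM, loading straight onto the GPU avoids an extra copy.
    return "cuda"


# Example: decide placement for the two large components named in the commit.
for name in ("transformer", "text_encoder"):
    device = initial_load_device(low_vram=True, cuda_available=True)
    print(f"{name}: load and quantize on {device}")
```

In a real loader the returned string would presumably be passed as the device when the weights are instantiated, and the module would be moved or offloaded only after quantization has reduced its size.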

src

Fixed issue that prevented full fine-tuning of flux2 models when using gradient checkpointing

2026-02-06 16:18:43 -07:00

__init__.py

Add support for FLUX.2 klein base models

2026-01-17 17:46:25 -07:00

flux2_klein_model.py

Fix FLUX2 Klein load-time VRAM spikes on low-memory GPUs. (#726)

2026-04-01 09:36:55 -06:00

flux2_model.py

Fix FLUX2 Klein load-time VRAM spikes on low-memory GPUs. (#726)

2026-04-01 09:36:55 -06:00