Commit Graph

996 Commits

Author SHA1 Message Date
Jaret Burkett
5d5a8ef9da Fixed issue with deleting datasets and jobs with newer version of node.js. Bumped minimum version of node js to 20 2026-03-25 10:04:36 -06:00
Jaret Burkett
dfde30f231 Fix issue with ltx2 custom te repo path 2026-03-25 09:50:18 -06:00
Jaret Burkett
b8000dbcbc Bump version 2026-03-25 08:18:42 -06:00
Rodrigo Reis
54f4732c9b Fix the bug in temporal_compression data loader (#754) 2026-03-25 08:16:44 -06:00
Jaret Burkett
7f3309b291 Add support for audo frame count so datasets can have varrying length videos. Varous ltx 2.3 VAE optimizations such as removing tiling articacts, and doing frame split encoding to reduce vram on encoding/decoding. 2026-03-24 12:20:09 -06:00
Rasmus Lerdorf
4ad14d211a Add an import config button (#733) 2026-03-23 15:41:27 -06:00
Remix
7a0bbca5b1 Fix random_noise_multiplier (#738)
Apply random_noise_multiplier to noise.
2026-03-23 15:22:16 -06:00
Rayane
99a4a5887b Fix Qwen attention mask crash with diffusers >=0.37 (#748)
* Fix Qwen Image mask handling

* Fix Qwen attention mask crash with diffusers >=0.37

diffusers v0.37 (PR #12987) optimizes all-ones attention masks to None
in encode_prompt() when there is no padding. This breaks ai-toolkit's
Qwen extensions which call .to() on the mask unconditionally.

Fix: reconstruct the all-ones mask at the boundary (get_prompt_embeds)
right after encode_prompt() returns. This keeps the rest of the code
unchanged and works with both old and new diffusers versions.

Also removes redundant duplicate mask assignments in qwen_image_edit
and qwen_image_edit_plus.

Fixes #740
2026-03-23 14:43:08 -06:00
Jaret Burkett
295094b4b5 Fixed new breaking change in diffusers with with qwen image 2026-03-23 14:10:55 -06:00
Jaret Burkett
5642b656b9 Fix audio issues with ltx2 models. Silent codec fails now raised. Auto convert surround sound audio to stereo. Invalidate old caches just to be safe so they recache now. 2026-03-23 20:08:33 +00:00
Jaret Burkett
561e6f201c Fixed an issue with ltx 2.3 i2v training 2026-03-23 12:41:18 -06:00
Jaret Burkett
330059d8a1 version bump 2026-03-23 11:01:16 -06:00
Jaret Burkett
e91827f9be Change gemma repo to lightricks one that is not gated 2026-03-23 11:00:32 -06:00
Jaret Burkett
253cb31362 Fix issue with video and images with no audio on ltx models 2026-03-22 22:09:23 -06:00
Jaret Burkett
4a3d317e2b Fix issue with using the default text encoder with ltx 2.3 2026-03-22 18:53:59 -06:00
Jaret Burkett
859635e95b Add support for training LTX 2.3 (#745)
* Initial support for ltx 2.3. Still needs a lot of testing to make sure it is all right.

* bump version

* Handle lora renaming keys for new ltx 2.3 layers
2026-03-22 17:56:59 -06:00
Jaret Burkett
7e1fdc3844 Remove the 0.1 floor for amplification 2026-03-22 09:01:58 -06:00
Jaret Burkett
0f075fc45e Adjust signal amplification target. Allow signal amplification strength in config. 2026-03-22 08:30:13 -06:00
Jaret Burkett
dcd98dc0d5 Add signal amplification 2026-03-21 07:44:18 -06:00
Jaret Burkett
35b1cde3cb Fixed issue on z-image that prevented training at a larger batch size 2026-03-10 15:43:25 -06:00
Jaret Burkett
4909b809c7 Fixed issue with audio loss multiplier. 2026-03-10 15:16:09 -06:00
Jaret Burkett
06ef3d343a add ability to use batch noise correction during training 2026-03-10 09:05:57 -06:00
Jaret Burkett
b04c64e0f8 Add a dino version of DFE 2026-03-04 08:20:37 -07:00
Jaret Burkett
9dee42fc09 Updated supporters 2026-03-03 08:04:37 -07:00
Jaret Burkett
35978df8a3 Adjust defaults for ui graph to get and show all losses 2026-03-02 10:27:15 -07:00
Jaret Burkett
57d407cfd4 Add support for training lodestones/Zeta-Chroma 2026-03-01 12:52:29 -07:00
Jaret Burkett
40f995f616 Add method to do continuious lora merging in for low vram full finetuning. 2026-02-26 09:00:41 -07:00
Jaret Burkett
de7d22c9be Version bump 2026-02-19 11:58:15 -07:00
Jaret Burkett
1c74ca5d22 Add audio_loss_multiplier to scale audio loss to larger values if desired. 2026-02-19 11:57:44 -07:00
Jaret Burkett
3632656cda make DFE work with more VAEs 2026-02-18 09:46:37 -07:00
Jaret Burkett
a055947d56 Add signal_correction_noise_scale to config to scale the signal correction strength 2026-02-07 12:04:21 -07:00
Jaret Burkett
454722cc97 Add signal correction noise 2026-02-07 09:49:55 -07:00
Jaret Burkett
e82cf6eec2 Fixed issue that prevented full fine-tuning of flux2 models when using gradient checkpointing 2026-02-06 16:18:43 -07:00
Jaret Burkett
1422789452 Improved the method to augment random noise 2026-02-06 15:44:10 -07:00
Jaret Burkett
115f0a3670 Fixed error with wan models when caching text embeddings 2026-02-06 14:26:53 -07:00
Jaret Burkett
5c37db04f9 Added ability to activate experimental blank stabilization during training to zero out latents with blank prompts. 2026-02-04 13:00:03 -07:00
Jaret Burkett
42acb0d4be Build out an audio player card in preperation for audio datasets and samples. 2026-02-03 08:15:55 -07:00
Jaret Burkett
50664c2421 Version bump 2026-01-28 12:55:32 -07:00
Jaret Burkett
1ce2428722 Shrink text embeds to max token length for LTX-2. Drastically reduces cached text embedding sizes 2026-01-28 12:54:49 -07:00
Jaret Burkett
ea912d2d7b Increase default sample steps from 25 to 30 for z_image 2026-01-27 09:39:21 -07:00
Jaret Burkett
2db090144a Add support for Z-Image 2026-01-27 09:34:46 -07:00
Jaret Burkett
9ef6f1a828 Increase client body size to 100 gb 2026-01-24 12:44:17 -07:00
Jaret Burkett
f29272ee90 Update diffusers version with dgx 2026-01-19 14:06:38 -07:00
Jaret Burkett
a6da9e37ac Add support for FLUX.2 klein base models 2026-01-17 17:46:25 -07:00
Jaret Burkett
0efed794b4 Fix issue where flux2 would ignore single control image on training 2026-01-17 20:26:35 +00:00
Jaret Burkett
e132dbae76 Add number of repeats for a dataset in the ui 2026-01-15 08:03:31 -07:00
Jaret Burkett
e40d7ac605 Ignore i2v on ltx is training on images 2026-01-14 18:46:27 -07:00
Jaret Burkett
9848de7946 Fix issue with ltx cached latents if there is no audio. 2026-01-14 17:27:01 -07:00
Jaret Burkett
73dedbf662 Do caching of latents, first frame and audio when caching latents for LTX2 2026-01-14 11:05:23 -07:00
Jaret Burkett
64fe29b182 Support img 2 vid training for ltx-2 2026-01-13 19:04:56 -07:00