631 Commits

Author SHA1 Message Date
Jaret Burkett
f38de2a2fe Add tipsv2 locally and fix gradient checkpointing for it 2026-05-10 14:47:44 -06:00
Jaret Burkett
a12ddd72a1 Change the velocity weight cap on dfe 9 2026-05-07 07:37:05 -06:00
Jaret Burkett
6bb8acbffc Add agreement_threshold default of 0.6 to automagic 2 2026-05-05 19:13:00 -06:00
Jaret Burkett
963a9f42b2 Add decode latent to wan 2.1 models. Add gradinet checkpointing to wan vae. 2026-05-05 11:30:16 -06:00
Jaret Burkett
4260a3c5b6 Add optimizer test suite and make minor speed adjustments to Automagicv2 2026-05-05 10:02:30 -06:00
Jaret Burkett
aeca7fe404 Add Automagic v2 optimizer. It uses significantly less vram and is much more efficient. 2026-05-05 09:09:07 -06:00
Jaret Burkett
af6458d1b5 Enable caching of ACE step latents. 2026-04-28 13:39:20 -06:00
Jaret Burkett
acc6a36214 Scale DFE 9 to a velocity equiv weight to match flow matching gradient strength. Probably need to rework all DFEs to do this as the math checks out. 2026-04-28 09:10:02 -06:00
Jaret Burkett
1fc4ad3979 Add sapiens2 as a diffusion feature extractor 2026-04-27 15:59:03 -06:00
Jaret Burkett
f4445cd78c Added psuedo_huber loss 2026-04-19 09:51:46 -06:00
Jaret Burkett
488878f354 Use hidden layers in the loss for DFE 7 and 8 2026-04-18 13:07:38 -06:00
Jaret Burkett
beb40ae29b Add DFE8 with partial step 2026-04-17 17:40:16 -06:00
Jaret Burkett
22ea3dd620 Fixed issue on some systems where Logger didnt have atty 2026-04-16 21:09:52 +00:00
Jaret Burkett
ab1ee4df34 Hotfix some issues with Wan models caused by diffusers and transformers updates 2026-04-16 20:53:50 +00:00
Jaret Burkett
2faba22b46 Fix issue when saving advanced prompt embeds. No such file or directory error 2026-04-16 12:22:56 -06:00
Jaret Burkett
e74bc9ac7b Fix issue with concatinating advanced prompt embeds. 2026-04-14 16:04:34 -06:00
Jaret Burkett
7eb1226a6d Fix issue with loading advanced prompt configs metadata 2026-04-14 15:42:09 -06:00
Asaf Agami
e868fca562 fix custom_flowmatch_sampler (#783) 2026-04-13 09:42:05 -06:00
Jaret Burkett
233e292256 Added some experimental low step things for zeta 2026-04-13 09:37:34 -06:00
Jaret Burkett
1058ef3513 Made AdvancedPromptEmbeds that is compatable with previous PromptEmbeds functionality, but is more streamlined and can accomidate more model embedding paradigms. 2026-04-11 10:45:47 -06:00
Jaret Burkett
9b4e2d1b0b More flac support 2026-04-10 12:27:09 -06:00
Jaret Burkett
bfb373c8fa Prep for future breaking changes in newer versions of transformers library 2026-04-10 12:04:32 -06:00
Jaret Burkett
78cf049c29 Add support for ACE-Step 1.5 and ACE-Step 1.5 XL. Also added dataset captioning through the UI. (#785)
* Base ace step 1.5 xl added. Generating, still wip on training and ui

* Base training code done

* Fix some issues with caching text embeddings. Update sample cards to show audio

* Fix issue with quantizing ace step

* Add album artwork to samples with waveform.

* Cleanup logs

* Add album art endpoint to speed up album art loading

* Made an make video with artwork script

* Make ui handle basic audio models. Make multi line adjustments to the editor and better syntax hilighting.

* Add prompt tagging system for special tagged models.

* prompt tagging processing for ui working.

* Moved default samples to a special file so we can add more when needed and they can be adjusted for a specific model

* Add a captioner job with music captioner that is prepped for use with the ui

* Add basit ui setup for captioning modal and handeling captioning jobs

* Starting captioning job from ui working. Still better management for it.

* Better filtering of job options in the job view for captioning jobs

* Added qwen3 vl as a captioner for images

* Have an indicator when a dataset is being captioned.

* Adjust the way caption jobs look in the queue

* Fix a few issues. Adjust defaults.

* Version bump

* Added ace step to the readme.
2026-04-09 15:02:03 -06:00
Jaret Burkett
da2a79590f Add a merge network on save strength 2026-04-01 09:21:08 -06:00
Jaret Burkett
4a3251640a More work on compiling models 2026-03-31 12:11:56 -06:00
Jaret Burkett
171535833a Add Mac OS support for Apple Silicon (#770)
* Made an install script and auto updates env for mac

* GPU sensors and initial training working for MAC. Still WIP.

* Switch dataloader to single threaded until I can work around some mac pickeling issues.

* Get quantization working on mac

* Fix mac exclusive imports so they don't break other builds.

* Add mac instructions to the UI
2026-03-30 09:37:47 -06:00
Jaret Burkett
6a1fc54779 Add t0 loss target 2026-03-28 13:35:21 -06:00
Jaret Burkett
f85bf065bf Use pooler embeddings for DFE v6 with dino v3 2026-03-27 07:02:07 -06:00
Rodrigo Reis
54f4732c9b Fix the bug in temporal_compression data loader (#754) 2026-03-25 08:16:44 -06:00
Jaret Burkett
7f3309b291 Add support for audo frame count so datasets can have varrying length videos. Varous ltx 2.3 VAE optimizations such as removing tiling articacts, and doing frame split encoding to reduce vram on encoding/decoding. 2026-03-24 12:20:09 -06:00
Jaret Burkett
5642b656b9 Fix audio issues with ltx2 models. Silent codec fails now raised. Auto convert surround sound audio to stereo. Invalidate old caches just to be safe so they recache now. 2026-03-23 20:08:33 +00:00
Jaret Burkett
0f075fc45e Adjust signal amplification target. Allow signal amplification strength in config. 2026-03-22 08:30:13 -06:00
Jaret Burkett
dcd98dc0d5 Add signal amplification 2026-03-21 07:44:18 -06:00
Jaret Burkett
35b1cde3cb Fixed issue on z-image that prevented training at a larger batch size 2026-03-10 15:43:25 -06:00
Jaret Burkett
4909b809c7 Fixed issue with audio loss multiplier. 2026-03-10 15:16:09 -06:00
Jaret Burkett
06ef3d343a add ability to use batch noise correction during training 2026-03-10 09:05:57 -06:00
Jaret Burkett
b04c64e0f8 Add a dino version of DFE 2026-03-04 08:20:37 -07:00
Jaret Burkett
40f995f616 Add method to do continuious lora merging in for low vram full finetuning. 2026-02-26 09:00:41 -07:00
Jaret Burkett
1c74ca5d22 Add audio_loss_multiplier to scale audio loss to larger values if desired. 2026-02-19 11:57:44 -07:00
Jaret Burkett
3632656cda make DFE work with more VAEs 2026-02-18 09:46:37 -07:00
Jaret Burkett
a055947d56 Add signal_correction_noise_scale to config to scale the signal correction strength 2026-02-07 12:04:21 -07:00
Jaret Burkett
454722cc97 Add signal correction noise 2026-02-07 09:49:55 -07:00
Jaret Burkett
115f0a3670 Fixed error with wan models when caching text embeddings 2026-02-06 14:26:53 -07:00
Jaret Burkett
5c37db04f9 Added ability to activate experimental blank stabilization during training to zero out latents with blank prompts. 2026-02-04 13:00:03 -07:00
Jaret Burkett
1ce2428722 Shrink text embeds to max token length for LTX-2. Drastically reduces cached text embedding sizes 2026-01-28 12:54:49 -07:00
Jaret Burkett
73dedbf662 Do caching of latents, first frame and audio when caching latents for LTX2 2026-01-14 11:05:23 -07:00
Jaret Burkett
5b5aadadb8 Add LTX-2 Support (#644)
* WIP, adding support for LTX2

* Training on images working

* Fix loading comfy models

* Handle converting and deconverting lora so it matches original format

* Reworked ui to habdle ltx and propert dataset default overwriting.

* Update the way lokr saves to it is more compatable with comfy

* Audio loading and synchronization/resampling is working

* Add audio to training. Does it work? Maybe, still testing.

* Fixed fps default issue for sound

* Have ui set fps for accurate audio mapping on ltx

* Added audio procession options to the ui for ltx

* Clean up requirements
2026-01-13 04:55:30 -07:00
Jaret Burkett
a767b82b60 Fixed issue with new logger when ooming 2025-12-25 16:57:34 +00:00
Jaret Burkett
87edca1b2b Added initial support to initiate lora training from an existing lora 2025-12-22 12:49:15 -07:00
Jaret Burkett
8864ba915e Remove easy-dwpose from the default requierments 2025-12-20 07:16:20 -07:00