ai-toolkit

mirror of https://github.com/ostris/ai-toolkit.git synced 2026-05-11 08:20:35 +00:00

Author	SHA1	Message	Date
Jaret Burkett	f38de2a2fe	Add tipsv2 locally and fix gradient checkpointing for it	2026-05-10 14:47:44 -06:00
Jaret Burkett	a12ddd72a1	Change the velocity weight cap on dfe 9	2026-05-07 07:37:05 -06:00
Jaret Burkett	6bb8acbffc	Add agreement_threshold default of 0.6 to automagic 2	2026-05-05 19:13:00 -06:00
Jaret Burkett	963a9f42b2	Add decode latent to wan 2.1 models. Add gradinet checkpointing to wan vae.	2026-05-05 11:30:16 -06:00
Jaret Burkett	4260a3c5b6	Add optimizer test suite and make minor speed adjustments to Automagicv2	2026-05-05 10:02:30 -06:00
Jaret Burkett	aeca7fe404	Add Automagic v2 optimizer. It uses significantly less vram and is much more efficient.	2026-05-05 09:09:07 -06:00
Jaret Burkett	af6458d1b5	Enable caching of ACE step latents.	2026-04-28 13:39:20 -06:00
Jaret Burkett	acc6a36214	Scale DFE 9 to a velocity equiv weight to match flow matching gradient strength. Probably need to rework all DFEs to do this as the math checks out.	2026-04-28 09:10:02 -06:00
Jaret Burkett	1fc4ad3979	Add sapiens2 as a diffusion feature extractor	2026-04-27 15:59:03 -06:00
Jaret Burkett	f4445cd78c	Added psuedo_huber loss	2026-04-19 09:51:46 -06:00
Jaret Burkett	488878f354	Use hidden layers in the loss for DFE 7 and 8	2026-04-18 13:07:38 -06:00
Jaret Burkett	beb40ae29b	Add DFE8 with partial step	2026-04-17 17:40:16 -06:00
Jaret Burkett	22ea3dd620	Fixed issue on some systems where Logger didnt have atty	2026-04-16 21:09:52 +00:00
Jaret Burkett	ab1ee4df34	Hotfix some issues with Wan models caused by diffusers and transformers updates	2026-04-16 20:53:50 +00:00
Jaret Burkett	2faba22b46	Fix issue when saving advanced prompt embeds. No such file or directory error	2026-04-16 12:22:56 -06:00
Jaret Burkett	e74bc9ac7b	Fix issue with concatinating advanced prompt embeds.	2026-04-14 16:04:34 -06:00
Jaret Burkett	7eb1226a6d	Fix issue with loading advanced prompt configs metadata	2026-04-14 15:42:09 -06:00
Asaf Agami	e868fca562	fix custom_flowmatch_sampler (#783 )	2026-04-13 09:42:05 -06:00
Jaret Burkett	233e292256	Added some experimental low step things for zeta	2026-04-13 09:37:34 -06:00
Jaret Burkett	1058ef3513	Made AdvancedPromptEmbeds that is compatable with previous PromptEmbeds functionality, but is more streamlined and can accomidate more model embedding paradigms.	2026-04-11 10:45:47 -06:00
Jaret Burkett	9b4e2d1b0b	More flac support	2026-04-10 12:27:09 -06:00
Jaret Burkett	bfb373c8fa	Prep for future breaking changes in newer versions of transformers library	2026-04-10 12:04:32 -06:00
Jaret Burkett	78cf049c29	Add support for ACE-Step 1.5 and ACE-Step 1.5 XL. Also added dataset captioning through the UI. (#785 ) * Base ace step 1.5 xl added. Generating, still wip on training and ui * Base training code done * Fix some issues with caching text embeddings. Update sample cards to show audio * Fix issue with quantizing ace step * Add album artwork to samples with waveform. * Cleanup logs * Add album art endpoint to speed up album art loading * Made an make video with artwork script * Make ui handle basic audio models. Make multi line adjustments to the editor and better syntax hilighting. * Add prompt tagging system for special tagged models. * prompt tagging processing for ui working. * Moved default samples to a special file so we can add more when needed and they can be adjusted for a specific model * Add a captioner job with music captioner that is prepped for use with the ui * Add basit ui setup for captioning modal and handeling captioning jobs * Starting captioning job from ui working. Still better management for it. * Better filtering of job options in the job view for captioning jobs * Added qwen3 vl as a captioner for images * Have an indicator when a dataset is being captioned. * Adjust the way caption jobs look in the queue * Fix a few issues. Adjust defaults. * Version bump * Added ace step to the readme.	2026-04-09 15:02:03 -06:00
Jaret Burkett	da2a79590f	Add a merge network on save strength	2026-04-01 09:21:08 -06:00
Jaret Burkett	4a3251640a	More work on compiling models	2026-03-31 12:11:56 -06:00
Jaret Burkett	171535833a	Add Mac OS support for Apple Silicon (#770 ) * Made an install script and auto updates env for mac * GPU sensors and initial training working for MAC. Still WIP. * Switch dataloader to single threaded until I can work around some mac pickeling issues. * Get quantization working on mac * Fix mac exclusive imports so they don't break other builds. * Add mac instructions to the UI	2026-03-30 09:37:47 -06:00
Jaret Burkett	6a1fc54779	Add t0 loss target	2026-03-28 13:35:21 -06:00
Jaret Burkett	f85bf065bf	Use pooler embeddings for DFE v6 with dino v3	2026-03-27 07:02:07 -06:00
Rodrigo Reis	54f4732c9b	Fix the bug in temporal_compression data loader (#754 )	2026-03-25 08:16:44 -06:00
Jaret Burkett	7f3309b291	Add support for audo frame count so datasets can have varrying length videos. Varous ltx 2.3 VAE optimizations such as removing tiling articacts, and doing frame split encoding to reduce vram on encoding/decoding.	2026-03-24 12:20:09 -06:00
Jaret Burkett	5642b656b9	Fix audio issues with ltx2 models. Silent codec fails now raised. Auto convert surround sound audio to stereo. Invalidate old caches just to be safe so they recache now.	2026-03-23 20:08:33 +00:00
Jaret Burkett	0f075fc45e	Adjust signal amplification target. Allow signal amplification strength in config.	2026-03-22 08:30:13 -06:00
Jaret Burkett	dcd98dc0d5	Add signal amplification	2026-03-21 07:44:18 -06:00
Jaret Burkett	35b1cde3cb	Fixed issue on z-image that prevented training at a larger batch size	2026-03-10 15:43:25 -06:00
Jaret Burkett	4909b809c7	Fixed issue with audio loss multiplier.	2026-03-10 15:16:09 -06:00
Jaret Burkett	06ef3d343a	add ability to use batch noise correction during training	2026-03-10 09:05:57 -06:00
Jaret Burkett	b04c64e0f8	Add a dino version of DFE	2026-03-04 08:20:37 -07:00
Jaret Burkett	40f995f616	Add method to do continuious lora merging in for low vram full finetuning.	2026-02-26 09:00:41 -07:00
Jaret Burkett	1c74ca5d22	Add audio_loss_multiplier to scale audio loss to larger values if desired.	2026-02-19 11:57:44 -07:00
Jaret Burkett	3632656cda	make DFE work with more VAEs	2026-02-18 09:46:37 -07:00
Jaret Burkett	a055947d56	Add signal_correction_noise_scale to config to scale the signal correction strength	2026-02-07 12:04:21 -07:00
Jaret Burkett	454722cc97	Add signal correction noise	2026-02-07 09:49:55 -07:00
Jaret Burkett	115f0a3670	Fixed error with wan models when caching text embeddings	2026-02-06 14:26:53 -07:00
Jaret Burkett	5c37db04f9	Added ability to activate experimental blank stabilization during training to zero out latents with blank prompts.	2026-02-04 13:00:03 -07:00
Jaret Burkett	1ce2428722	Shrink text embeds to max token length for LTX-2. Drastically reduces cached text embedding sizes	2026-01-28 12:54:49 -07:00
Jaret Burkett	73dedbf662	Do caching of latents, first frame and audio when caching latents for LTX2	2026-01-14 11:05:23 -07:00
Jaret Burkett	5b5aadadb8	Add LTX-2 Support (#644 ) * WIP, adding support for LTX2 * Training on images working * Fix loading comfy models * Handle converting and deconverting lora so it matches original format * Reworked ui to habdle ltx and propert dataset default overwriting. * Update the way lokr saves to it is more compatable with comfy * Audio loading and synchronization/resampling is working * Add audio to training. Does it work? Maybe, still testing. * Fixed fps default issue for sound * Have ui set fps for accurate audio mapping on ltx * Added audio procession options to the ui for ltx * Clean up requirements	2026-01-13 04:55:30 -07:00
Jaret Burkett	a767b82b60	Fixed issue with new logger when ooming	2025-12-25 16:57:34 +00:00
Jaret Burkett	87edca1b2b	Added initial support to initiate lora training from an existing lora	2025-12-22 12:49:15 -07:00
Jaret Burkett	8864ba915e	Remove easy-dwpose from the default requierments	2025-12-20 07:16:20 -07:00

1 2 3 4 5 ...

631 Commits