ai-toolkit

mirror of https://github.com/ostris/ai-toolkit.git synced 2026-05-12 00:42:07 +00:00

Author	SHA1	Message	Date
Jaret Burkett	9b4e2d1b0b	More flac support	2026-04-10 12:27:09 -06:00
Jaret Burkett	78cf049c29	Add support for ACE-Step 1.5 and ACE-Step 1.5 XL. Also added dataset captioning through the UI. (#785) * Base ace step 1.5 xl added. Generating, still wip on training and ui * Base training code done * Fix some issues with caching text embeddings. Update sample cards to show audio * Fix issue with quantizing ace step * Add album artwork to samples with waveform. * Cleanup logs * Add album art endpoint to speed up album art loading * Made a make video with artwork script * Make ui handle basic audio models. Make multi line adjustments to the editor and better syntax highlighting. * Add prompt tagging system for special tagged models. * prompt tagging processing for ui working. * Moved default samples to a special file so we can add more when needed and they can be adjusted for a specific model * Add a captioner job with music captioner that is prepped for use with the ui * Add basic ui setup for captioning modal and handling captioning jobs * Starting captioning job from ui working. Still needs better management for it. * Better filtering of job options in the job view for captioning jobs * Added qwen3 vl as a captioner for images * Have an indicator when a dataset is being captioned. * Adjust the way caption jobs look in the queue * Fix a few issues. Adjust defaults. * Version bump * Added ace step to the readme.	2026-04-09 15:02:03 -06:00
Jaret Burkett	da2a79590f	Add a merge network on save strength	2026-04-01 09:21:08 -06:00
Jaret Burkett	4a3251640a	More work on compiling models	2026-03-31 12:11:56 -06:00
Jaret Burkett	171535833a	Add Mac OS support for Apple Silicon (#770) * Made an install script and auto updates env for mac * GPU sensors and initial training working for MAC. Still WIP. * Switch dataloader to single threaded until I can work around some mac pickling issues. * Get quantization working on mac * Fix mac exclusive imports so they don't break other builds. * Add mac instructions to the UI	2026-03-30 09:37:47 -06:00
Jaret Burkett	6a1fc54779	Add t0 loss target	2026-03-28 13:35:21 -06:00
Jaret Burkett	7f3309b291	Add support for auto frame count so datasets can have varying length videos. Various ltx 2.3 VAE optimizations such as removing tiling artifacts, and doing frame split encoding to reduce vram on encoding/decoding.	2026-03-24 12:20:09 -06:00
Jaret Burkett	0f075fc45e	Adjust signal amplification target. Allow signal amplification strength in config.	2026-03-22 08:30:13 -06:00
Jaret Burkett	dcd98dc0d5	Add signal amplification	2026-03-21 07:44:18 -06:00
Jaret Burkett	4909b809c7	Fixed issue with audio loss multiplier.	2026-03-10 15:16:09 -06:00
Jaret Burkett	06ef3d343a	add ability to use batch noise correction during training	2026-03-10 09:05:57 -06:00
Jaret Burkett	1c74ca5d22	Add audio_loss_multiplier to scale audio loss to larger values if desired.	2026-02-19 11:57:44 -07:00
Jaret Burkett	a055947d56	Add signal_correction_noise_scale to config to scale the signal correction strength	2026-02-07 12:04:21 -07:00
Jaret Burkett	454722cc97	Add signal correction noise	2026-02-07 09:49:55 -07:00
Jaret Burkett	5c37db04f9	Added ability to activate experimental blank stabilization during training to zero out latents with blank prompts.	2026-02-04 13:00:03 -07:00
Jaret Burkett	5b5aadadb8	Add LTX-2 Support (#644) * WIP, adding support for LTX2 * Training on images working * Fix loading comfy models * Handle converting and deconverting lora so it matches original format * Reworked ui to handle ltx and proper dataset default overwriting. * Update the way lokr saves so it is more compatible with comfy * Audio loading and synchronization/resampling is working * Add audio to training. Does it work? Maybe, still testing. * Fixed fps default issue for sound * Have ui set fps for accurate audio mapping on ltx * Added audio processing options to the ui for ltx * Clean up requirements	2026-01-13 04:55:30 -07:00
Jaret Burkett	87edca1b2b	Added initial support to initiate lora training from an existing lora	2025-12-22 12:49:15 -07:00
Jaret Burkett	ba00eea7d9	Add loss graph to the ui	2025-12-18 10:08:59 -07:00
Jaret Burkett	e6c5aead3b	Fix issue that prevented ramtorch layer offloading with z_image	2025-12-02 16:14:34 -07:00
Jaret Burkett	2e7b2d9926	Added Differential Guidance training target	2025-11-10 09:38:25 -07:00
Jaret Burkett	6f308fc46e	When doing guidance loss, make CFG zero an optional target instead of a forced one.	2025-11-04 09:16:15 -07:00
Jaret Burkett	ee206cfa18	Added blank prompt preservation	2025-10-22 14:55:13 -06:00
Jaret Burkett	0d8a33dc16	Offload ARA with the layer if doing layer offloading. Add support to offload the LoRA. Still needs optimizer support	2025-10-21 06:03:27 -06:00
Jaret Burkett	1f81bc4060	Fix issue where text encoder could be the wrong quantization and fail when using memory manager	2025-10-15 11:01:30 -06:00
Jaret Burkett	1bc6dee127	Change auto_memory to be layer_offloading and allow you to set the amount to unload	2025-10-10 13:12:32 -06:00
Jaret Burkett	4e5707854f	Initial support for RamTorch. Still a WIP	2025-10-05 13:03:26 -06:00
Jaret Burkett	3086a58e5b	git status	2025-10-01 14:12:17 -06:00
Jaret Burkett	3b1f7b0948	Allow user to set the attention backend. Add method to recover from the occasional OOM if it is a rare event. Still exit if it ooms 3 times in a row.	2025-09-27 08:56:15 -06:00
Jaret Burkett	1069dee0e4	Added ui support for multi control samples and datasets. Added qwen image edit 2509 to the ui	2025-09-25 11:10:02 -06:00
Jaret Burkett	454be0958a	Initial support for qwen image edit plus	2025-09-24 11:39:10 -06:00
Jaret Burkett	3cdf50cbfc	Merge pull request #426 from squewel/prior_reg Dataset-level prior regularization	2025-09-18 03:03:18 -06:00
max	e4ae97e790	add dataset-level distillation-style regularization	2025-09-18 01:11:19 +03:00
Jaret Burkett	218f673e3d	Added support for new concept slider training script to CLI and UI	2025-09-16 10:22:34 -06:00
Jaret Burkett	af6fdaaaf9	Add ability to train a full rank LoRA. (experimental)	2025-09-09 07:36:25 -06:00
Jaret Burkett	f699f4be5f	Add ability to set transparent color for control images	2025-09-02 11:08:44 -06:00
Jaret Burkett	85dcae6e2b	Set full size control images to default true	2025-09-02 10:30:42 -06:00
Jaret Burkett	7040d8d73b	Preparation for audio	2025-09-02 07:26:50 -06:00
Jaret Burkett	bf2700f7be	Initial support for finetuning qwen image. Will only work with caching for now, need to add controls everywhere.	2025-08-21 16:41:17 -06:00
Jaret Burkett	8ea2cf00f6	Added training to the ui. Still testing, but everything seems to be working.	2025-08-16 05:51:37 -06:00
Jaret Burkett	3413fa537f	Wan22 14b training is working, still need tons of testing and some bug fixes	2025-08-14 13:03:27 -06:00
Jaret Burkett	85bad57df3	Fix bug that would use EMA when set false	2025-08-13 11:39:40 -06:00
Jaret Burkett	77b10d884d	Add support for training with an accuracy recovery adapter with qwen image	2025-08-12 08:21:36 -06:00
Jaret Burkett	bb6db3d635	Added support for caching text embeddings. This is just initial support and will probably fail for some models. Still needs to be optimized	2025-08-07 10:27:55 -06:00
Jaret Burkett	5d8922fca2	Add ability to designate a dataset as i2v or t2v for models that support it	2025-08-06 09:29:47 -06:00
Jaret Burkett	77dc38a574	Some work on caching text embeddings	2025-07-26 09:22:04 -06:00
Jaret Burkett	f500b9f240	Add ability to do more advanced sample prompt objects to prepare for a UI rework on control images and other things.	2025-07-17 07:13:35 -06:00
Jaret Burkett	60ef2f1df7	Added support for FLUX.1-Kontext-dev	2025-06-26 15:24:37 -06:00
Jaret Burkett	f3eb1dff42	Add a config flag to trigger fast image size db builder. Add config flag to set unconditional prompt for guidance loss	2025-06-24 08:51:29 -06:00
Jaret Burkett	ba1274d99e	Added a guidance burning loss. Modified DFE to work with new model. Bug fixes	2025-06-23 08:38:27 -06:00
Jaret Burkett	8602470952	Updated diffusion feature extractor	2025-06-19 15:36:10 -06:00

253 Commits