ai-toolkit

mirror of https://github.com/ostris/ai-toolkit.git synced 2026-04-22 23:39:21 +00:00

Author	SHA1	Message	Date
Jaret Burkett	9ca58e9aa2	Fixed offload and quantize order of ltx 2.3 text encoder.	2026-04-07 15:11:50 -06:00
Jaret Burkett	dfde30f231	Fix issue with ltx2 custom te repo path	2026-03-25 09:50:18 -06:00
Jaret Burkett	7f3309b291	Add support for auto frame count so datasets can have varying length videos. Various ltx 2.3 VAE optimizations such as removing tiling artifacts, and doing frame split encoding to reduce vram on encoding/decoding.	2026-03-24 12:20:09 -06:00
Jaret Burkett	5642b656b9	Fix audio issues with ltx2 models. Silent codec failures are now raised. Auto convert surround sound audio to stereo. Invalidate old caches just to be safe so they recache now.	2026-03-23 20:08:33 +00:00
Jaret Burkett	561e6f201c	Fixed an issue with ltx 2.3 i2v training	2026-03-23 12:41:18 -06:00
Jaret Burkett	e91827f9be	Change gemma repo to lightricks one that is not gated	2026-03-23 11:00:32 -06:00
Jaret Burkett	253cb31362	Fix issue with video and images with no audio on ltx models	2026-03-22 22:09:23 -06:00
Jaret Burkett	4a3d317e2b	Fix issue with using the default text encoder with ltx 2.3	2026-03-22 18:53:59 -06:00
Jaret Burkett	859635e95b	Add support for training LTX 2.3 (#745) * Initial support for ltx 2.3. Still needs a lot of testing to make sure it is all right. * bump version * Handle lora renaming keys for new ltx 2.3 layers	2026-03-22 17:56:59 -06:00
Jaret Burkett	1ce2428722	Shrink text embeds to max token length for LTX-2. Drastically reduces cached text embedding sizes	2026-01-28 12:54:49 -07:00
Jaret Burkett	e40d7ac605	Ignore i2v on ltx if training on images	2026-01-14 18:46:27 -07:00
Jaret Burkett	9848de7946	Fix issue with ltx cached latents if there is no audio.	2026-01-14 17:27:01 -07:00
Jaret Burkett	73dedbf662	Do caching of latents, first frame and audio when caching latents for LTX2	2026-01-14 11:05:23 -07:00
Jaret Burkett	64fe29b182	Support img 2 vid training for ltx-2	2026-01-13 19:04:56 -07:00
Jaret Burkett	5b5aadadb8	Add LTX-2 Support (#644) * WIP, adding support for LTX2 * Training on images working * Fix loading comfy models * Handle converting and deconverting lora so it matches original format * Reworked ui to handle ltx and proper dataset default overwriting. * Update the way lokr saves so it is more compatible with comfy * Audio loading and synchronization/resampling is working * Add audio to training. Does it work? Maybe, still testing. * Fixed fps default issue for sound * Have ui set fps for accurate audio mapping on ltx * Added audio processing options to the ui for ltx * Clean up requirements	2026-01-13 04:55:30 -07:00

15 Commits