15 Commits

Author SHA1 Message Date
Jaret Burkett
9ca58e9aa2 Fixed offload and quantize order of ltx 2.3 text encoder. 2026-04-07 15:11:50 -06:00
Jaret Burkett
dfde30f231 Fix issue with ltx2 custom te repo path 2026-03-25 09:50:18 -06:00
Jaret Burkett
7f3309b291 Add support for audo frame count so datasets can have varrying length videos. Varous ltx 2.3 VAE optimizations such as removing tiling articacts, and doing frame split encoding to reduce vram on encoding/decoding. 2026-03-24 12:20:09 -06:00
Jaret Burkett
5642b656b9 Fix audio issues with ltx2 models. Silent codec fails now raised. Auto convert surround sound audio to stereo. Invalidate old caches just to be safe so they recache now. 2026-03-23 20:08:33 +00:00
Jaret Burkett
561e6f201c Fixed an issue with ltx 2.3 i2v training 2026-03-23 12:41:18 -06:00
Jaret Burkett
e91827f9be Change gemma repo to lightricks one that is not gated 2026-03-23 11:00:32 -06:00
Jaret Burkett
253cb31362 Fix issue with video and images with no audio on ltx models 2026-03-22 22:09:23 -06:00
Jaret Burkett
4a3d317e2b Fix issue with using the default text encoder with ltx 2.3 2026-03-22 18:53:59 -06:00
Jaret Burkett
859635e95b Add support for training LTX 2.3 (#745)
* Initial support for ltx 2.3. Still needs a lot of testing to make sure it is all right.

* bump version

* Handle lora renaming keys for new ltx 2.3 layers
2026-03-22 17:56:59 -06:00
Jaret Burkett
1ce2428722 Shrink text embeds to max token length for LTX-2. Drastically reduces cached text embedding sizes 2026-01-28 12:54:49 -07:00
Jaret Burkett
e40d7ac605 Ignore i2v on ltx is training on images 2026-01-14 18:46:27 -07:00
Jaret Burkett
9848de7946 Fix issue with ltx cached latents if there is no audio. 2026-01-14 17:27:01 -07:00
Jaret Burkett
73dedbf662 Do caching of latents, first frame and audio when caching latents for LTX2 2026-01-14 11:05:23 -07:00
Jaret Burkett
64fe29b182 Support img 2 vid training for ltx-2 2026-01-13 19:04:56 -07:00
Jaret Burkett
5b5aadadb8 Add LTX-2 Support (#644)
* WIP, adding support for LTX2

* Training on images working

* Fix loading comfy models

* Handle converting and deconverting lora so it matches original format

* Reworked ui to habdle ltx and propert dataset default overwriting.

* Update the way lokr saves to it is more compatable with comfy

* Audio loading and synchronization/resampling is working

* Add audio to training. Does it work? Maybe, still testing.

* Fixed fps default issue for sound

* Have ui set fps for accurate audio mapping on ltx

* Added audio procession options to the ui for ltx

* Clean up requirements
2026-01-13 04:55:30 -07:00