Jaret Burkett
78cf049c29
Add support for ACE-Step 1.5 and ACE-Step 1.5 XL. Also added dataset captioning through the UI. ( #785 )
...
* Base ace step 1.5 xl added. Generating, still wip on training and ui
* Base training code done
* Fix some issues with caching text embeddings. Update sample cards to show audio
* Fix issue with quantizing ace step
* Add album artwork to samples with waveform.
* Cleanup logs
* Add album art endpoint to speed up album art loading
* Made a make-video-with-artwork script
* Make ui handle basic audio models. Make multi line adjustments to the editor and better syntax highlighting.
* Add prompt tagging system for special tagged models.
* prompt tagging processing for ui working.
* Moved default samples to a special file so we can add more when needed and they can be adjusted for a specific model
* Add a captioner job with music captioner that is prepped for use with the ui
* Add basic ui setup for captioning modal and handling captioning jobs
* Starting captioning job from ui working. Still needs better management for it.
* Better filtering of job options in the job view for captioning jobs
* Added qwen3 vl as a captioner for images
* Have an indicator when a dataset is being captioned.
* Adjust the way caption jobs look in the queue
* Fix a few issues. Adjust defaults.
* Version bump
* Added ace step to the readme.
2026-04-09 15:02:03 -06:00
Jaret Burkett
171535833a
Add Mac OS support for Apple Silicon ( #770 )
...
* Made an install script and auto updates env for mac
* GPU sensors and initial training working for Mac. Still WIP.
* Switch dataloader to single threaded until I can work around some mac pickling issues.
* Get quantization working on mac
* Fix mac exclusive imports so they don't break other builds.
* Add mac instructions to the UI
2026-03-30 09:37:47 -06:00
Rodrigo Reis
54f4732c9b
Fix the bug in temporal_compression data loader ( #754 )
2026-03-25 08:16:44 -06:00
Jaret Burkett
7f3309b291
Add support for auto frame count so datasets can have varying length videos. Various ltx 2.3 VAE optimizations such as removing tiling artifacts, and doing frame split encoding to reduce vram on encoding/decoding.
2026-03-24 12:20:09 -06:00
Jaret Burkett
5642b656b9
Fix audio issues with ltx2 models. Silent codec failures are now raised. Auto convert surround sound audio to stereo. Invalidate old caches just to be safe so they recache now.
2026-03-23 20:08:33 +00:00
Jaret Burkett
1ce2428722
Shrink text embeds to max token length for LTX-2. Drastically reduces cached text embedding sizes
2026-01-28 12:54:49 -07:00
Jaret Burkett
bf2700f7be
Initial support for finetuning qwen image. Will only work with caching for now, need to add controls everywhere.
2025-08-21 16:41:17 -06:00
Jaret Burkett
bb6db3d635
Added support for caching text embeddings. This is just initial support and will probably fail for some models. Still needs to be optimized
2025-08-07 10:27:55 -06:00
Jaret Burkett
77dc38a574
Some work on caching text embeddings
2025-07-26 09:22:04 -06:00
Jaret Burkett
e5ed450dc7
Allow finetuning tiny autoencoder in vae trainer
2025-07-16 07:13:30 -06:00
Jaret Burkett
96ba2fd129
Added methods to the dataloader to automatically generate controls for line, mask, inpainting, depth, and pose.
2025-04-09 13:35:04 -06:00
Jaret Burkett
3d131fb27a
Added a file signature check on the dataset size caching system to invalidate cached dimensions if the file changes.
2025-04-01 07:39:36 -06:00
Jaret Burkett
860d892214
Pixel shuffle adapter. Some bug fixes thrown in
2025-03-29 21:15:01 -06:00
Jaret Burkett
b829983b16
Added ability to load video datasets and train with them
2025-03-19 09:54:26 -06:00
Jaret Burkett
5e663746b8
Working multi gpu training. Still need a lot of tweaks and testing.
2025-01-25 16:46:20 -07:00
Jaret Burkett
bedb8197a2
Fixed issue with sizes for some images being loaded sideways resulting in squished images.
2024-10-20 11:51:29 -06:00
Jaret Burkett
0640cdf569
Handle errors in loading size database
2024-10-20 07:04:19 -06:00
Jaret Burkett
af108bb964
Bug fix with dataloader. Added a flag to completely disable sampling
2024-08-12 09:19:40 -06:00
Jaret Burkett
6d31c6db73
Added a fix for windows dataloader
2024-08-11 10:48:24 -06:00
Jaret Burkett
22d2f6e28f
Fixed issue with grad scaling
2024-07-20 08:21:57 -06:00
Jaret Burkett
58dffd43a8
Added caching to image sizes so we don't do it every time.
2024-07-15 19:07:41 -06:00
Jaret Burkett
b96913d73c
Improvements to dataloader
2024-04-27 09:28:28 -06:00
Jaret Burkett
5da3613e0b
Bug fixes and minor features
2024-04-25 06:14:31 -06:00
Jaret Burkett
b0a0f28191
Bug fixes
2024-02-25 08:28:29 -07:00
Jaret Burkett
1ae1017748
Bug fixes. Added ability to use l1 loss. Various other tests and improvements
2024-01-31 06:30:54 -07:00
Jaret Burkett
645b27f97a
Bug fixes with ip adapter training. Made a clip pre processor that can be trained with ip adapter to help augment the clip input to squeeze in more detail from a larger input. Moved clip processing to the dataloader for speed.
2024-01-04 12:59:38 -07:00
Jaret Burkett
0f8daa5612
Bug fixes, work on making IP adapters more customizable.
2023-12-24 08:32:39 -07:00
Jaret Burkett
1ee62562a4
differential guidance is WORKING (from what I can tell)
2023-11-07 19:24:12 -07:00
Jaret Burkett
07bf7bd7de
Allow augmentations and targeting different loss types from the config file
2023-10-18 03:04:57 -06:00
Jaret Burkett
38e441a29c
allow flipping for point of interest autocropping. allow num repeats. Fixed some bugs with new free u
2023-10-12 21:02:47 -06:00
Jaret Burkett
63ceffae24
Massive speed increases and ram optimizations
2023-10-10 06:07:55 -06:00
Jaret Burkett
f73402473b
Bug fixes. Added some functionality to help with private extensions
2023-10-05 07:09:34 -06:00
Jaret Burkett
579650eaf8
Fixed big issue with bucketing dataloader and added random cropping to a point of interest
2023-10-02 18:31:08 -06:00
Jaret Burkett
e5153d87c9
Fixed issues with dataloader bucketing. Allow using standard base image for t2i adapters.
2023-09-24 05:19:57 -06:00
Jaret Burkett
181f237a7b
added flipping x and y for dataset loader
2023-09-17 08:42:54 -06:00
Jaret Burkett
c698837241
Fixes to esrgan trainer. Moved logic for sd prompt embeddings out of diffusers pipeline so I can manipulate it
2023-09-16 17:41:07 -06:00
Jaret Burkett
ae70200d3c
Bug fixes, speed improvements, compatibility adjustments with diffusers updates
2023-09-13 07:03:53 -06:00
Jaret Burkett
d8d1e6fd1e
bug fixes
2023-09-12 18:48:39 -06:00
Jaret Burkett
d74dd636ee
Memory optimizations. Default to using cudamalloc when torch 2.0 for mem allocation
2023-09-12 04:30:23 -06:00
Jaret Burkett
e8583860ad
Upgraded to dev for t2i on diffusers. Minor migrations to make it work.
2023-09-11 14:46:06 -06:00
Jaret Burkett
34bfeba229
Massive speed increase. Added latent caching both to disk and to memory
2023-09-10 08:54:49 -06:00
Jaret Burkett
41a3f63b72
allow smaller images in buckets and bucket them
2023-09-10 03:43:02 -06:00
Jaret Burkett
33267e117c
Reworked bucket loader to scale buckets to pixel amounts not just minimum size. Makes the network more consistent
2023-08-30 14:52:12 -06:00
Jaret Burkett
714854ee86
Huge rework to move the batch to a DTO to make it far more modular to the future ui
2023-08-29 10:22:19 -06:00
Jaret Burkett
e866c75638
Built base interfaces for a DTO to handle batch information transports for the dataloader
2023-08-28 12:43:31 -06:00
Jaret Burkett
fd338e67bb
Fixed bug with dataloader not separating multiple datasets
2023-08-26 18:07:24 -06:00
Jaret Burkett
8105c05c12
Added bucketing capabilities to dataloader. Finally have full planned capability. noice
2023-08-26 16:36:32 -06:00
Jaret Burkett
7157c316af
Added support for training lora, dreambooth, and fine tuning. Still need testing and docs
2023-08-23 15:37:00 -06:00
Jaret Burkett
d298240cec
Tied in and tested TI script
2023-08-23 13:26:28 -06:00
Jaret Burkett
2e6c55c720
WIP creating textual inversion training script
2023-08-22 21:02:38 -06:00