Jaret Burkett
6021a3dbc0
Change inpainting mask to zero out on latents instead of image for inpaint area.
2025-03-24 14:16:52 -06:00
Jaret Burkett
45be82d5d6
Handle inpainting training for control_lora adapter
2025-03-24 13:17:47 -06:00
Jaret Burkett
f10937e6da
Handle multi control inputs for control lora training
2025-03-23 07:37:08 -06:00
Jaret Burkett
f5aa4232fa
Added ability to quantize with torchao
2025-03-20 16:28:54 -06:00
Jaret Burkett
b829983b16
Added ability to load video datasets and train with them
2025-03-19 09:54:26 -06:00
Jaret Burkett
3812957bc9
Added ability to train control loras. Other important bug fixes thrown in
2025-03-14 18:03:00 -06:00
Jaret Burkett
386e68a422
Fixed a bug that changes all samples to webp
2025-03-08 18:02:56 -07:00
Jaret Burkett
391cf80fea
Added training for Wan2.1. Not finalized, wait.
2025-03-07 13:53:44 -07:00
Jaret Burkett
6f6fb90812
Added cogview4. Loss still needs work.
2025-03-04 18:43:52 -07:00
Jaret Burkett
8bb47d1bfe
Merge branch 'main' into wan21
2025-03-04 00:31:57 -07:00
Jaret Burkett
7ae31c9ae9
Added LoKr to the ui
2025-03-02 08:49:01 -07:00
Jaret Burkett
b16819f8e7
Added LoKr support
2025-03-02 06:57:50 -07:00
Jaret Burkett
acc79956aa
WIP create new class to add new models more easily
2025-03-01 13:49:02 -07:00
Jaret Burkett
f6e16e582a
Added Differential Output Preservation Loss to trainer and ui
2025-02-25 20:12:36 -07:00
Jaret Burkett
b366e46f1c
Added more settings to the training config
2025-02-23 12:34:52 -07:00
Jaret Burkett
4af6c5cf30
Work on supporting flex.2 potential arch
2025-02-17 14:10:25 -07:00
Jaret Burkett
87e557cf1e
Bug fixes and improvements to llmadapter
2025-02-15 07:18:07 -07:00
Jaret Burkett
bd8d7dc081
fixed various issues with llm attention masking. Added block training on the llm adapter.
2025-02-14 11:24:01 -07:00
Jaret Burkett
2622de1e01
DFE tweaks. Adding support for more llms as text encoders
2025-02-13 04:31:49 -07:00
Jaret Burkett
0b8a32def7
merged in lumina2 branch
2025-02-12 09:33:03 -07:00
Jaret Burkett
787bb37e76
Small fixes for DFE, polar guidance, and other things
2025-02-12 09:27:44 -07:00
Jaret Burkett
d138f07365
Initial lumina3 support
2025-02-08 10:59:53 -07:00
Jaret Burkett
c6d8eedb94
Added ability to use consistent noise for each image in a dataset by hashing the path and using that as a seed.
2025-02-08 07:13:48 -07:00
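A minimal sketch of the idea in the commit above (deriving a per-image seed by hashing the file path), not the repository's actual code. The helper names `seed_from_path` and `noise_for_path` are hypothetical, and the standard-library `random` module stands in for whatever tensor noise generator the trainer uses. `hashlib` is used because Python's built-in `hash()` on strings is salted per process and would not give a stable seed across runs.

```python
import hashlib
import random


def seed_from_path(path: str) -> int:
    """Derive a stable 64-bit seed from a file path (hypothetical helper)."""
    digest = hashlib.sha256(path.encode("utf-8")).digest()
    # Use the first 8 bytes of the digest as an unsigned integer seed.
    return int.from_bytes(digest[:8], "big")


def noise_for_path(path: str, n: int) -> list[float]:
    """Return n Gaussian samples that are always the same for a given path."""
    rng = random.Random(seed_from_path(path))
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


if __name__ == "__main__":
    first = noise_for_path("dataset/cat_001.png", 4)
    second = noise_for_path("dataset/cat_001.png", 4)
    other = noise_for_path("dataset/dog_002.png", 4)
    print(first == second)  # same path, same noise
    print(first == other)   # a different path gives different noise
```

With a scheme like this, every epoch sees the same noise for a given image, regardless of loading order.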
Jaret Burkett
216ab164ce
Experimental features and bug fixes
2025-02-04 13:36:34 -07:00
Jaret Burkett
04abe57c76
Added weighting to DFE
2025-01-22 08:50:57 -07:00
Jaret Burkett
89dd041b97
Added ability to pair samples with a closer noise with optimal_noise_pairing_samples
2025-01-21 18:30:10 -07:00
Jaret Burkett
29122b1a54
Added code to handle diffusion feature extraction loss
2025-01-21 14:21:34 -07:00
Jaret Burkett
fadb2f3a76
Allow quantizing the te independently on flux. added lognorm_blend timestep schedule
2025-01-18 18:02:31 -07:00
Jaret Burkett
4723f23c0d
Added ability to split up flux across gpus (experimental). Changed the way timestep scheduling works to prep for more specific schedules.
2024-12-31 07:06:55 -07:00
Jaret Burkett
8ef07a9c36
Added training for an experimental decorator embedding. Allow for turning off guidance embedding on flux (for unreleased model). Various bug fixes and modifications
2024-12-15 08:59:27 -07:00
Jaret Burkett
96d418bb95
Added support for full finetuning flux with randomized param activation. Examples coming soon
2024-11-21 13:05:32 -07:00
Jaret Burkett
4aa19b5c1d
Only quantize flux T5 if also quantizing model. Load TE from original name and path if fine tuning.
2024-10-29 14:25:31 -06:00
Jaret Burkett
22cd40d7b9
Improvements for full tuning flux. Added debugging launch config for vscode
2024-10-29 04:54:08 -06:00
Jaret Burkett
9f94c7b61e
Added experimental param multiplier to the ema module
2024-10-22 09:25:52 -06:00
Jaret Burkett
3922981996
Added some additional experimental things to the vision direct encoder
2024-10-10 19:42:26 +00:00
Jaret Burkett
a800c9d19e
Add a method to have an inference only lora
2024-10-04 10:06:53 -06:00
Jaret Burkett
67e0aca750
Added ability to load clip pairs randomly from folder. Other small bug fixes
2024-10-03 10:03:49 -06:00
Jaret Burkett
e4c82803e1
Handle random resizing for pixtral input on direct vision adapter
2024-09-28 14:53:38 -06:00
Jaret Burkett
58537fc92b
Added initial direct vision pixtral support
2024-09-28 10:47:51 -06:00
Jaret Burkett
04424fe2d6
Added config setting to set the timestep type
2024-09-24 06:53:59 -06:00
Jaret Burkett
2776221497
Added option to cache empty prompt or trigger and unload text encoders while training
2024-09-21 20:54:09 -06:00
Plat
79b4e04b80
Feat: Wandb logging (#95)
* wandb logging
* fix: start logging before train loop
* chore: add wandb dir to gitignore
* fix: wrap wandb functions
* fix: forget to send last samples
* chore: use valid type
* chore: use None when not type-checking
* chore: resolved complicated logic
* fix: follow log_every
---------
Co-authored-by: Plat <github@p1at.dev>
Co-authored-by: Jaret Burkett <jaretburkett@gmail.com>
2024-09-19 20:01:01 -06:00
Jaret Burkett
951e223481
Added support to disable single transformers in vision direct adapter
2024-09-11 08:54:51 -06:00
Jaret Burkett
121a760c19
Added proper grad accumulation
2024-09-03 07:24:18 -06:00
Jaret Burkett
e5fadddd45
Added ability to do prompt attn masking for flux
2024-09-02 17:29:36 -06:00
Jaret Burkett
d44d4eb61a
Added a new experimental linear weighting technique
2024-09-02 09:22:13 -06:00
apolinário
4d35a29c97
Add push_to_hub to the trainer (#109)
* add push_to_hub
* fix indentation
* indent again
* model_config
* allow samples to not exist
* repo creation fix
* dont show empty [] if widget doesnt exist
* dont submit the config and optimizer
* Unsafe to have tokens saved in the yaml file
* make sure to catch only the latest samples
* change name to slug
* formatting
* formatting
---------
Co-authored-by: multimodalart <joaopaulo.passos+multimodal@gmail.com>
2024-08-22 21:18:56 -06:00
Jaret Burkett
00bd3d54a3
Actually use the save dtype from the config file.
2024-08-13 17:08:27 -06:00
Jaret Burkett
af108bb964
Bug fix with dataloader. Added a flag to completely disable sampling
2024-08-12 09:19:40 -06:00
Jaret Burkett
a6aa4b2c7d
Added ability to set timesteps to linear for flowmatching schedule
2024-08-11 13:06:08 -06:00