Commit Graph

245 Commits

Author SHA1 Message Date
Jaret Burkett
1c96b95617 Fix issue where sometimes the transformer does not get loaded properly. 2025-08-14 14:24:41 -06:00
Jaret Burkett
3413fa537f Wan22 14b training is working, still need tons of testing and some bug fixes 2025-08-14 13:03:27 -06:00
Jaret Burkett
259d68d440 Added a flushg during sampling to prevent spikes on low vram qwen 2025-08-12 12:57:18 -06:00
Jaret Burkett
77b10d884d Add support for training with an accuracy recovery adapter with qwen image 2025-08-12 08:21:36 -06:00
Jaret Burkett
4ad18f3d00 Clip max token embeddings to the max rope length for qwen image to solve for an error for super long captions > 1024 2025-08-10 08:44:41 -06:00
Jaret Burkett
f0105c33a7 Fixed issue that sometimes happens in qwen image where text seq length is wrong 2025-08-09 16:33:05 -06:00
Jaret Burkett
bb6db3d635 Added support for caching text embeddings. This is just initial support and will probably fail for some models. Still needs to be ompimized 2025-08-07 10:27:55 -06:00
Jaret Burkett
4c4a10d439 Remove vision model from qwen text encoder as it is not needed for image generation currently 2025-08-06 11:40:02 -06:00
Jaret Burkett
14ccf2f3ce Refactor qwen5b model code to be qwen 5b specific 2025-08-06 10:54:56 -06:00
Jaret Burkett
5d8922fca2 Add ability to designate a dataset as i2v or t2v for models that support it 2025-08-06 09:29:47 -06:00
Jaret Burkett
93202c7a2b Training working for Qwen Image 2025-08-04 21:14:30 +00:00
Jaret Burkett
9da8b5408e Initial but untested support for qwen_image 2025-08-04 13:29:37 -06:00
Jaret Burkett
a558d5b68f Move transformer back to device on aggresive wan 2.2 pipeline after generation. 2025-07-29 09:13:47 -06:00
Jaret Burkett
1d1199b15b Fix bug that prevented training wan 2.2 with batch size greater than 1 2025-07-29 09:06:25 -06:00
Jaret Burkett
ca7c5c950b Add support for Wan2.2 5B 2025-07-29 05:31:54 -06:00
Jaret Burkett
cefa2ca5fe Added initial support for Hidream E1 training 2025-07-27 15:12:56 -06:00
Daniel Verdu
a77ba5a089 fix: Guidance incorrect shape 2025-07-18 12:49:18 +02:00
Jaret Burkett
611969ec1f Allow control image for omnigen training and sampling 2025-07-09 13:54:55 -06:00
Jaret Burkett
bbb57de6ec Speed up omnigen TE loading 2025-07-05 09:32:00 -06:00
Jaret Burkett
5906a76666 Fixed issue with flux kontext forcing generation image sizes 2025-06-29 05:38:20 -06:00
Jaret Burkett
57a81bc0db Update base model version for kontext meta 2025-06-28 14:48:36 -06:00
Jaret Burkett
01a3c8a9b1 Fix device issue 2025-06-26 19:14:25 -06:00
Jaret Burkett
4f91cb7148 Fix issue with gradient checkpointing and flux kontext 2025-06-26 19:03:12 -06:00
Jaret Burkett
446b0b6989 Remove revision for kontext 2025-06-26 16:46:58 -06:00
Jaret Burkett
60ef2f1df7 Added support for FLUX.1-Kontext-dev 2025-06-26 15:24:37 -06:00
Jaret Burkett
8d9c47316a Work on mean flow. Minor bug fixes. Omnigen improvements 2025-06-26 13:46:20 -06:00
Jaret Burkett
84c6edca7e Merge branch 'main' into dev 2025-06-25 14:10:25 -06:00
Jaret Burkett
19ea8ecc38 Added support for finetuning OmniGen2. 2025-06-25 13:58:16 -06:00
Jaret Burkett
18513ec866 Merged in from main 2025-06-24 10:56:54 -06:00
Jaret Burkett
f3eb1dff42 Add a config flag to trigger fast image size db builder. Add config flag to set unconditional prompt for guidance loss 2025-06-24 08:51:29 -06:00
Jaret Burkett
ba1274d99e Added a guidance burning loss. Modified DFE to work with new model. Bug fixes 2025-06-23 08:38:27 -06:00
Jaret Burkett
8602470952 Updated diffusion feature extractor 2025-06-19 15:36:10 -06:00
Jaret Burkett
1cc663a664 Performance optimizations for pre processing the batch 2025-06-17 07:37:41 -06:00
Jaret Burkett
1c2b7298dd More work on mean flow loss. Moved it to an adapter. Still not functioning properly though. 2025-06-16 07:17:35 -06:00
Jaret Burkett
c0314ba325 Fixed some issues with training mean flow algo. Still testing WIP 2025-06-16 07:14:59 -06:00
Jaret Burkett
cbf04b8d53 Fixed some issues with training mean flow algo. Still testing WIP 2025-06-14 12:24:00 -06:00
Jaret Burkett
fc83eb7691 WIP on mean flow loss. Still a WIP. 2025-06-12 08:00:51 -06:00
Jaret Burkett
eefa93f16e Various code to support experiments. 2025-06-09 11:19:21 -06:00
Jaret Burkett
22cdfadab6 Added new timestep weighing strategy 2025-06-04 01:16:02 -06:00
Jaret Burkett
adc31ec77d Small updates and bug fixes for various things 2025-06-03 20:08:35 -06:00
Jaret Burkett
ffaf2f154a Fix issue with the way chroma handled gradient checkpointing. 2025-05-28 08:41:47 -06:00
Jaret Burkett
79bb9be92b Fix issue with saving chroma full finetune. 2025-05-28 07:42:30 -06:00
Jaret Burkett
79499fa795 Allow fine tuning pruned versions of chroma. Allow flash attention 2 for chroma if it is installed. 2025-05-21 07:02:50 -06:00
Jaret Burkett
e5181d23cd Added some experimental training techniques. Ignore for now. Still in testing. 2025-05-21 02:19:54 -06:00
Jaret Burkett
6174ba474e Fixed issue with chroma sampling 2025-05-10 18:30:23 +00:00
Jaret Burkett
43cb5603ad Added chroma model to the ui. Added logic to easily pull latest, use local, or use a specific version of chroma. Allow ustom name or path in the ui for custom models 2025-05-07 12:06:30 -06:00
Jaret Burkett
d9700bdb99 Added initial support for f-lite model 2025-05-01 11:15:18 -06:00
Jaret Burkett
5890e67a46 Various bug fixes 2025-04-29 09:30:33 -06:00
Jaret Burkett
add83df5cc Fixed issue with training hidream when batch size is larger than 1 2025-04-21 17:26:29 +00:00
Jaret Burkett
77001ee77f Upodate model tag on loras 2025-04-19 10:41:27 -06:00