
1025 Commits

Author SHA1 Message Date
Jaret Burkett
171535833a Add Mac OS support for Apple Silicon (#770)
* Made an install script and auto updates env for mac

* GPU sensors and initial training working for MAC. Still WIP.

* Switch dataloader to single threaded until I can work around some mac pickling issues.

* Get quantization working on mac

* Fix mac exclusive imports so they don't break other builds.

* Add mac instructions to the UI
2026-03-30 09:37:47 -06:00
Jaret Burkett
bc47fd6755 Make a requirements base file to make it easier to maintain requirements across platforms. 2026-03-29 14:04:46 -06:00
Jaret Burkett
fbda10d088 Add a duplicate dataset function to the ui 2026-03-29 13:51:18 -06:00
Jaret Burkett
86dcf39eee Allow user to set a training seed via env vars for repeat result testing 2026-03-29 13:34:46 -06:00
Jaret Burkett
45e99664b9 Add icons to the top bar on the job page 2026-03-29 12:38:47 -06:00
Jaret Burkett
540659709d Improved the load time of dataset and sample images and videos by switching to streaming 2026-03-29 10:38:34 -06:00
Jaret Burkett
e030f4f2e0 Show the control images in the image viewer when clicked so they can be easily previewed for reference. 2026-03-29 10:00:54 -06:00
Jaret Burkett
affa411edc Fixed an issue where Flux.2 model VAE can be left offloaded to CPU when encoding control images while caching latents 2026-03-29 09:49:10 -06:00
Jaret Burkett
6a1fc54779 Add t0 loss target 2026-03-28 13:35:21 -06:00
Jaret Burkett
8302b21f8f Version Bump 2026-03-28 13:23:52 -06:00
willhsmit
20929b93df Fix onChange path for EMA Decay input (#695)
Changes to the EMA Decay input don't get preserved when switching back and forth between Advanced and Simple view. I believe the onChange is not writing it correctly here.
2026-03-28 13:02:32 -06:00
abionda-sc
4ef5cbe5bc Fixing bug where width and height are inverted for control image resizing (#707) 2026-03-28 13:00:32 -06:00
Rob Ballantyne
700c4b53d0 Pin timm==1.0.22 (#633)
* Pin timm==1.0.22

* Added timm version pin to dgx

---------

Co-authored-by: Jaret Burkett <jaretburkett@gmail.com>
2026-03-28 12:52:41 -06:00
Rayane
ca72eb1515 Add 1328 native resolution for Qwen Image training (#749)
* Add 1328 native resolution for Qwen Image training

Qwen-Image and Qwen-Image-2512 have a native 1:1 resolution of 1328x1328
as documented in the official model card's aspect ratio table. Adding it
to the resolution buckets and UI allows training at the model's native
resolution for improved quality.

* Revert example config change (24GB OOM at 1328)
2026-03-28 12:09:15 -06:00
Jaret Burkett
5ce87fa48b Version bump 2026-03-27 20:26:31 -06:00
Jaret Burkett
740657e25e Improve dataset uploader. Upload the files one at a time instead of one huge chunk. Show progress for each file. 2026-03-27 09:26:22 -06:00
Jaret Burkett
f85bf065bf Use pooler embeddings for DFE v6 with dino v3 2026-03-27 07:02:07 -06:00
Jaret Burkett
a802014ec5 Update the torch versions in the README 2026-03-26 12:15:32 -06:00
Jaret Burkett
2782df02c3 Allow HF_HUB_ENABLE_HF_TRANSFER to be set via env variable 2026-03-26 10:45:49 -06:00
Jaret Burkett
2c8d2acdcb On jobs table, sort idle jobs by last updated so recent active ones are at the top 2026-03-26 10:33:17 -06:00
Jaret Burkett
9a77389653 On a new training job, or when editing one, load everything before allowing editing 2026-03-26 10:23:42 -06:00
Jaret Burkett
a7bb4ddb2c Work on loss graph. Add smoothed overlay. Allow user to highlight a section of the graph to zoom into. 2026-03-26 10:05:09 -06:00
Jaret Burkett
401f7df425 Merge branch 'main' of github.com:ostris/ai-toolkit 2026-03-26 09:11:50 -06:00
Jaret Burkett
4df3b0463f Save job pid to the database and send SIGINT to kill it when stopping so it stops immediately. 2026-03-26 09:10:37 -06:00
科林 KELIN
489b194231 Fix CPU/CUDA device mismatch in Klein edit control image encoding (#742)
When training Klein models with a `control_path` (edit/kontext-style
paired datasets), `encode_image_refs()` returns tensors that reside on
the VAE's device (CPU, since the VAE weights are loaded via
`load_file(..., device="cpu")` and are never explicitly moved to the
training device).  Concatenating those CPU tensors with the training
latents (`packed_latents`) that live on CUDA raises:

    RuntimeError: Expected all tensors to be on the same device

Fix: move `img_cond_seq` and `img_cond_seq_ids` to the same device
(and dtype) as `img_input` / `img_input_ids` before concatenation.

Co-authored-by: HuangYuChuh <HuangYuChuh@users.noreply.github.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-25 11:45:38 -06:00
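The fix described in the commit above can be illustrated with a minimal sketch. It is not the repository's code: the helper name `concat_on_same_device`, the tensor shapes, and the choice of concatenating along `dim=1` are assumptions made for illustration. Only the idea of moving `img_cond_seq` to the device and dtype of `img_input` before concatenation comes from the commit message.

```python
import torch


def concat_on_same_device(img_input: torch.Tensor, img_cond_seq: torch.Tensor) -> torch.Tensor:
    """Hypothetical helper: align device and dtype, then concatenate.

    The sequence dimension (dim=1) is an assumption for this sketch.
    """
    # Move the conditioning tokens to wherever the training latents live,
    # and match their dtype, so torch.cat does not raise a device mismatch.
    img_cond_seq = img_cond_seq.to(device=img_input.device, dtype=img_input.dtype)
    return torch.cat([img_input, img_cond_seq], dim=1)


# Training latents on the training device (falls back to CPU without CUDA).
device = "cuda" if torch.cuda.is_available() else "cpu"
img_input = torch.randn(2, 16, 8, device=device, dtype=torch.bfloat16)

# Conditioning tokens as they would come from a CPU-resident VAE (float32).
img_cond_seq = torch.randn(2, 4, 8)

combined = concat_on_same_device(img_input, img_cond_seq)
```

On a CUDA machine, calling `torch.cat` on the two tensors without the `.to(...)` step reproduces the `RuntimeError` quoted in the commit message; on a CPU-only machine only the dtype alignment is exercised.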
Jaret Burkett
89d2090962 Fixed race condition that would occasionally set the dataset path to the first one when editing a job 2026-03-25 11:22:42 -06:00
Jaret Burkett
3f7a3d8d87 Shorten stale action to 3 months 2026-03-25 10:42:18 -06:00
Jaret Burkett
45647c15d3 Added GitHub Actions to close stale issues automatically. Hopefully it doesn't break things 2026-03-25 10:25:56 -06:00
Jaret Burkett
899ee528f9 Update git ignore 2026-03-25 10:16:36 -06:00
Jaret Burkett
5d5a8ef9da Fixed issue with deleting datasets and jobs with newer version of node.js. Bumped minimum version of node js to 20 2026-03-25 10:04:36 -06:00
Jaret Burkett
dfde30f231 Fix issue with ltx2 custom te repo path 2026-03-25 09:50:18 -06:00
Jaret Burkett
b8000dbcbc Bump version 2026-03-25 08:18:42 -06:00
Rodrigo Reis
54f4732c9b Fix the bug in temporal_compression data loader (#754) 2026-03-25 08:16:44 -06:00
Jaret Burkett
7f3309b291 Add support for auto frame count so datasets can have varying-length videos. Various LTX 2.3 VAE optimizations, such as removing tiling artifacts and doing frame-split encoding to reduce VRAM on encoding/decoding. 2026-03-24 12:20:09 -06:00
Rasmus Lerdorf
4ad14d211a Add an import config button (#733) 2026-03-23 15:41:27 -06:00
Remix
7a0bbca5b1 Fix random_noise_multiplier (#738)
Apply random_noise_multiplier to noise.
2026-03-23 15:22:16 -06:00
Rayane
99a4a5887b Fix Qwen attention mask crash with diffusers >=0.37 (#748)
* Fix Qwen Image mask handling

* Fix Qwen attention mask crash with diffusers >=0.37

diffusers v0.37 (PR #12987) optimizes all-ones attention masks to None
in encode_prompt() when there is no padding. This breaks ai-toolkit's
Qwen extensions which call .to() on the mask unconditionally.

Fix: reconstruct the all-ones mask at the boundary (get_prompt_embeds)
right after encode_prompt() returns. This keeps the rest of the code
unchanged and works with both old and new diffusers versions.

Also removes redundant duplicate mask assignments in qwen_image_edit
and qwen_image_edit_plus.

Fixes #740
2026-03-23 14:43:08 -06:00
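A minimal sketch of the boundary fix described in the commit above, under stated assumptions: the helper name `ensure_attention_mask`, the `(batch, seq_len, dim)` embedding layout, and the `torch.long` mask dtype are illustrative choices, not taken from ai-toolkit or diffusers. The commit message only says that a `None` mask is replaced by an all-ones mask right after `encode_prompt()` returns.

```python
from typing import Optional

import torch


def ensure_attention_mask(
    prompt_embeds: torch.Tensor,
    attention_mask: Optional[torch.Tensor],
) -> torch.Tensor:
    """Hypothetical helper: rebuild an all-ones mask when it comes back as None."""
    if attention_mask is None:
        # No padding means every token is valid, so the mask is all ones
        # with one entry per (batch, token) position.
        attention_mask = torch.ones(
            prompt_embeds.shape[:2],
            dtype=torch.long,  # assumed dtype for this sketch
            device=prompt_embeds.device,
        )
    return attention_mask


# Stand-in for embeddings returned by a text encoder: batch=2, seq_len=5, dim=8.
embeds = torch.zeros(2, 5, 8)

# Simulates the newer behavior, where the mask is optimized away to None.
mask = ensure_attention_mask(embeds, None)

# Downstream code can now call .to() on the mask unconditionally.
mask = mask.to(embeds.device)
```

Normalizing the mask once at the boundary means the rest of the code never has to check for `None`, which is why the same downstream path works with both older and newer diffusers versions.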
Jaret Burkett
295094b4b5 Fixed new breaking change in diffusers with qwen image 2026-03-23 14:10:55 -06:00
Jaret Burkett
5642b656b9 Fix audio issues with ltx2 models. Silent codec failures are now raised. Auto convert surround sound audio to stereo. Invalidate old caches just to be safe so they recache now. 2026-03-23 20:08:33 +00:00
Jaret Burkett
561e6f201c Fixed an issue with ltx 2.3 i2v training 2026-03-23 12:41:18 -06:00
Jaret Burkett
330059d8a1 version bump 2026-03-23 11:01:16 -06:00
Jaret Burkett
e91827f9be Change gemma repo to lightricks one that is not gated 2026-03-23 11:00:32 -06:00
Jaret Burkett
253cb31362 Fix issue with video and images with no audio on ltx models 2026-03-22 22:09:23 -06:00
Jaret Burkett
4a3d317e2b Fix issue with using the default text encoder with ltx 2.3 2026-03-22 18:53:59 -06:00
Jaret Burkett
859635e95b Add support for training LTX 2.3 (#745)
* Initial support for ltx 2.3. Still needs a lot of testing to make sure it is all right.

* bump version

* Handle lora renaming keys for new ltx 2.3 layers
2026-03-22 17:56:59 -06:00
Jaret Burkett
7e1fdc3844 Remove the 0.1 floor for amplification 2026-03-22 09:01:58 -06:00
Jaret Burkett
0f075fc45e Adjust signal amplification target. Allow signal amplification strength in config. 2026-03-22 08:30:13 -06:00
Jaret Burkett
dcd98dc0d5 Add signal amplification 2026-03-21 07:44:18 -06:00
Jaret Burkett
35b1cde3cb Fixed issue on z-image that prevented training at a larger batch size 2026-03-10 15:43:25 -06:00
Jaret Burkett
4909b809c7 Fixed issue with audio loss multiplier. 2026-03-10 15:16:09 -06:00