ai-toolkit

mirror of https://github.com/ostris/ai-toolkit.git synced 2026-05-12 00:42:07 +00:00

Author	SHA1	Message	Date
Jaret Burkett	171535833a	Add Mac OS support for Apple Silicon (#770 ) * Made an install script and auto updates env for mac * GPU sensors and initial training working for MAC. Still WIP. * Switch dataloader to single threaded until I can work around some mac pickeling issues. * Get quantization working on mac * Fix mac exclusive imports so they don't break other builds. * Add mac instructions to the UI	2026-03-30 09:37:47 -06:00
Jaret Burkett	bc47fd6755	Make a requirements base file to make it easier to maintain requirements across platforms.	2026-03-29 14:04:46 -06:00
Jaret Burkett	fbda10d088	Add a duplicate dataset function to the ui	2026-03-29 13:51:18 -06:00
Jaret Burkett	86dcf39eee	Allow user to set a training seed via env vars for repeat result testing	2026-03-29 13:34:46 -06:00
Jaret Burkett	45e99664b9	Add icons to the top bar on the job page	2026-03-29 12:38:47 -06:00
Jaret Burkett	540659709d	Improved the load time of dataset and sample images and videos by switching to streaming	2026-03-29 10:38:34 -06:00
Jaret Burkett	e030f4f2e0	Show the control images in the image viewer when clicked so they can be easily previewed for reference.	2026-03-29 10:00:54 -06:00
Jaret Burkett	affa411edc	Fixed an issue where Flux.2 model VAE can be left offloaded to CPU when encoding control images while caching latents	2026-03-29 09:49:10 -06:00
Jaret Burkett	6a1fc54779	Add t0 loss target	2026-03-28 13:35:21 -06:00
Jaret Burkett	8302b21f8f	Version Bump	2026-03-28 13:23:52 -06:00
willhsmit	20929b93df	Fix onChange path for EMA Decay input (#695 ) Changes to the EMA Decay input don't get preserved when switching back and forth between Advanced and Simple view. I believe the onChange is not writing it correctly here.	2026-03-28 13:02:32 -06:00
abionda-sc	4ef5cbe5bc	Fixing bug where width and height are inverted for control image resizing (#707 )	2026-03-28 13:00:32 -06:00
Rob Ballantyne	700c4b53d0	Pin timm==1.0.22 (#633 ) * Pin timm==1.0.22 * Added timm version pinn to dgx --------- Co-authored-by: Jaret Burkett <jaretburkett@gmail.com>	2026-03-28 12:52:41 -06:00
Rayane	ca72eb1515	Add 1328 native resolution for Qwen Image training (#749 ) * Add 1328 native resolution for Qwen Image training Qwen-Image and Qwen-Image-2512 have a native 1:1 resolution of 1328x1328 as documented in the official model card's aspect ratio table. Adding it to the resolution buckets and UI allows training at the model's native resolution for improved quality. * Revert example config change (24GB OOM at 1328)	2026-03-28 12:09:15 -06:00
Jaret Burkett	5ce87fa48b	Version bump	2026-03-27 20:26:31 -06:00
Jaret Burkett	740657e25e	Improve dataset uploader. Upload the files one at a time instead of one huge chunk. Show progress for each file.	2026-03-27 09:26:22 -06:00
Jaret Burkett	f85bf065bf	Use pooler embeddings for DFE v6 with dino v3	2026-03-27 07:02:07 -06:00
Jaret Burkett	a802014ec5	Update the torch versions in the README	2026-03-26 12:15:32 -06:00
Jaret Burkett	2782df02c3	Allow HF_HUB_ENABLE_HF_TRANSFER to be set via env variable	2026-03-26 10:45:49 -06:00
Jaret Burkett	2c8d2acdcb	On jobs table, sort idle jobs by last updated so recent active ones are at the top	2026-03-26 10:33:17 -06:00
Jaret Burkett	9a77389653	On a new training job, or when editing one, load everything before allowing editing	2026-03-26 10:23:42 -06:00
Jaret Burkett	a7bb4ddb2c	Work on loss graph. Add smoothed overlay. Allow user to hilite a secton of the graph to zoom into.	2026-03-26 10:05:09 -06:00
Jaret Burkett	401f7df425	Merge branch 'main' of github.com:ostris/ai-toolkit	2026-03-26 09:11:50 -06:00
Jaret Burkett	4df3b0463f	Save job pid to the database and sing sigint to kill it when stopping so it stops immediatly.	2026-03-26 09:10:37 -06:00
科林 KELIN	489b194231	Fix CPU/CUDA device mismatch in Klein edit control image encoding (#742 ) When training Klein models with a `control_path` (edit/kontext-style paired datasets), `encode_image_refs()` returns tensors that reside on the VAE's device (CPU, since the VAE weights are loaded via `load_file(..., device="cpu")` and are never explicitly moved to the training device). Concatenating those CPU tensors with the training latents (`packed_latents`) that live on CUDA raises: RuntimeError: Expected all tensors to be on the same device Fix: move `img_cond_seq` and `img_cond_seq_ids` to the same device (and dtype) as `img_input` / `img_input_ids` before concatenation. Co-authored-by: HuangYuChuh <HuangYuChuh@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-25 11:45:38 -06:00
Jaret Burkett	89d2090962	Fixed race condition that would occasionally set the dataset path to the first one when editing a job	2026-03-25 11:22:42 -06:00
Jaret Burkett	3f7a3d8d87	Shorten stal action to 3 months	2026-03-25 10:42:18 -06:00
Jaret Burkett	45647c15d3	Added github actions to close stale issues automatically. Hopefully it doesnt break things	2026-03-25 10:25:56 -06:00
Jaret Burkett	899ee528f9	Update git ignore	2026-03-25 10:16:36 -06:00
Jaret Burkett	5d5a8ef9da	Fixed issue with deleting datasets and jobs with newer version of node.js. Bumped minimum version of node js to 20	2026-03-25 10:04:36 -06:00
Jaret Burkett	dfde30f231	Fix issue with ltx2 custom te repo path	2026-03-25 09:50:18 -06:00
Jaret Burkett	b8000dbcbc	Bump version	2026-03-25 08:18:42 -06:00
Rodrigo Reis	54f4732c9b	Fix the bug in temporal_compression data loader (#754 )	2026-03-25 08:16:44 -06:00
Jaret Burkett	7f3309b291	Add support for audo frame count so datasets can have varrying length videos. Varous ltx 2.3 VAE optimizations such as removing tiling articacts, and doing frame split encoding to reduce vram on encoding/decoding.	2026-03-24 12:20:09 -06:00
Rasmus Lerdorf	4ad14d211a	Add an import config button (#733 )	2026-03-23 15:41:27 -06:00
Remix	7a0bbca5b1	Fix random_noise_multiplier (#738 ) Apply random_noise_multiplier to noise.	2026-03-23 15:22:16 -06:00
Rayane	99a4a5887b	Fix Qwen attention mask crash with diffusers >=0.37 (#748 ) * Fix Qwen Image mask handling * Fix Qwen attention mask crash with diffusers >=0.37 diffusers v0.37 (PR #12987) optimizes all-ones attention masks to None in encode_prompt() when there is no padding. This breaks ai-toolkit's Qwen extensions which call .to() on the mask unconditionally. Fix: reconstruct the all-ones mask at the boundary (get_prompt_embeds) right after encode_prompt() returns. This keeps the rest of the code unchanged and works with both old and new diffusers versions. Also removes redundant duplicate mask assignments in qwen_image_edit and qwen_image_edit_plus. Fixes #740	2026-03-23 14:43:08 -06:00
Jaret Burkett	295094b4b5	Fixed new breaking change in diffusers with with qwen image	2026-03-23 14:10:55 -06:00
Jaret Burkett	5642b656b9	Fix audio issues with ltx2 models. Silent codec fails now raised. Auto convert surround sound audio to stereo. Invalidate old caches just to be safe so they recache now.	2026-03-23 20:08:33 +00:00
Jaret Burkett	561e6f201c	Fixed an issue with ltx 2.3 i2v training	2026-03-23 12:41:18 -06:00
Jaret Burkett	330059d8a1	version bump	2026-03-23 11:01:16 -06:00
Jaret Burkett	e91827f9be	Change gemma repo to lightricks one that is not gated	2026-03-23 11:00:32 -06:00
Jaret Burkett	253cb31362	Fix issue with video and images with no audio on ltx models	2026-03-22 22:09:23 -06:00
Jaret Burkett	4a3d317e2b	Fix issue with using the default text encoder with ltx 2.3	2026-03-22 18:53:59 -06:00
Jaret Burkett	859635e95b	Add support for training LTX 2.3 (#745 ) * Initial support for ltx 2.3. Still needs a lot of testing to make sure it is all right. * bump version * Handle lora renaming keys for new ltx 2.3 layers	2026-03-22 17:56:59 -06:00
Jaret Burkett	7e1fdc3844	Remove the 0.1 floor for amplification	2026-03-22 09:01:58 -06:00
Jaret Burkett	0f075fc45e	Adjust signal amplification target. Allow signal amplification strength in config.	2026-03-22 08:30:13 -06:00
Jaret Burkett	dcd98dc0d5	Add signal amplification	2026-03-21 07:44:18 -06:00
Jaret Burkett	35b1cde3cb	Fixed issue on z-image that prevented training at a larger batch size	2026-03-10 15:43:25 -06:00
Jaret Burkett	4909b809c7	Fixed issue with audio loss multiplier.	2026-03-10 15:16:09 -06:00

1 2 3 4 5 ...

1025 Commits