Commit Graph

1046 Commits

Author SHA1 Message Date
Jaret Burkett
bfb373c8fa Prep for future breaking changes in newer versions of transformers library 2026-04-10 12:04:32 -06:00
Jaret Burkett
145144eee3 Fix issue with auto updating captions when captioning a dataset 2026-04-10 11:51:18 -06:00
Jaret Burkett
765a9d5b2e Add a download button to music samples in in the gear menu 2026-04-10 10:07:45 -06:00
Jaret Burkett
d08ea8318f Change default timestep type for ace step to linear 2026-04-09 17:39:00 -06:00
Jaret Burkett
78cf049c29 Add support for ACE-Step 1.5 and ACE-Step 1.5 XL. Also added dataset captioning through the UI. (#785)
* Base ace step 1.5 xl added. Generating, still wip on training and ui

* Base training code done

* Fix some issues with caching text embeddings. Update sample cards to show audio

* Fix issue with quantizing ace step

* Add album artwork to samples with waveform.

* Cleanup logs

* Add album art endpoint to speed up album art loading

* Made an make video with artwork script

* Make ui handle basic audio models. Make multi line adjustments to the editor and better syntax hilighting.

* Add prompt tagging system for special tagged models.

* prompt tagging processing for ui working.

* Moved default samples to a special file so we can add more when needed and they can be adjusted for a specific model

* Add a captioner job with music captioner that is prepped for use with the ui

* Add basit ui setup for captioning modal and handeling captioning jobs

* Starting captioning job from ui working. Still better management for it.

* Better filtering of job options in the job view for captioning jobs

* Added qwen3 vl as a captioner for images

* Have an indicator when a dataset is being captioned.

* Adjust the way caption jobs look in the queue

* Fix a few issues. Adjust defaults.

* Version bump

* Added ace step to the readme.
2026-04-09 15:02:03 -06:00
Jaret Burkett
9ca58e9aa2 Fixed offload and quantize order of ltx 2.3 text encoder. 2026-04-07 15:11:50 -06:00
Jaret Burkett
0dcbabf6af Fix merge nertwork ref 2026-04-01 10:38:31 -06:00
M. Hofer
f213e3b1e5 Fix FLUX2 Klein load-time VRAM spikes on low-memory GPUs. (#726)
Keep the transformer and Qwen text encoder off CUDA during initial load/quantization in low-VRAM mode so model startup avoids full-model OOM before offloading and quantization can take effect.

Co-authored-by: Cursor <cursoragent@cursor.com>
Co-authored-by: Jaret Burkett <jaretburkett@gmail.com>
2026-04-01 09:36:55 -06:00
Jaret Burkett
da2a79590f Add a merge network on save strength 2026-04-01 09:21:08 -06:00
Jaret Burkett
853ffaf207 Add light mode support. 2026-03-31 16:54:55 -06:00
Jaret Burkett
ad474e3d06 Update and reformat the readme 2026-03-31 12:31:05 -06:00
Jaret Burkett
4a3251640a More work on compiling models 2026-03-31 12:11:56 -06:00
Jaret Burkett
358d684f6f Move compiiling the model after accelerate manipulation 2026-03-31 09:52:27 -06:00
Jaret Burkett
0045260af7 Fix issue where compile true did not actually compile the model 2026-03-31 09:27:54 -06:00
Jaret Burkett
e22039e4aa Add more optimizers to the ui 2026-03-31 09:20:20 -06:00
Jaret Burkett
bf56217c37 Fixed issue where job would fail if DB is locked. 2026-03-31 09:10:33 -06:00
Jaret Burkett
dcb7f465ec Version Bump 2026-03-30 15:57:40 -06:00
Jaret Burkett
626d9674ea Add info about automated coding agent pull requests. So sick of them. 2026-03-30 11:02:12 -06:00
Jaret Burkett
b43ea6c2d3 Abort caption requests when they are not in view to tax the server less. 2026-03-30 10:46:25 -06:00
Jaret Burkett
a484e55d66 Rework dataset model and file dragging. Use single model for dragging, uploading, and selecting images. 2026-03-30 10:33:10 -06:00
Jaret Burkett
ac82ebd852 Dont try to list hidden files in datasets 2026-03-30 10:03:10 -06:00
Jaret Burkett
171535833a Add Mac OS support for Apple Silicon (#770)
* Made an install script and auto updates env for mac

* GPU sensors and initial training working for MAC. Still WIP.

* Switch dataloader to single threaded until I can work around some mac pickeling issues.

* Get quantization working on mac

* Fix mac exclusive imports so they don't break other builds.

* Add mac instructions to the UI
2026-03-30 09:37:47 -06:00
Jaret Burkett
bc47fd6755 Make a requirements base file to make it easier to maintain requirements across platforms. 2026-03-29 14:04:46 -06:00
Jaret Burkett
fbda10d088 Add a duplicate dataset function to the ui 2026-03-29 13:51:18 -06:00
Jaret Burkett
86dcf39eee Allow user to set a training seed via env vars for repeat result testing 2026-03-29 13:34:46 -06:00
Jaret Burkett
45e99664b9 Add icons to the top bar on the job page 2026-03-29 12:38:47 -06:00
Jaret Burkett
540659709d Improved the load time of dataset and sample images and videos by switching to streaming 2026-03-29 10:38:34 -06:00
Jaret Burkett
e030f4f2e0 Show the control images in the image viewer when clicked so they can be easily previewed for reference. 2026-03-29 10:00:54 -06:00
Jaret Burkett
affa411edc Fixed an issue where Flux.2 model VAE can be left offloaded to CPU when encoding control images while caching latents 2026-03-29 09:49:10 -06:00
Jaret Burkett
6a1fc54779 Add t0 loss target 2026-03-28 13:35:21 -06:00
Jaret Burkett
8302b21f8f Version Bump 2026-03-28 13:23:52 -06:00
willhsmit
20929b93df Fix onChange path for EMA Decay input (#695)
Changes to the EMA Decay input don't get preserved when switching back and forth between Advanced and Simple view. I believe the onChange is not writing it correctly here.
2026-03-28 13:02:32 -06:00
abionda-sc
4ef5cbe5bc Fixing bug where width and height are inverted for control image resizing (#707) 2026-03-28 13:00:32 -06:00
Rob Ballantyne
700c4b53d0 Pin timm==1.0.22 (#633)
* Pin timm==1.0.22

* Added timm version pinn to dgx

---------

Co-authored-by: Jaret Burkett <jaretburkett@gmail.com>
2026-03-28 12:52:41 -06:00
Rayane
ca72eb1515 Add 1328 native resolution for Qwen Image training (#749)
* Add 1328 native resolution for Qwen Image training

Qwen-Image and Qwen-Image-2512 have a native 1:1 resolution of 1328x1328
as documented in the official model card's aspect ratio table. Adding it
to the resolution buckets and UI allows training at the model's native
resolution for improved quality.

* Revert example config change (24GB OOM at 1328)
2026-03-28 12:09:15 -06:00
Jaret Burkett
5ce87fa48b Version bump 2026-03-27 20:26:31 -06:00
Jaret Burkett
740657e25e Improve dataset uploader. Upload the files one at a time instead of one huge chunk. Show progress for each file. 2026-03-27 09:26:22 -06:00
Jaret Burkett
f85bf065bf Use pooler embeddings for DFE v6 with dino v3 2026-03-27 07:02:07 -06:00
Jaret Burkett
a802014ec5 Update the torch versions in the README 2026-03-26 12:15:32 -06:00
Jaret Burkett
2782df02c3 Allow HF_HUB_ENABLE_HF_TRANSFER to be set via env variable 2026-03-26 10:45:49 -06:00
Jaret Burkett
2c8d2acdcb On jobs table, sort idle jobs by last updated so recent active ones are at the top 2026-03-26 10:33:17 -06:00
Jaret Burkett
9a77389653 On a new training job, or when editing one, load everything before allowing editing 2026-03-26 10:23:42 -06:00
Jaret Burkett
a7bb4ddb2c Work on loss graph. Add smoothed overlay. Allow user to hilite a secton of the graph to zoom into. 2026-03-26 10:05:09 -06:00
Jaret Burkett
401f7df425 Merge branch 'main' of github.com:ostris/ai-toolkit 2026-03-26 09:11:50 -06:00
Jaret Burkett
4df3b0463f Save job pid to the database and sing sigint to kill it when stopping so it stops immediatly. 2026-03-26 09:10:37 -06:00
科林 KELIN
489b194231 Fix CPU/CUDA device mismatch in Klein edit control image encoding (#742)
When training Klein models with a `control_path` (edit/kontext-style
paired datasets), `encode_image_refs()` returns tensors that reside on
the VAE's device (CPU, since the VAE weights are loaded via
`load_file(..., device="cpu")` and are never explicitly moved to the
training device).  Concatenating those CPU tensors with the training
latents (`packed_latents`) that live on CUDA raises:

    RuntimeError: Expected all tensors to be on the same device

Fix: move `img_cond_seq` and `img_cond_seq_ids` to the same device
(and dtype) as `img_input` / `img_input_ids` before concatenation.

Co-authored-by: HuangYuChuh <HuangYuChuh@users.noreply.github.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-25 11:45:38 -06:00
Jaret Burkett
89d2090962 Fixed race condition that would occasionally set the dataset path to the first one when editing a job 2026-03-25 11:22:42 -06:00
Jaret Burkett
3f7a3d8d87 Shorten stal action to 3 months 2026-03-25 10:42:18 -06:00
Jaret Burkett
45647c15d3 Added github actions to close stale issues automatically. Hopefully it doesnt break things 2026-03-25 10:25:56 -06:00
Jaret Burkett
899ee528f9 Update git ignore 2026-03-25 10:16:36 -06:00