Commit Graph

25 Commits

Author SHA1 Message Date
layerdiffusion
abd4d4d83d unify cast name 2024-07-30 08:42:51 -06:00
lllyasviel
056d4d0f91 fix for neveroom extension 2024-02-25 16:47:02 -08:00
lllyasviel
437c348926 Add build-in extension "NeverOOM"
see also discussions
2024-02-24 19:09:06 -08:00
lllyasviel
434ca2169f Add optimization --cuda-stream
See also the readme for more details
2024-02-24 14:00:48 -08:00
lllyasviel
79bdb78619 less verbose 2024-02-23 22:31:16 -08:00
lllyasviel
d508d8132f add cmd flag hints 2024-02-23 20:06:08 -08:00
lllyasviel
88f395091b add two optimizations
--pin-shared-memory and --cuda-malloc

See also the updates in Readme for more details
2024-02-23 18:39:32 -08:00
lllyasviel
54c89503eb Disable pin page
This is an emergency fix

GTX 1060/1050/1066 either does not have shared GPU page vram or have less than 2GB shared page vram - pinning any tensors larger than that will crash

Solution is still under investigation.
2024-02-23 16:43:08 -08:00
lllyasviel
2a7fb1be24 less verbose 2024-02-23 14:22:37 -08:00
lllyasviel
ad0ce480f9 always print time 2024-02-23 13:02:30 -08:00
lllyasviel
df12dde12e Rework unload system
Previous repeated loading (on cn or other extensions) is fixed. ControlNet saves about 0.7 to 1.1 seconds on my two device when batch count > 1. 

8GB VRAM can use SDXL at resolution 6144x6144 now, out of the box, without tiled diffusion or other things. 

(the max resolution on Automatic1111 txt2img UI is 2048 but one can highres fix to try 6144 or even 8192)
2024-02-23 12:58:09 -08:00
lllyasviel
19473b1a26 fix ci 2024-02-23 09:44:08 -08:00
lllyasviel
26c325296e rework memory computation for async loader (#377) 2024-02-23 09:24:39 -08:00
lllyasviel
eacb14e115 tune threshold based on more test devices
for async mover
2024-02-22 17:18:31 -08:00
lllyasviel
8283774b86 revise caster 2024-02-22 10:24:27 -08:00
lllyasviel
167dbc6411 safe value for new memory peak 2024-02-22 06:31:21 -08:00
lllyasviel
4080e25805 add todo note 2024-02-22 00:47:26 -08:00
lllyasviel
3d039591fe Expand head info in files
Previously before this commit, credits are already in entry and licenses are already in root. This commit will make info clearer.
2024-02-11 19:55:18 -08:00
lllyasviel
fb2e271668 support inpaint models from fooocus
put inpaint_v26.fooocus.patch in models\ControlNet, control SDXL models only
To get same algorithm as Fooocus, set "Stop at" (Ending Control Step) to 0.5
Fooocus always use 0.5 but in Forge users may use other values.
Results are best when stop at < 0.7. The model is not optimized with ending timesteps > 0.7
Supports inpaint_global_harmonious, inpaint_only, inpaint_only+lama.
In theory the inpaint_only+lama always outperform Fooocus in object removal task (but not all tasks).
2024-02-09 17:08:48 -08:00
Chenlei Hu
ac4a8820a5 Fix CQ tests (#141)
* Make test client run on cpu

* test on cpu

try fix device

try fix device

try fix device

* Use real SD1.5 model for testing

* ckpt nits

* Remove coverage calls
2024-02-09 16:33:05 -05:00
lllyasviel
40afb9dfb0 backend 2024-02-05 00:45:34 -08:00
lllyasviel
abcfd6bf97 backend 2024-02-04 15:40:02 -08:00
lllyasviel
071be046d2 backend 2024-01-31 09:46:24 -08:00
lllyasviel
3ba5754cc7 i 2024-01-18 04:40:29 -08:00
lllyasviel
7cb6178d47 ldm_patched 2024-01-13 21:03:58 -08:00