stable-diffusion-webui-forge

mirror of https://github.com/lllyasviel/stable-diffusion-webui-forge.git synced 2026-02-27 18:23:57 +00:00

Author	SHA1	Message	Date
layerdiffusion	d38e560e42	Implement some rethinking about LoRA system. 1. Add an option to allow users to use UNet in fp8/gguf but lora in fp16. 2. All FP16 loras do not need patch. Others will only patch again when lora weight change. 3. FP8 unet + fp16 lora are available (somewhat only available) in Forge now. This also solves some “LoRA too subtle” problems. 4. Significantly speed up all gguf models (in Async mode) by using independent thread (CUDA stream) to compute and dequant at the same time, even when low-bit weights are already on GPU. 5. View “online lora” as a module similar to ControlLoRA so that it is moved to GPU together with model when sampling, achieving significant speedup and perfect low VRAM management simultaneously.	2024-08-19 04:31:59 -07:00
layerdiffusion	12369669cf	only load lora one time	2024-08-16 02:02:22 -07:00
layerdiffusion	f510f51303	speedup lora patching	2024-08-15 06:51:52 -07:00
layerdiffusion	141cf81c23	sometimes it is not diffusion model	2024-08-15 06:36:59 -07:00
layerdiffusion	021428da26	fix nf4 lora gives pure noise on some devices	2024-08-15 06:35:15 -07:00
layerdiffusion	3d751eb69f	move file	2024-08-15 05:46:35 -07:00
layerdiffusion	1bd6cf0e0c	Support LoRAs for Q8/Q5/Q4 GGUF Models. what a crazy night of math	2024-08-15 05:34:46 -07:00
layerdiffusion	d336597fa5	add note to lora but loras for NF4 is done already!	2024-08-15 00:42:48 -07:00
layerdiffusion	59790f2cb4	simplify codes	2024-08-14 20:48:39 -07:00
layerdiffusion	4b66cf1126	fix possible OOM again	2024-08-14 20:45:58 -07:00
layerdiffusion	a29875206f	Revert "simplify codes" This reverts commit `e7567efd4b`.	2024-08-14 20:39:05 -07:00
layerdiffusion	e7567efd4b	simplify codes	2024-08-14 20:34:02 -07:00
layerdiffusion	bbd0d76b28	fix possible oom	2024-08-14 20:27:05 -07:00
layerdiffusion	cb889470ba	experimental LoRA support for NF4 Model. method may change later depending on result quality	2024-08-14 19:52:19 -07:00
layerdiffusion	70a5acd8ad	doc	2024-08-14 19:12:02 -07:00
layerdiffusion	aff742b597	speed up lora using cuda profile	2024-08-14 19:09:35 -07:00
layerdiffusion	c73dd119be	typo	2024-08-13 16:03:17 -07:00
layerdiffusion	88d0300883	add note	2024-08-13 16:02:40 -07:00
layerdiffusion	a0849953bd	revise	2024-08-13 15:13:39 -07:00
layerdiffusion	d1b8a2676d	rework lora and patching system and dora etc - backend rework is 60% finished. And I also removed the webui’s extremely annoying lora filter from model versions.	2024-08-02 13:45:26 -07:00

20 Commits
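
Commit d38e560e42 distinguishes between patching LoRA weights into the model and treating an "online lora" as a separate module applied at sampling time. The sketch below is not taken from the Forge codebase. It only illustrates the standard LoRA identity that makes both approaches produce the same output, using plain Python floats in place of fp8/gguf/fp16 tensors. All function and variable names are hypothetical.

```python
# Hypothetical illustration of merged vs. "online" LoRA; not Forge's implementation.
# Plain Python lists stand in for tensors, and no quantization is modeled.

def matvec(m, v):
    """Multiply matrix m (list of rows) by vector v."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def matmul(a, b):
    """Multiply matrix a by matrix b."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]

def merged_forward(w, a, b, scale, x):
    """Patch the base weight first: W' = W + scale * (B @ A), then y = W' x."""
    delta = matmul(b, a)
    w_patched = [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]
    return matvec(w_patched, x)

def online_forward(w, a, b, scale, x):
    """Leave W untouched: y = W x + scale * B (A x)."""
    base = matvec(w, x)
    low_rank = matvec(b, matvec(a, x))
    return [y + scale * d for y, d in zip(base, low_rank)]

W = [[1.0, 2.0, 0.0], [0.0, 1.0, 3.0]]  # 2x3 base weight (assumed values)
A = [[1.0, 0.0, 2.0]]                   # 1x3 LoRA down-projection, rank 1
B = [[0.5], [1.0]]                      # 2x1 LoRA up-projection
x = [1.0, 2.0, 3.0]
scale = 2.0

y_merged = merged_forward(W, A, B, scale, x)
y_online = online_forward(W, A, B, scale, x)
print(y_merged, y_online)
```

In the online form the base weight is never rewritten, so in principle it can stay in a low-bit format while the LoRA factors stay in a higher-precision one. That matches the motivation stated in the commit message, though the actual mechanism in Forge may differ.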