Commit Graph

19 Commits

Author SHA1 Message Date
layerdiffusion
12369669cf only load lora one time 2024-08-16 02:02:22 -07:00
layerdiffusion
f510f51303 speedup lora patching 2024-08-15 06:51:52 -07:00
layerdiffusion
141cf81c23 sometimes it is not diffusion model 2024-08-15 06:36:59 -07:00
layerdiffusion
021428da26 fix nf4 lora gives pure noise on some devices 2024-08-15 06:35:15 -07:00
layerdiffusion
3d751eb69f move file 2024-08-15 05:46:35 -07:00
layerdiffusion
1bd6cf0e0c Support LoRAs for Q8/Q5/Q4 GGUF Models
what a crazy night of math
2024-08-15 05:34:46 -07:00
layerdiffusion
d336597fa5 add note to lora
but loras for NF4 is done already!
2024-08-15 00:42:48 -07:00
layerdiffusion
59790f2cb4 simplify codes 2024-08-14 20:48:39 -07:00
layerdiffusion
4b66cf1126 fix possible OOM again 2024-08-14 20:45:58 -07:00
layerdiffusion
a29875206f Revert "simplify codes"
This reverts commit e7567efd4b.
2024-08-14 20:39:05 -07:00
layerdiffusion
e7567efd4b simplify codes 2024-08-14 20:34:02 -07:00
layerdiffusion
bbd0d76b28 fix possible oom 2024-08-14 20:27:05 -07:00
layerdiffusion
cb889470ba experimental LoRA support for NF4 Model
method may change later depending on result quality
2024-08-14 19:52:19 -07:00
layerdiffusion
70a5acd8ad doc 2024-08-14 19:12:02 -07:00
layerdiffusion
aff742b597 speed up lora using cuda profile 2024-08-14 19:09:35 -07:00
layerdiffusion
c73dd119be typo 2024-08-13 16:03:17 -07:00
layerdiffusion
88d0300883 add note 2024-08-13 16:02:40 -07:00
layerdiffusion
a0849953bd revise 2024-08-13 15:13:39 -07:00
layerdiffusion
d1b8a2676d rework lora and patching system
and dora etc - backend rework is 60% finished
And I also removed the webui’s extremely annoying lora filter from model versions.
2024-08-02 13:45:26 -07:00