layerdiffusion
|
95e16f7204
|
maintain loading related
1. revise model moving orders
2. less verbose printing
3. some misc minor speedups
4. some bnb related maintain
|
2024-08-29 19:05:48 -07:00 |
|
layerdiffusion
|
0abb6c4686
|
Second Attempt for #1502
|
2024-08-28 08:08:40 -07:00 |
|
layerdiffusion
|
14eac6f2cf
|
add a way to empty cuda cache on the fly
|
2024-08-22 10:06:39 -07:00 |
|
layerdiffusion
|
8a04293430
|
fix some gguf loras
|
2024-08-17 01:15:37 -07:00 |
|
layerdiffusion
|
2f0555f7dc
|
GPU Shared Async Swap for all GGUF/BNB
|
2024-08-16 08:45:17 -07:00 |
|
layerdiffusion
|
04e7f05769
|
speedup swap/loading of all quant types
|
2024-08-16 08:30:11 -07:00 |
|
layerdiffusion
|
cb889470ba
|
experimental LoRA support for NF4 Model
method may change later depending on result quality
|
2024-08-14 19:52:19 -07:00 |
|
layerdiffusion
|
bb28bc382b
|
turn off second compress by default
|
2024-08-13 21:08:37 -07:00 |
|
lllyasviel
|
cfa5242a75
|
forge 2.0.0
see also discussions
|
2024-08-10 19:24:19 -07:00 |
|
layerdiffusion
|
314832301a
|
upload files
|
2024-08-08 20:44:47 -07:00 |
|