stable-diffusion-webui-forge

mirror of https://github.com/lllyasviel/stable-diffusion-webui-forge.git synced 2026-02-24 08:43:57 +00:00

Files

layerdiffusion 82dfc2b15b Significantly speed up Q4_0, Q4_1, Q4_K

by precomputing all possible 4bit dequant into a lookup table and use pytorch indexing to get dequant, rather than really computing the bit operations.
This should give very similar performance to native CUDA kernels, while being LoRA friendly and more flexiable

2024-08-25 16:49:33 -07:00

comfyui_lora_collection

multiple lora implementation sources

2024-08-13 07:13:32 -07:00

gguf

Significantly speed up Q4_0, Q4_1, Q4_K

2024-08-25 16:49:33 -07:00

webui_lora_collection

multiple lora implementation sources

2024-08-13 07:13:32 -07:00

README.md

multiple lora implementation sources

2024-08-13 07:13:32 -07:00

README.md

Please follow the standard of 315f85d4f4/3rdparty when PR or modifying files.