Update README.md

This commit is contained in:
turboderp
2023-09-12 06:52:04 +02:00
parent 546c91482e
commit 8e3cd01889

View File

@@ -56,11 +56,15 @@ models and various other finetunes. You can also provide a custom system prompt
### Installation
Clone the repository and run `python setup.py install --user`. (PyPi package is coming, be patient.)
To install as a library (not required for the included examples), clone the repository and run:
```
python setup.py install --user`.
```
ExLlamaV2 relies on a Torch C++ extension for its CUDA functions, which is compiled at runtime. This means the first
time the library is used it will take 10-20 seconds (depending on your hardware) to start, but the extension gets cached
for subsequent use.
for subsequent use. A PyPI package will evantually be available with an option to install a precompiled extension.
## EXL2 quantization