mirror of
https://github.com/turboderp-org/exllamav2.git
synced 2026-04-20 14:29:28 +00:00
Update README.md
@@ -56,11 +56,15 @@ models and various other finetunes. You can also provide a custom system prompt
 ### Installation
 
-Clone the repository and run `python setup.py install --user`. (PyPi package is coming, be patient.)
+To install as a library (not required for the included examples), clone the repository and run:
+
+```
+python setup.py install --user
+```
 
 ExLlamaV2 relies on a Torch C++ extension for its CUDA functions, which is compiled at runtime. This means the first
 time the library is used it will take 10-20 seconds (depending on your hardware) to start, but the extension gets cached
-for subsequent use.
+for subsequent use. A PyPI package will eventually be available with an option to install a precompiled extension.
 
 ## EXL2 quantization
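The install step described in the diff amounts to the following, assuming a working Python environment with PyTorch and a CUDA toolchain already set up (the repository URL is the mirror target named at the top of this page):

```shell
# One-time install from source. Note that the Torch C++ extension itself
# is compiled lazily on first use of the library, not during this step.
git clone https://github.com/turboderp-org/exllamav2.git
cd exllamav2
python setup.py install --user
```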