Commit Graph

296 Commits

Author SHA1 Message Date
Georgi Gerganov
eed44b2875 Update hot topics to mention Alpaca support 2023-03-19 19:51:55 +02:00
Georgi Gerganov
b3bd91ce9d Add instruction for using Alpaca (#240) 2023-03-19 18:49:50 +02:00
Pavol Rusnak
f9cb6f8979 Fix typo in readme 2023-03-18 23:18:04 +01:00
Pavol Rusnak
cc4ace10bf Add note about Python 3.11 to readme 2023-03-18 22:25:35 +01:00
Pavol Rusnak
0bfb4f160f Add memory/disk requirements to readme 2023-03-18 22:25:35 +01:00
Georgi Gerganov
b57c1e4295 Update Contributing section 2023-03-17 20:30:04 +02:00
Stephan Walter
45113b2f42 Don't tell users to use a bad number of threads (#243)
The readme tells people to use the command line option "-t 8", causing 8
threads to be started. On systems with fewer than 8 cores, this causes a
significant slowdown. Remove the option from the example command lines
and use /proc/cpuinfo on Linux to determine a sensible default.
2023-03-17 19:47:35 +02:00
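Note on 45113b2f42: the message above describes deriving a default thread count from /proc/cpuinfo instead of hard-coding "-t 8". A minimal sketch of that idea follows. It is not the code from the commit: the function names `count_processors` and `default_threads` are hypothetical, and the fallback to `os.cpu_count()` is an assumption added for systems where /proc/cpuinfo is missing.

```python
import os


def count_processors(cpuinfo_text: str) -> int:
    """Count the logical processors listed in /proc/cpuinfo-style text.

    Each logical CPU is introduced by a line whose key is exactly
    "processor" (for example "processor\t: 0").
    """
    return sum(
        1
        for line in cpuinfo_text.splitlines()
        if line.split(":")[0].strip() == "processor"
    )


def default_threads(cpuinfo_path: str = "/proc/cpuinfo") -> int:
    """Pick a default thread count (hypothetical helper, not from the commit).

    Reads cpuinfo_path when it exists; otherwise falls back to
    os.cpu_count(), and finally to 1 if nothing can be determined.
    """
    try:
        with open(cpuinfo_path, encoding="utf-8") as f:
            n = count_processors(f.read())
    except OSError:
        n = 0  # file missing or unreadable, e.g. on non-Linux systems
    if n <= 0:
        n = os.cpu_count() or 1
    return n


if __name__ == "__main__":
    print(default_threads())
```

Counting "processor" entries yields logical CPUs, so on machines with SMT the result may exceed the number of physical cores; whether that is the best default depends on the workload.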
Bernat Vadell
afcd16588e 🚀 Dockerize llamacpp (#132)
* feat: dockerize llamacpp

* feat: split build & runtime stages

* split dockerfile into main & tools

* add quantize into tool docker image

* Update .devops/tools.sh

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* add docker action pipeline

* change CI to publish at github docker registry

* fix name runs-on macOS-latest is macos-latest (lowercase)

* include docker versioned images

* fix github action docker

* fix docker.yml

* feat: include all-in-one command tool & update readme.md

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-03-17 10:47:06 +01:00
Georgi Gerganov
b73c11ea8c Update README.md 2023-03-16 15:00:09 +02:00
Georgi Gerganov
be7fc04a86 Expand "Contributing" section 2023-03-16 08:55:13 +02:00
Georgi Gerganov
e27ab8cb60 Update hot topics - RMSnorm 2023-03-16 07:12:12 +02:00
moritzbrantner
3ffbb46e32 fixed typo (#178) 2023-03-15 22:35:25 +02:00
Musab Gultekin
3a59f2ef9b Fix potential licensing issue (#126)
* Update README.md

* Update README.md

remove facebook
2023-03-15 21:39:06 +02:00
Radoslav Gerganov
1db7851d94 Add section to README on how to run the project on Android (#130) 2023-03-14 15:30:08 +02:00
Georgi Gerganov
96d900134f Add Misc section + update hot topics + minor fixes 2023-03-14 09:43:52 +02:00
Georgi Gerganov
3cd7c8e227 Update contribution section, hot topics, limitations, etc. 2023-03-13 19:21:51 +02:00
Pavol Rusnak
b84a31d659 Add quantize script for batch quantization (#92)
* Add quantize script for batch quantization

* Indentation

* README for new quantize.sh

* Fix script name

* Fix file list on Mac OS

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-03-13 18:15:20 +02:00
Georgi Gerganov
67d50a97b4 Add initial contribution guidelines 2023-03-13 09:42:26 +02:00
Georgi Gerganov
c240cd1e05 Update README.md 2023-03-12 23:39:01 +02:00
Matvey Soloviev
d35528087e Add interactive mode (#61)
* Initial work on interactive mode.

* Improve interactive mode. Make rev. prompt optional.

* Update README to explain interactive mode.

* Fix OS X build
2023-03-12 23:13:28 +02:00
Marc Köhlbrugge
8de246c2d8 Fix typo in README (#45) 2023-03-12 22:30:08 +02:00
Georgi Gerganov
f6f3f1c7c1 Update README.md 2023-03-12 22:09:26 +02:00
Georgi Gerganov
85c71945cf Revert "weights_only" arg - this causing more trouble than help 2023-03-12 20:59:01 +02:00
Oleksandr Nikitin
a7cf72d75e python/pytorch compat notes (#44) 2023-03-12 14:16:33 +02:00
Georgi Gerganov
dc91ec5d67 Clarify meaning of hacking 2023-03-12 09:03:25 +02:00
Georgi Gerganov
95fb97b137 README: add "Supported platforms" + update hot topics 2023-03-12 08:41:54 +02:00
Georgi Gerganov
e34e3e21c4 Update README.md 2023-03-12 01:26:32 +02:00
Juraj Bednar
4cdcd39348 Fix a typo in model name (#16) 2023-03-11 19:32:20 +02:00
Georgi Gerganov
284d9be2de Update README.md 2023-03-11 18:10:18 +02:00
Georgi Gerganov
cc0f26bef3 Add AVX2 support for x86 architectures thanks to @Const-me ! 2023-03-11 18:04:25 +02:00
Georgi Gerganov
35cb0d2a39 Update README.md 2023-03-11 12:31:21 +02:00
Georgi Gerganov
2d2cadab68 Update Makefile var + add comment 2023-03-11 12:27:02 +02:00
Georgi Gerganov
657074b014 Update README.md 2023-03-11 11:34:25 +02:00
Georgi Gerganov
b53c6356f3 Update README.md 2023-03-11 11:34:11 +02:00
Georgi Gerganov
a2799521b9 Support all LLaMA models + change Q4_0 quantization storage 2023-03-11 11:28:30 +02:00
Simon Willison
d4919344b1 Include Python dependencies in README (#6) 2023-03-11 07:47:26 +02:00
Georgi Gerganov
11dae511e3 Update README.md 2023-03-11 01:30:47 +02:00
Georgi Gerganov
240b0bf6ea Update README.md 2023-03-11 01:22:58 +02:00
Georgi Gerganov
87da10c739 Update README.md 2023-03-11 01:18:10 +02:00
Georgi Gerganov
01e3d38e1c Update README.md 2023-03-11 00:55:22 +02:00
Georgi Gerganov
8d38e7e279 Update README.md 2023-03-11 00:51:46 +02:00
Georgi Gerganov
586e0f1f3d Update README.md 2023-03-11 00:09:19 +02:00
Georgi Gerganov
4c7f13c170 Update README.md 2023-03-10 23:53:11 +02:00
Georgi Gerganov
44f3a5b932 Update README.md 2023-03-10 21:52:27 +02:00
Georgi Gerganov
3cda59d04e Final touches 2023-03-10 21:50:46 +02:00
Georgi Gerganov
b2a7bb3e19 Create README.md 2023-03-10 21:47:46 +02:00