76 Commits

Author SHA1 Message Date
Pavol Rusnak
944d161986 Make docker instructions more explicit (#785) 2023-04-06 08:56:58 +02:00
Georgi Gerganov
0470adafd0 Update README.md 2023-04-05 19:54:30 +03:00
Georgi Gerganov
a19b5cee08 readme : change logo + add bindings + add uis + add wiki 2023-04-05 18:56:20 +03:00
Adithya Balaji
5cdd9ef43f readme : update with CMake and windows example (#748)
* README: Update with CMake and windows example

* README: update with code-review for cmake build
2023-04-05 17:36:12 +03:00
Thatcher Chamberlin
01e2261e5f Add a missing step to the gpt4all instructions (#690)
`migrate-ggml-2023-03-30-pr613.py` is needed to get gpt4all running.
2023-04-02 12:48:57 +02:00
rimoliga
34977d15c2 readme: replace termux links with homepage, play store is deprecated (#680) 2023-04-01 16:57:30 +02:00
Pavol Rusnak
e88a8002b5 drop quantize.py (now that models are using a single file) 2023-03-31 01:07:32 +02:00
Georgi Gerganov
e19e304480 readme : update supported models 2023-03-30 22:31:54 +03:00
Georgi Gerganov
32d84d4876 readme : fix typos 2023-03-29 19:38:31 +03:00
Georgi Gerganov
689ed6a51e readme : add GPT4All instructions (close #588) 2023-03-29 19:37:20 +03:00
Stephan Walter
3b8b2c584a Update README and comments for standalone perplexity tool (#525) 2023-03-26 16:14:01 +03:00
Georgi Gerganov
3600f1d140 Add logo to README.md 2023-03-26 10:20:49 +03:00
Georgi Gerganov
9d678e17dc Move chat scripts into "./examples" 2023-03-25 20:37:09 +02:00
Georgi Gerganov
1c1459f073 Remove obsolete information from README 2023-03-25 16:30:32 +02:00
Gary Mulder
ccf5a1b08d Update README.md (#444)
Added explicit **bolded** instructions clarifying that people need to request access to models from Facebook and never through this repo.
2023-03-24 15:23:09 +00:00
Georgi Gerganov
1f369c619d Add link to Roadmap discussion 2023-03-24 09:13:35 +02:00
Stephan Walter
3ebb023fb2 Revert "Delete SHA256SUMS for now" (#429)
* Revert "Delete SHA256SUMS for now (#416)"

This reverts commit 8eea5ae0e5.

* Remove ggml files until they can be verified
* Remove alpaca json
* Add also model/tokenizer.model to SHA256SUMS + update README

---------

Co-authored-by: Pavol Rusnak <pavol@rusnak.io>
2023-03-23 15:15:48 +01:00
Gary Mulder
e689dccbad Move model section from issue template to README.md (#421)
* Update custom.md

* Removed Model section as it is better placed in README.md

* Updates to README.md model section

* Inserted text that was removed from issue template about obtaining models from FB and links to papers describing the various models

* Removed IPFS download links for the Alpaca 7B models as these look to be in the old data format and probably shouldn't be directly linked to, anyway

* Updated the perplexity section to point at Perplexity scores #406 discussion
2023-03-23 11:30:40 +00:00
Georgi Gerganov
a1b7fa8c60 Adjust repetition penalty .. 2023-03-23 10:46:58 +02:00
Georgi Gerganov
1d31d737d8 Add link to recent podcast about whisper.cpp and llama.cpp 2023-03-23 09:48:51 +02:00
Gary Linscott
686427a35f Add details on perplexity to README.md (#395) 2023-03-22 08:53:54 -07:00
Georgi Gerganov
1deed1f1e7 Remove temporary notice and update hot topics 2023-03-22 07:34:02 +02:00
Gary Mulder
3081cf8ed9 Add SHA256SUMS file and instructions to README on how to obtain and verify the downloads
Hashes created using:

sha256sum models/*B/*.pth models/*[7136]B/ggml-model-f16.bin* models/*[7136]B/ggml-model-q4_0.bin* > SHA256SUMS
2023-03-21 23:19:11 +01:00
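The `sha256sum` command in commit 3081cf8ed9 writes one `HASH  PATH` line per file. A minimal sketch of how such a file could be checked without the `sha256sum` tool is shown here in Python. The function names `sha256_of` and `verify_sums` are hypothetical and not part of the repository; the sketch assumes paths in the sums file are relative to `base_dir`.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_sums(sums_file, base_dir="."):
    """Check every 'HASH  PATH' line of a sha256sum-style file.

    Returns a dict mapping each listed path to "OK", "FAILED" or "MISSING".
    (Hypothetical helper, written for illustration only.)
    """
    results = {}
    for line in Path(sums_file).read_text().splitlines():
        if not line.strip():
            continue
        expected, name = line.split(maxsplit=1)
        if name.startswith("*"):
            name = name[1:]  # sha256sum marks binary mode with a leading '*'
        target = Path(base_dir) / name
        if not target.is_file():
            results[name] = "MISSING"
        elif sha256_of(target) == expected.lower():
            results[name] = "OK"
        else:
            results[name] = "FAILED"
    return results
```

Calling `verify_sums("SHA256SUMS")` from the repository root would report the state of each listed file. Reading in chunks keeps memory use flat for multi-gigabyte model files.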
Georgi Gerganov
278d9b3d84 Add notice about pending change 2023-03-21 22:57:35 +02:00
Georgi Gerganov
e7a75316dc Minor style changes 2023-03-21 18:10:32 +02:00
Georgi Gerganov
f57b30a8e1 Add chat.sh script 2023-03-21 18:09:46 +02:00
Georgi Gerganov
614b1afa1c Fix convert script, warnings alpaca instructions, default params 2023-03-21 17:59:16 +02:00
Kevin Kwok
9c26af616b Update IPFS links to quantized alpaca with new tokenizer format (#352) 2023-03-21 17:34:49 +02:00
Mack Straight
60d93896be sentencepiece bpe compatible tokenizer (#252)
* potential out of bounds read

* fix quantize

* style

* Update convert-pth-to-ggml.py

* mild cleanup

* don't need the space-prefixing here rn since main.cpp already does it

* new file magic + version header field

* readme notice

* missing newlines

Co-authored-by: slaren <2141330+slaren@users.noreply.github.com>
2023-03-20 03:17:23 -07:00
Suaj Carrot
3eaf221dee Improved quantize script (#222)
* Improved quantize script

I improved the quantize script by adding error handling and allowing many models to be selected for quantization at once on the command line. I also converted it to Python for generalization as well as extensibility.

* Fixes and improvements based on Matt's observations

Fixed and improved many things in the script based on the reviews made by @mattsta. The parallelization suggestion is still to be revised, but code for it was still added (commented).

* Small fixes to the previous commit

* Corrected to use the original glob pattern

The original Bash script uses a glob pattern to match files that have endings such as ...bin.0, ...bin.1, etc. That has been translated correctly to Python now.

* Added support for Windows and updated README to use this script

New code to set the name of the quantize script binary depending on the platform has been added (quantize.exe if working on Windows) and the README.md file has been updated to use this script instead of the Bash one.

* Fixed a typo and removed shell=True in the subprocess.run call

Fixed a typo regarding the new filenames of the quantized models and removed the shell=True parameter in the subprocess.run call as it was conflicting with the list of parameters.

* Corrected previous commit

* Small tweak: changed the name of the program in argparse

This made the automatic help message suggest the program's usage as literally "$ Quantization Script [arguments]". It should now be something like "$ python3 quantize.py [arguments]".
2023-03-19 20:38:44 +02:00
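Commit 3eaf221dee describes three techniques: choosing the binary name per platform, globbing for multi-part model files, and calling `subprocess.run` with an argument list instead of `shell=True`. The sketch below illustrates them under stated assumptions and is not the script from that commit. The helper names, the `ggml-model-f16.bin` base name (taken from the SHA256SUMS commit in this log) and the `[binary, input, output, "2"]` argument layout are illustrative only.

```python
import glob
import os
import platform


def quantize_binary_name(system=None):
    """Pick the executable name; Windows builds carry an .exe suffix."""
    system = system or platform.system()
    return "quantize.exe" if system == "Windows" else "quantize"


def find_model_parts(model_dir, base="ggml-model-f16.bin"):
    """Match the base file plus numbered parts such as base.0, base.1."""
    pattern = os.path.join(model_dir, base + "*")
    return sorted(glob.glob(pattern))


def build_commands(model_dir, system=None):
    """Build one argument list per model part; nothing is executed here.

    The argument layout (input, output, type id) is an assumption made
    for illustration, not a documented interface.
    """
    binary = os.path.join(".", quantize_binary_name(system))
    commands = []
    for src in find_model_parts(model_dir):
        folder, name = os.path.split(src)
        dst = os.path.join(folder, name.replace("f16", "q4_0"))
        commands.append([binary, src, dst, "2"])
    return commands
```

Each returned list can be passed to `subprocess.run(cmd, check=True)`. Leaving out `shell=True` matters because, on POSIX systems, combining a list with `shell=True` treats only the first element as the command and hands the rest to the shell itself.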
Georgi Gerganov
eed44b2875 Update hot topics to mention Alpaca support 2023-03-19 19:51:55 +02:00
Georgi Gerganov
b3bd91ce9d Add instruction for using Alpaca (#240) 2023-03-19 18:49:50 +02:00
Pavol Rusnak
f9cb6f8979 Fix typo in readme 2023-03-18 23:18:04 +01:00
Pavol Rusnak
cc4ace10bf Add note about Python 3.11 to readme 2023-03-18 22:25:35 +01:00
Pavol Rusnak
0bfb4f160f Add memory/disk requirements to readme 2023-03-18 22:25:35 +01:00
Georgi Gerganov
b57c1e4295 Update Contributing section 2023-03-17 20:30:04 +02:00
Stephan Walter
45113b2f42 Don't tell users to use a bad number of threads (#243)
The readme tells people to use the command line option "-t 8", causing 8
threads to be started. On systems with fewer than 8 cores, this causes a
significant slowdown. Remove the option from the example command lines
and use /proc/cpuinfo on Linux to determine a sensible default.
2023-03-17 19:47:35 +02:00
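The idea in commit 45113b2f42, deriving a thread default from `/proc/cpuinfo` rather than hard-coding `-t 8`, can be sketched as follows. This is an illustrative Python version, not the project's own code. The function names and the fallback value of 4 are assumptions, and the count is of logical processors, which can exceed the number of physical cores.

```python
import os


def count_cpuinfo_processors(text):
    """Count 'processor' entries in /proc/cpuinfo-style text."""
    return sum(
        1
        for line in text.splitlines()
        if line.split(":")[0].strip() == "processor"
    )


def default_threads(cpuinfo_path="/proc/cpuinfo", fallback=4):
    """Return a thread count based on the machine, never a fixed guess.

    Falls back to os.cpu_count(), then to `fallback`, when the file is
    missing (for example on non-Linux systems) or lists no processors.
    """
    try:
        with open(cpuinfo_path) as fh:
            found = count_cpuinfo_processors(fh.read())
    except OSError:
        found = 0
    return found or os.cpu_count() or fallback
```

On a machine with fewer than 8 cores this yields a smaller value than the old `-t 8` example, which avoids the oversubscription slowdown the commit message describes.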
Bernat Vadell
afcd16588e 🚀 Dockerize llamacpp (#132)
* feat: dockerize llamacpp

* feat: split build & runtime stages

* split dockerfile into main & tools

* add quantize into tool docker image

* Update .devops/tools.sh

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* add docker action pipeline

* change CI to publish at github docker registry

* fix name runs-on macOS-latest is macos-latest (lowercase)

* include docker versioned images

* fix github action docker

* fix docker.yml

* feat: include all-in-one command tool & update readme.md

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-03-17 10:47:06 +01:00
Georgi Gerganov
b73c11ea8c Update README.md 2023-03-16 15:00:09 +02:00
Georgi Gerganov
be7fc04a86 Expand "Contributing" section 2023-03-16 08:55:13 +02:00
Georgi Gerganov
e27ab8cb60 Update hot topics - RMSnorm 2023-03-16 07:12:12 +02:00
moritzbrantner
3ffbb46e32 fixed typo (#178) 2023-03-15 22:35:25 +02:00
Musab Gultekin
3a59f2ef9b Fix potential licensing issue (#126)
* Update README.md

* Update README.md

remove facebook
2023-03-15 21:39:06 +02:00
Radoslav Gerganov
1db7851d94 Add section to README on how to run the project on Android (#130) 2023-03-14 15:30:08 +02:00
Georgi Gerganov
96d900134f Add Misc section + update hot topics + minor fixes 2023-03-14 09:43:52 +02:00
Georgi Gerganov
3cd7c8e227 Update contribution section, hot topics, limitations, etc. 2023-03-13 19:21:51 +02:00
Pavol Rusnak
b84a31d659 Add quantize script for batch quantization (#92)
* Add quantize script for batch quantization

* Indentation

* README for new quantize.sh

* Fix script name

* Fix file list on Mac OS

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-03-13 18:15:20 +02:00
Georgi Gerganov
67d50a97b4 Add initial contribution guidelines 2023-03-13 09:42:26 +02:00
Georgi Gerganov
c240cd1e05 Update README.md 2023-03-12 23:39:01 +02:00
Matvey Soloviev
d35528087e Add interactive mode (#61)
* Initial work on interactive mode.

* Improve interactive mode. Make rev. prompt optional.

* Update README to explain interactive mode.

* Fix OS X build
2023-03-12 23:13:28 +02:00