Commit Graph

  • 160bfb217d Update hot topics to mention Alpaca support Georgi Gerganov 2023-03-19 19:51:55 +02:00
  • 9b87f28d8d Fix off-by-one bug (#115) Georgi Gerganov 2023-03-19 19:46:32 +02:00
  • c494ed5b94 Fix off-by-one bug (#115) Georgi Gerganov 2023-03-19 19:46:32 +02:00
  • c03de4d3c6 Fix python stuff (#109) Georgi Gerganov 2023-03-19 19:33:18 +02:00
  • c1c7026b47 Fix python stuff (#109) Georgi Gerganov 2023-03-19 19:33:18 +02:00
  • 6ebde6c9de Refactoring convert-pth-to-ggml.py: more concise and readable (#109) qunash 2023-03-19 20:17:39 +03:00
  • 467b149761 Refactoring convert-pth-to-ggml.py: more concise and readable (#109) qunash 2023-03-19 20:17:39 +03:00
  • 5ff8d6f48e Drop trailing new line from file prompts (#80) Georgi Gerganov 2023-03-19 19:04:44 +02:00
  • 70f01cb863 Drop trailing new line from file prompts (#80) Georgi Gerganov 2023-03-19 19:04:44 +02:00
  • b3bd91ce9d Add instruction for using Alpaca (#240) Georgi Gerganov 2023-03-19 18:49:50 +02:00
  • a4e63b73df Add instruction for using Alpaca (#240) Georgi Gerganov 2023-03-19 18:49:50 +02:00
  • 2ad520e892 Add "--instruct" argument for usage with Alpaca (#240) Georgi Gerganov 2023-03-19 18:37:02 +02:00
  • 9e1707218a Add "--instruct" argument for usage with Alpaca (#240) Georgi Gerganov 2023-03-19 18:37:02 +02:00
  • 960c6bfb09 Change RMSNorm eps to 1e-6 (#173) Georgi Gerganov 2023-03-19 17:30:00 +02:00
  • 22213a17b5 Change RMSNorm eps to 1e-6 (#173) Georgi Gerganov 2023-03-19 17:30:00 +02:00
  • 0334548e49 Warn user if a context size greater than 2048 tokens is specified (#274) Ronsor 2023-03-18 17:10:47 -07:00
  • d7def1a752 Warn user if a context size greater than 2048 tokens is specified (#274) Ronsor 2023-03-18 17:10:47 -07:00
  • f9cb6f8979 Fix typo in readme Pavol Rusnak 2023-03-18 22:39:46 +01:00
  • 6f61c18ec9 Fix typo in readme Pavol Rusnak 2023-03-18 22:39:46 +01:00
  • cc4ace10bf Add note about Python 3.11 to readme Pavol Rusnak 2023-03-18 22:20:04 +01:00
  • 1e5a6d088d Add note about Python 3.11 to readme Pavol Rusnak 2023-03-18 22:20:04 +01:00
  • 0bfb4f160f Add memory/disk requirements to readme Pavol Rusnak 2023-03-18 21:58:46 +01:00
  • 554b541521 Add memory/disk requirements to readme Pavol Rusnak 2023-03-18 21:58:46 +01:00
  • 2e061e8283 Remove unused code since n_vocab is model.hparams.n_vocab (#262) Alex Nguyen 2023-03-18 20:51:49 +07:00
  • d3f202d57b Remove unused code since n_vocab is model.hparams.n_vocab (#262) Alex Nguyen 2023-03-18 20:51:49 +07:00
  • 62c6897bc1 fixed warning with std::ignore about unused function result (#151) Justin Suess 2023-03-18 07:44:09 -04:00
  • e03e359730 fixed warning with std::ignore about unused function result (#151) Justin Suess 2023-03-18 07:44:09 -04:00
  • ce7029fe88 Fix n^2 loop in tokenization (#254) Gary Linscott 2023-03-18 04:17:19 -07:00
  • a81d0c2a17 Fix n^2 loop in tokenization (#254) Gary Linscott 2023-03-18 04:17:19 -07:00
  • f2f67abb1a CI Improvements (#230) anzz1 2023-03-18 09:27:12 +02:00
  • b2de7f18df CI Improvements (#230) anzz1 2023-03-18 09:27:12 +02:00
  • ec4dddf648 Nix flake (#40) Niklas Korz 2023-03-17 23:03:48 +01:00
  • a292747893 Nix flake (#40) Niklas Korz 2023-03-17 23:03:48 +01:00
  • 95eab2152b Implement non-greedy tokenizer that tries to maximize token lengths (#242) thement 2023-03-17 21:05:58 +01:00
  • c9f670a177 Implement non-greedy tokenizer that tries to maximize token lengths (#242) thement 2023-03-17 21:05:58 +01:00
  • e9cd7a260b Default to 4 threads (#243) Georgi Gerganov 2023-03-17 21:46:46 +02:00
  • 4f54609110 Default to 4 threads (#243) Georgi Gerganov 2023-03-17 21:46:46 +02:00
  • b57c1e4295 Update Contributing section Georgi Gerganov 2023-03-17 20:30:04 +02:00
  • e81b9c81c1 Update Contributing section Georgi Gerganov 2023-03-17 20:30:04 +02:00
  • 45113b2f42 Don't tell users to use a bad number of threads (#243) Stephan Walter 2023-03-17 17:47:35 +00:00
  • 367946c668 Don't tell users to use a bad number of threads (#243) Stephan Walter 2023-03-17 17:47:35 +00:00
  • 527af9cbdd add ptread link to fix cmake build under linux (#114) mmyjona 2023-03-18 00:38:24 +08:00
  • 6b0df5ccf3 add ptread link to fix cmake build under linux (#114) mmyjona 2023-03-18 00:38:24 +08:00
  • afcd16588e 🚀 Dockerize llamacpp (#132) Bernat Vadell 2023-03-17 10:47:06 +01:00
  • 2af23d3043 🚀 Dockerize llamacpp (#132) Bernat Vadell 2023-03-17 10:47:06 +01:00
  • 03c8e88515 Q4_1 quantization (#193) Matvey Soloviev 2023-03-17 05:48:39 +01:00
  • 904d2a8d6a Q4_1 quantization (#193) Matvey Soloviev 2023-03-17 05:48:39 +01:00
  • b73c11ea8c Update README.md Georgi Gerganov 2023-03-16 15:00:09 +02:00
  • 721311070e Update README.md Georgi Gerganov 2023-03-16 15:00:09 +02:00
  • be7fc04a86 Expand "Contributing" section Georgi Gerganov 2023-03-16 08:55:13 +02:00
  • ac15de7895 Expand "Contributing" section Georgi Gerganov 2023-03-16 08:55:13 +02:00
  • e27ab8cb60 Update hot topics - RMSnorm Georgi Gerganov 2023-03-16 07:12:12 +02:00
  • 273abc47ff Update hot topics - RMSnorm Georgi Gerganov 2023-03-16 07:12:12 +02:00
  • 1b96142bae Fix RMS norm in GGML (#191) Nebula 2023-03-15 19:29:25 -04:00
  • 9b4a15b17d Fix RMS norm in GGML (#191) Nebula 2023-03-15 19:29:25 -04:00
  • 12b9bd9b13 Add RMS norm and use it (#187) hoangmit 2023-03-15 18:41:38 -04:00
  • 6eac39ba95 Add RMS norm and use it (#187) hoangmit 2023-03-15 18:41:38 -04:00
  • 3ffbb46e32 fixed typo (#178) moritzbrantner 2023-03-15 21:35:25 +01:00
  • 27944c4206 fixed typo (#178) moritzbrantner 2023-03-15 21:35:25 +01:00
  • f88e2693cc add SIGINT support for _WIN32 environments (#120) Rickey Bowers Jr 2023-03-15 13:56:24 -06:00
  • 2d15d6c9a9 add SIGINT support for _WIN32 environments (#120) Rickey Bowers Jr 2023-03-15 13:56:24 -06:00
  • a4d17b7096 added ctx_size parameter (#148) Justin Suess 2023-03-15 15:42:40 -04:00
  • 2d64715ad4 added ctx_size parameter (#148) Justin Suess 2023-03-15 15:42:40 -04:00
  • 3d4b93a8d4 fixed color reset on exit (#149) Justin Suess 2023-03-15 15:39:38 -04:00
  • 16b2c61a22 fixed color reset on exit (#149) Justin Suess 2023-03-15 15:39:38 -04:00
  • 3a59f2ef9b Fix potential licensing issue (#126) Musab Gultekin 2023-03-15 22:39:06 +03:00
  • 977295c700 Fix potential licensing issue (#126) Musab Gultekin 2023-03-15 22:39:06 +03:00
  • dba21f1c6f Use tokenizer.vocab_size() instead of hardcoding 32000 in convert-pth-to-ggml.py (#142) Ronsor 2023-03-15 12:37:50 -07:00
  • 956dfda8ad Use tokenizer.vocab_size() instead of hardcoding 32000 in convert-pth-to-ggml.py (#142) Ronsor 2023-03-15 12:37:50 -07:00
  • 735b1a2aaa inline -> static inline for "bytesFromNibbles" (#161) hoangmit 2023-03-15 15:05:14 -04:00
  • 113e685d18 inline -> static inline for "bytesFromNibbles" (#161) hoangmit 2023-03-15 15:05:14 -04:00
  • 55f8043b2f Don't use vdotq_s32 if it's not available (#139) Ronsor 2023-03-14 12:34:37 -07:00
  • 47857e564c Don't use vdotq_s32 if it's not available (#139) Ronsor 2023-03-14 12:34:37 -07:00
  • 1db7851d94 Add section to README on how to run the project on Android (#130) Radoslav Gerganov 2023-03-14 15:30:08 +02:00
  • 60f819a2b1 Add section to README on how to run the project on Android (#130) Radoslav Gerganov 2023-03-14 15:30:08 +02:00
  • 96d900134f Add Misc section + update hot topics + minor fixes Georgi Gerganov 2023-03-14 09:43:52 +02:00
  • 97ab2b2578 Add Misc section + update hot topics + minor fixes Georgi Gerganov 2023-03-14 09:43:52 +02:00
  • 7925ae2017 Add windows to the CI (#98) Sebastián A 2023-03-13 17:29:10 -03:00
  • 2f700a2738 Add windows to the CI (#98) Sebastián A 2023-03-13 17:29:10 -03:00
  • 3e39a552f6 CMake build in Release by default (#75) Georgi Gerganov 2023-03-13 21:22:15 +02:00
  • c09a9cfb06 CMake build in Release by default (#75) Georgi Gerganov 2023-03-13 21:22:15 +02:00
  • 3cd7c8e227 Update contribution section, hot topics, limitations, etc. Georgi Gerganov 2023-03-13 19:21:51 +02:00
  • 7ec903d3c1 Update contribution section, hot topics, limitations, etc. Georgi Gerganov 2023-03-13 19:21:51 +02:00
  • 222ee5f918 Print system information Georgi Gerganov 2023-03-13 19:15:08 +02:00
  • 4497ad819c Print system information Georgi Gerganov 2023-03-13 19:15:08 +02:00
  • 4acda08f42 Initial support for CMake (#75) Sebastián A 2023-03-13 14:12:33 -03:00
  • ed6849cc07 Initial support for CMake (#75) Sebastián A 2023-03-13 14:12:33 -03:00
  • d3ed019b74 Add NetBSD support. (#90) Thomas Klausner 2023-03-13 17:40:54 +01:00
  • 41be0a3b3d Add NetBSD support. (#90) Thomas Klausner 2023-03-13 17:40:54 +01:00
  • e429f5b9e0 Use fprintf for diagnostic output (#48) Pavol Rusnak 2023-03-13 17:39:56 +01:00
  • 671d5cac15 Use fprintf for diagnostic output (#48) Pavol Rusnak 2023-03-13 17:39:56 +01:00
  • c1eebc2a25 Use vdotq_s32 to improve performance (#67) Georgi Gerganov 2023-03-13 18:36:44 +02:00
  • 84d9015c4a Use vdotq_s32 to improve performance (#67) Georgi Gerganov 2023-03-13 18:36:44 +02:00
  • a81c113197 Reduce model loading time (#43) uint256_t 2023-03-14 01:33:43 +09:00
  • 63fd76fbb0 Reduce model loading time (#43) uint256_t 2023-03-14 01:33:43 +09:00
  • d35d36dff8 Fix UTF-8 handling (including colors) (#79) Val Kharitonov 2023-03-13 12:24:18 -04:00
  • 2a20f48efa Fix UTF-8 handling (including colors) (#79) Val Kharitonov 2023-03-13 12:24:18 -04:00
  • b84a31d659 Add quantize script for batch quantization (#92) Pavol Rusnak 2023-03-13 17:15:20 +01:00
  • d1f224712d Add quantize script for batch quantization (#92) Pavol Rusnak 2023-03-13 17:15:20 +01:00
  • 67d50a97b4 Add initial contribution guidelines Georgi Gerganov 2023-03-13 09:42:26 +02:00