Commit Graph

  • a30749e299 Fix token count accounting Matvey Soloviev 2023-03-13 00:35:51 +01:00
  • 460c482540 Fix token count accounting Matvey Soloviev 2023-03-13 00:35:51 +01:00
  • 49a8c7675b Revert "10% performance boost on ARM" Georgi Gerganov 2023-03-13 01:28:08 +02:00
  • c80e2a8f2a Revert "10% performance boost on ARM" Georgi Gerganov 2023-03-13 01:28:08 +02:00
  • c47fa0ea5e Check for vdotq_s32 availability Georgi Gerganov 2023-03-13 01:21:03 +02:00
  • 54a0e66ea0 Check for vdotq_s32 availability Georgi Gerganov 2023-03-13 01:21:03 +02:00
  • c00675331e Ammend to previous commit - forgot to update non-QRDMX branch Georgi Gerganov 2023-03-13 01:05:24 +02:00
  • 543c57e991 Ammend to previous commit - forgot to update non-QRDMX branch Georgi Gerganov 2023-03-13 01:05:24 +02:00
  • f48b7628ea 10% performance boost on ARM Georgi Gerganov 2023-03-13 00:56:10 +02:00
  • 113a9e83eb 10% performance boost on ARM Georgi Gerganov 2023-03-13 00:56:10 +02:00
  • fedc405b41 Fix color getting reset before prompt output done (#65) Matvey Soloviev 2023-03-12 23:07:34 +01:00
  • 404fac0d62 Fix color getting reset before prompt output done (#65) Matvey Soloviev 2023-03-12 23:07:34 +01:00
  • c240cd1e05 Update README.md Georgi Gerganov 2023-03-12 23:39:01 +02:00
  • 1a0a74300f Update README.md Georgi Gerganov 2023-03-12 23:39:01 +02:00
  • d35528087e Add interactive mode (#61) Matvey Soloviev 2023-03-12 22:13:28 +01:00
  • 96ea727f47 Add interactive mode (#61) Matvey Soloviev 2023-03-12 22:13:28 +01:00
  • 8de246c2d8 Fix typo in README (#45) Marc Köhlbrugge 2023-03-13 03:30:08 +07:00
  • 9661954835 Fix typo in README (#45) Marc Köhlbrugge 2023-03-13 03:30:08 +07:00
  • 7a708ee9b0 Allow using prompt files (#59) Ben Garney 2023-03-12 13:28:36 -07:00
  • f385f8dee8 Allow using prompt files (#59) Ben Garney 2023-03-12 13:28:36 -07:00
  • c763dc1bc2 Add back top_k (#56) beiller 2023-03-12 16:23:15 -04:00
  • 02f0c6fe7f Add back top_k (#56) beiller 2023-03-12 16:23:15 -04:00
  • fde84afbed Windows fixes (#31) Sebastián A 2023-03-12 17:15:00 -03:00
  • eb062bb012 Windows fixes (#31) Sebastián A 2023-03-12 17:15:00 -03:00
  • f6f3f1c7c1 Update README.md Georgi Gerganov 2023-03-12 22:09:26 +02:00
  • 7027a97837 Update README.md Georgi Gerganov 2023-03-12 22:09:26 +02:00
  • 1f0283048a Add CI (#60) Georgi Gerganov 2023-03-12 22:08:24 +02:00
  • 2d555e5b42 Add CI (#60) Georgi Gerganov 2023-03-12 22:08:24 +02:00
  • 85c71945cf Revert "weights_only" arg - this causing more trouble than help Georgi Gerganov 2023-03-12 20:59:01 +02:00
  • 7c9e54e55e Revert "weights_only" arg - this causing more trouble than help Georgi Gerganov 2023-03-12 20:59:01 +02:00
  • a7cf72d75e python/pytorch compat notes (#44) Oleksandr Nikitin 2023-03-12 14:16:33 +02:00
  • b9bd1d0141 python/pytorch compat notes (#44) Oleksandr Nikitin 2023-03-12 14:16:33 +02:00
  • a63a748bba Add repetition penalty (#20) beiller 2023-03-12 05:27:42 -04:00
  • 129c7d1ea8 Add repetition penalty (#20) beiller 2023-03-12 05:27:42 -04:00
  • dc91ec5d67 Clarify meaning of hacking Georgi Gerganov 2023-03-12 09:03:25 +02:00
  • 702fddf5c5 Clarify meaning of hacking Georgi Gerganov 2023-03-12 09:03:25 +02:00
  • 95fb97b137 README: add "Supported platforms" + update hot topics Georgi Gerganov 2023-03-12 08:41:54 +02:00
  • 7d86e25bf6 README: add "Supported platforms" + update hot topics Georgi Gerganov 2023-03-12 08:41:54 +02:00
  • a9b53a036b use weights_only in conversion script (#32) deepdiffuser 2023-03-11 22:36:35 -08:00
  • a93120236f use weights_only in conversion script (#32) deepdiffuser 2023-03-11 22:36:35 -08:00
  • 865eff3820 Add LICENSE (#21) Pavol Rusnak 2023-03-12 07:36:03 +01:00
  • 6a9a67f0be Add LICENSE (#21) Pavol Rusnak 2023-03-12 07:36:03 +01:00
  • e34e3e21c4 Update README.md Georgi Gerganov 2023-03-12 01:26:32 +02:00
  • da1a4ff01f Update README.md Georgi Gerganov 2023-03-12 01:26:32 +02:00
  • 4cdcd39348 Fix a typo in model name (#16) Juraj Bednar 2023-03-11 18:32:20 +01:00
  • 6b2cb6302f Fix a typo in model name (#16) Juraj Bednar 2023-03-11 18:32:20 +01:00
  • 284d9be2de Update README.md Georgi Gerganov 2023-03-11 18:10:18 +02:00
  • 4235e3d5b3 Update README.md Georgi Gerganov 2023-03-11 18:10:18 +02:00
  • cc0f26bef3 Add AVX2 support for x86 architectures thanks to @Const-me ! Georgi Gerganov 2023-03-11 17:58:18 +02:00
  • f1eaff4721 Add AVX2 support for x86 architectures thanks to @Const-me ! Georgi Gerganov 2023-03-11 17:58:18 +02:00
  • bc3184cb2d Fix un-initialized FP16 tables on x86 (#15, #2) Georgi Gerganov 2023-03-11 17:40:14 +02:00
  • a9e58529ea Fix un-initialized FP16 tables on x86 (#15, #2) Georgi Gerganov 2023-03-11 17:40:14 +02:00
  • 5afe16962e Bump memory buffer Georgi Gerganov 2023-03-11 12:44:21 +02:00
  • 7d9ed7b25f Bump memory buffer Georgi Gerganov 2023-03-11 12:44:21 +02:00
  • 35cb0d2a39 Update README.md Georgi Gerganov 2023-03-11 12:31:21 +02:00
  • 0c6803321c Update README.md Georgi Gerganov 2023-03-11 12:31:21 +02:00
  • 34bb8821d6 .gitignore models/ Georgi Gerganov 2023-03-11 12:26:46 +02:00
  • f60fa9e50a .gitignore models/ Georgi Gerganov 2023-03-11 12:26:46 +02:00
  • 2d2cadab68 Update Makefile var + add comment Georgi Gerganov 2023-03-11 12:26:16 +02:00
  • 7211862c94 Update Makefile var + add comment Georgi Gerganov 2023-03-11 12:26:16 +02:00
  • 657074b014 Update README.md Georgi Gerganov 2023-03-11 11:34:25 +02:00
  • a5c5ae2f54 Update README.md Georgi Gerganov 2023-03-11 11:34:25 +02:00
  • b53c6356f3 Update README.md Georgi Gerganov 2023-03-11 11:34:11 +02:00
  • ea977e85ec Update README.md Georgi Gerganov 2023-03-11 11:34:11 +02:00
  • a2799521b9 Support all LLaMA models + change Q4_0 quantization storage Georgi Gerganov 2023-03-11 10:47:09 +02:00
  • 007a8f6f45 Support all LLaMA models + change Q4_0 quantization storage Georgi Gerganov 2023-03-11 10:47:09 +02:00
  • d4919344b1 Include Python dependencies in README (#6) Simon Willison 2023-03-10 21:47:26 -08:00
  • 5f2f970d51 Include Python dependencies in README (#6) Simon Willison 2023-03-10 21:47:26 -08:00
  • 11dae511e3 Update README.md Georgi Gerganov 2023-03-11 01:30:47 +02:00
  • 73c6ed5e87 Update README.md Georgi Gerganov 2023-03-11 01:30:47 +02:00
  • 240b0bf6ea Update README.md Georgi Gerganov 2023-03-11 01:22:58 +02:00
  • 01eeed8fb1 Update README.md Georgi Gerganov 2023-03-11 01:22:58 +02:00
  • 87da10c739 Update README.md Georgi Gerganov 2023-03-11 01:18:10 +02:00
  • 6da2df34ee Update README.md Georgi Gerganov 2023-03-11 01:18:10 +02:00
  • 12131a74cb Add missing headers for memcpy and assert (#3) Jean-Michaël Celerier 2023-03-10 18:04:06 -05:00
  • 9dcf4dba45 Add missing headers for memcpy and assert (#3) Jean-Michaël Celerier 2023-03-10 18:04:06 -05:00
  • 01e3d38e1c Update README.md Georgi Gerganov 2023-03-11 00:55:22 +02:00
  • 920a7fe2d9 Update README.md Georgi Gerganov 2023-03-11 00:55:22 +02:00
  • 8d38e7e279 Update README.md Georgi Gerganov 2023-03-11 00:51:46 +02:00
  • 3a57ee59de Update README.md Georgi Gerganov 2023-03-11 00:51:46 +02:00
  • 586e0f1f3d Update README.md Georgi Gerganov 2023-03-11 00:09:19 +02:00
  • b85028522d Update README.md Georgi Gerganov 2023-03-11 00:09:19 +02:00
  • 4c7f13c170 Update README.md Georgi Gerganov 2023-03-10 23:53:11 +02:00
  • 8a01f565ff Update README.md Georgi Gerganov 2023-03-10 23:53:11 +02:00
  • 8453184bb2 Fix a bug in the rope calculation Georgi Gerganov 2023-03-10 23:46:39 +02:00
  • 70bc0b8b15 Fix a bug in the rope calculation Georgi Gerganov 2023-03-10 23:46:39 +02:00
  • 44f3a5b932 Update README.md Georgi Gerganov 2023-03-10 21:52:27 +02:00
  • 18ebda34d6 Update README.md Georgi Gerganov 2023-03-10 21:52:27 +02:00
  • 3cda59d04e Final touches Georgi Gerganov 2023-03-10 21:50:46 +02:00
  • 319cdb3e1f Final touches Georgi Gerganov 2023-03-10 21:50:46 +02:00
  • b2a7bb3e19 Create README.md Georgi Gerganov 2023-03-10 21:47:46 +02:00
  • 775328064e Create README.md Georgi Gerganov 2023-03-10 21:47:46 +02:00
  • 4b5b86d6ee Initial release Georgi Gerganov 2023-03-10 20:40:58 +02:00
  • 26c0846629 Initial release Georgi Gerganov 2023-03-10 20:40:58 +02:00