Commit Graph

  • c7ef8d36f8 cuda : deduplicated dequantization code (#1453) Johannes Gäßler 2023-05-14 20:53:23 +02:00
  • eb363627fd cuda : deduplicated dequantization code (#1453) Johannes Gäßler 2023-05-14 20:53:23 +02:00
  • 9b2e653e7f ggml : alternative fix for race condition bug in non-inplace ggml_compute_forward_diag_mask_f32 (#1454) xaedes 2023-05-14 17:55:02 +02:00
  • 79b2d5b69d ggml : alternative fix for race condition bug in non-inplace ggml_compute_forward_diag_mask_f32 (#1454) xaedes 2023-05-14 17:55:02 +02:00
  • e28fe5eef9 ggml : various fixes (#1450) Georgi Gerganov 2023-05-14 18:22:50 +03:00
  • 13c351ad72 ggml : various fixes (#1450) Georgi Gerganov 2023-05-14 18:22:50 +03:00
  • bd762b3971 ggml : add AVX support based on AVX2 code (#1430) katsu560 2023-05-14 19:03:51 +09:00
  • 60f8c361ca ggml : add AVX support based on AVX2 code (#1430) katsu560 2023-05-14 19:03:51 +09:00
  • 635da03e3b ggml : add GGML_QNT_VERSION to track quantization format changes Georgi Gerganov 2023-05-14 10:20:19 +03:00
  • 601a033475 ggml : add GGML_QNT_VERSION to track quantization format changes Georgi Gerganov 2023-05-14 10:20:19 +03:00
  • a235cdd440 cuda : fix convert function (#1412) Georgi Gerganov 2023-05-13 17:40:58 +03:00
  • 08737ef720 cuda : fix convert function (#1412) Georgi Gerganov 2023-05-13 17:40:58 +03:00
  • 1fe536b925 make : fix PERF build with cuBLAS Georgi Gerganov 2023-05-13 17:25:09 +03:00
  • bda4d7c215 make : fix PERF build with cuBLAS Georgi Gerganov 2023-05-13 17:25:09 +03:00
  • 3f7a2198d0 llama : fix unused warning Georgi Gerganov 2023-05-13 16:55:14 +03:00
  • 5a5aeb1e91 llama : fix unused warning Georgi Gerganov 2023-05-13 16:55:14 +03:00
  • 7fb8465533 ggml : multi-thread mul and diag_mask ops (#1428) Georgi Gerganov 2023-05-13 16:48:03 +03:00
  • 66841fdb0e ggml : multi-thread mul and diag_mask ops (#1428) Georgi Gerganov 2023-05-13 16:48:03 +03:00
  • dc54d1e549 ggml : GPU-accelerated token generation (#1412) Johannes Gäßler 2023-05-13 15:38:36 +02:00
  • 905d87b70a ggml : GPU-accelerated token generation (#1412) Johannes Gäßler 2023-05-13 15:38:36 +02:00
  • 70bad35cf6 ggml : implement backward pass for llama + small training-llama-from-scratch example (#1360) xaedes 2023-05-13 14:56:40 +02:00
  • f954edda93 ggml : implement backward pass for llama + small training-llama-from-scratch example (#1360) xaedes 2023-05-13 14:56:40 +02:00
  • 017260390c ggml : sync alibi fix from ggml repo Georgi Gerganov 2023-05-13 11:54:33 +03:00
  • f048af0230 ggml : sync alibi fix from ggml repo Georgi Gerganov 2023-05-13 11:54:33 +03:00
  • 974fbfe322 Adding SSE instructions to ggml_vec_dot_q4_0_q8_0 (#1413) 3ooabkhxtn 2023-05-13 10:43:33 +02:00
  • ac0cd259d5 Adding SSE instructions to ggml_vec_dot_q4_0_q8_0 (#1413) 3ooabkhxtn 2023-05-13 10:43:33 +02:00
  • 6727abd8bb llama : fix various warnings Georgi Gerganov 2023-05-13 11:23:15 +03:00
  • 0cd22e190a llama : fix various warnings Georgi Gerganov 2023-05-13 11:23:15 +03:00
  • 44bb8a8093 embedding : remove unused code (#1426) Rinne 2023-05-13 15:24:20 +08:00
  • 6456a4eb9f embedding : remove unused code (#1426) Rinne 2023-05-13 15:24:20 +08:00
  • 490afcc5d0 readme : update Q4_0 perplexities Georgi Gerganov 2023-05-13 09:12:44 +03:00
  • cdd5350892 readme : update Q4_0 perplexities Georgi Gerganov 2023-05-13 09:12:44 +03:00
  • 5badacc9cd llama : free ggml context in set / copy state data (close #1425) Georgi Gerganov 2023-05-13 09:08:52 +03:00
  • 738ace394a llama : free ggml context in set / copy state data (close #1425) Georgi Gerganov 2023-05-13 09:08:52 +03:00
  • 12d0bb07ba opencl : fix kernels for the new formats (#1422) Henri Vasserman 2023-05-13 09:01:15 +03:00
  • 699b1ad7fe opencl : fix kernels for the new formats (#1422) Henri Vasserman 2023-05-13 09:01:15 +03:00
  • 5e63f54bfd llama : fix --mtest option (close #1414) Georgi Gerganov 2023-05-12 21:44:20 +03:00
  • fb62f92433 llama : fix --mtest option (close #1414) Georgi Gerganov 2023-05-12 21:44:20 +03:00
  • 10968554a5 CLI args use - instead of _, backwards compatible (#1416) Johannes Gäßler 2023-05-12 16:34:55 +02:00
  • 773ee249fb CLI args use - instead of _, backwards compatible (#1416) Johannes Gäßler 2023-05-12 16:34:55 +02:00
  • f6c8d06cd5 Add clang-tidy reviews to CI (#1407) slaren 2023-05-12 15:40:53 +02:00
  • 553fd4d4b5 Add clang-tidy reviews to CI (#1407) slaren 2023-05-12 15:40:53 +02:00
  • 440c83c83d readme : add C#/.NET bindings repo (#1409) Rinne 2023-05-12 13:39:40 +08:00
  • 089b1c93ba readme : add C#/.NET bindings repo (#1409) Rinne 2023-05-12 13:39:40 +08:00
  • 65f22db997 ggml : remove bit shuffling (#1405) Georgi Gerganov 2023-05-12 00:23:08 +03:00
  • b9fd7eee57 ggml : remove bit shuffling (#1405) Georgi Gerganov 2023-05-12 00:23:08 +03:00
  • 92fc2e0acc prompts : model agnostic DAN (#1304) CRD716 2023-05-11 10:10:19 -05:00
  • b608b55a3e prompts : model agnostic DAN (#1304) CRD716 2023-05-11 10:10:19 -05:00
  • 1c5dd23df0 main : add option to save full output to session (#1338) Evan Jones 2023-05-10 11:37:14 -04:00
  • cf348a60e0 main : add option to save full output to session (#1338) Evan Jones 2023-05-10 11:37:14 -04:00
  • adb8154d20 Locale fix for Windows (#1379) DannyDaemonic 2023-05-09 10:53:28 -07:00
  • e6a46b0ed1 Locale fix for Windows (#1379) DannyDaemonic 2023-05-09 10:53:28 -07:00
  • dee7de1dbe use pause asm insn in busyloop to run the CPU (13600K) 10 °C cooler (#1314) Sami Farin 2023-05-09 15:29:20 +03:00
  • 9f8dbc4787 use pause asm insn in busyloop to run the CPU (13600K) 10 °C cooler (#1314) Sami Farin 2023-05-09 15:29:20 +03:00
  • 2471a5348d Interface improvements and --multiline-input (previously --author-mode) (#1040) DannyDaemonic 2023-05-08 19:45:48 -07:00
  • 41654efea8 Interface improvements and --multiline-input (previously --author-mode) (#1040) DannyDaemonic 2023-05-08 19:45:48 -07:00
  • b778f64c54 readme : add notice about upcoming breaking change Georgi Gerganov 2023-05-08 22:52:18 +03:00
  • 56551bc11f readme : add notice about upcoming breaking change Georgi Gerganov 2023-05-08 22:52:18 +03:00
  • bcfad0f2bc readme : add TOC and Pygmalion instructions (#1359) AlpinDale 2023-05-08 21:03:30 +04:30
  • fe60904eef readme : add TOC and Pygmalion instructions (#1359) AlpinDale 2023-05-08 21:03:30 +04:30
  • 9fd75bdbc4 llama : fix hparams shadow (#1367) Pavol Rusnak 2023-05-08 16:48:21 +02:00
  • 003ba2fb43 llama : fix hparams shadow (#1367) Pavol Rusnak 2023-05-08 16:48:21 +02:00
  • e5c1a67530 llama : require first token to be BOS (#1303) Georgi Gerganov 2023-05-08 17:41:54 +03:00
  • f9a6364912 llama : require first token to be BOS (#1303) Georgi Gerganov 2023-05-08 17:41:54 +03:00
  • ac1879ff60 convert: add ability to convert safetensors files (#1276) ubik2 2023-05-08 04:54:26 -07:00
  • 95078cc554 convert: add ability to convert safetensors files (#1276) ubik2 2023-05-08 04:54:26 -07:00
  • 2e69d4ed6e Documented CUDA reproducibility, added warning (#1346) Johannes Gäßler 2023-05-08 02:42:01 +02:00
  • 1f48b0abcf Documented CUDA reproducibility, added warning (#1346) Johannes Gäßler 2023-05-08 02:42:01 +02:00
  • 61a2558d6d CI: add Windows CLBlast and OpenBLAS builds (#1277) Henri Vasserman 2023-05-07 14:20:09 +03:00
  • e1295513a4 CI: add Windows CLBlast and OpenBLAS builds (#1277) Henri Vasserman 2023-05-07 14:20:09 +03:00
  • 816beef099 ggml : Allow usage of CLBlast alongside Accelerate.framework (#1336) swittk 2023-05-07 10:03:23 +07:00
  • 1b0fd45465 ggml : Allow usage of CLBlast alongside Accelerate.framework (#1336) swittk 2023-05-07 10:03:23 +07:00
  • 80c538b16d Remove default arguments from sampling functions (#1343) Jed Fox 2023-05-06 17:01:47 -04:00
  • 3924088512 Remove default arguments from sampling functions (#1343) Jed Fox 2023-05-06 17:01:47 -04:00
  • fe86ad336b makefile: automatic Arch Linux detection (#1332) DaniAndTheWeb 2023-05-05 23:57:14 +02:00
  • 173d0e6419 makefile: automatic Arch Linux detection (#1332) DaniAndTheWeb 2023-05-05 23:57:14 +02:00
  • 12b54596b3 ci : add cublas to windows release (#1271) Erik Scholz 2023-05-05 22:56:09 +02:00
  • a3b85b28da ci : add cublas to windows release (#1271) Erik Scholz 2023-05-05 22:56:09 +02:00
  • f415e2f7e5 readme: add missing info (#1324) Pavol Rusnak 2023-05-05 16:43:36 +02:00
  • 921dcee00a readme: add missing info (#1324) Pavol Rusnak 2023-05-05 16:43:36 +02:00
  • 75f22466f9 Fix for OpenCL / clbast builds on macOS. (#1329) Ionoclast Laboratories 2023-05-05 08:18:21 -04:00
  • 2d13786e91 Fix for OpenCL / clbast builds on macOS. (#1329) Ionoclast Laboratories 2023-05-05 08:18:21 -04:00
  • 06aadb7a88 Convert.py @staticmethod (#1327) Benjamin Lecaillon 2023-05-05 02:17:07 +02:00
  • a90e96b266 Convert.py @staticmethod (#1327) Benjamin Lecaillon 2023-05-05 02:17:07 +02:00
  • 8e2c9c0878 quantize: make output filename optional, default to ggml-model-<ftype>.bin (#1301) slaren 2023-05-05 00:58:56 +02:00
  • 94c5652fc0 quantize: make output filename optional, default to ggml-model-<ftype>.bin (#1301) slaren 2023-05-05 00:58:56 +02:00
  • d9e8e1dc92 Wrap exceptions in std::exception to verbose output on exception. (#1316) Ivan Stepanov 2023-05-04 19:56:27 +03:00
  • 34d9f22f44 Wrap exceptions in std::exception to verbose output on exception. (#1316) Ivan Stepanov 2023-05-04 19:56:27 +03:00
  • 682030ff80 convert: support DT_BF16 tensors (#1309) Ivan Stepanov 2023-05-04 19:54:37 +03:00
  • d3e8093e9b convert: support DT_BF16 tensors (#1309) Ivan Stepanov 2023-05-04 19:54:37 +03:00
  • 38022bc8e5 readme : add OpenBuddy link (#1321) 44670 2023-05-05 00:33:31 +08:00
  • 360cfe5bec readme : add OpenBuddy link (#1321) 44670 2023-05-05 00:33:31 +08:00
  • 4f5a7a7519 main : add --in-suffix option (#1318) 44670 2023-05-04 23:41:12 +08:00
  • 2edbdb0f99 main : add --in-suffix option (#1318) 44670 2023-05-04 23:41:12 +08:00
  • 1cd3ed3524 ggml : change immintrin.h to intrin.h for compatibility (#1307) Ron Jailall 2023-05-04 11:05:59 -04:00
  • 20fbf2a2a0 ggml : change immintrin.h to intrin.h for compatibility (#1307) Ron Jailall 2023-05-04 11:05:59 -04:00
  • 682a9c974e Only escape prompts when used with -e (#1311) DannyDaemonic 2023-05-04 05:08:25 -07:00
  • db1080876a Only escape prompts when used with -e (#1311) DannyDaemonic 2023-05-04 05:08:25 -07:00
  • 74f2f681da Update main's README.md with new features (#1296) DannyDaemonic 2023-05-04 03:02:59 -07:00
  • c65a7fbfa9 Update main's README.md with new features (#1296) DannyDaemonic 2023-05-04 03:02:59 -07:00