Commit Graph

  • fa68fa5932 fix #1224 reverse prompt and multi line (#1297) Tomas 2023-05-04 17:02:30 +07:00
  • f647ce040f fix #1224 reverse prompt and multi line (#1297) Tomas 2023-05-04 17:02:30 +07:00
  • 33002a8e03 ggml : vectorize Q8_0 quantization Georgi Gerganov 2023-05-03 23:24:20 +03:00
  • 799fdc1b5d ggml : vectorize Q8_0 quantization Georgi Gerganov 2023-05-03 23:24:20 +03:00
  • 0cb9cc1e2f examples : read chat prompts from a template file (#1196) khimaros 2023-05-03 10:58:11 -07:00
  • 6daa09d879 examples : read chat prompts from a template file (#1196) khimaros 2023-05-03 10:58:11 -07:00
  • aaa2ee9e37 minor : fix whitespaces (#1302) Georgi Gerganov 2023-05-03 20:09:42 +03:00
  • bca9ad938a minor : fix whitespaces (#1302) Georgi Gerganov 2023-05-03 20:09:42 +03:00
  • d989735c4d minor : fix trailing whitespaces Georgi Gerganov 2023-05-03 18:43:23 +03:00
  • e2a937ca6a minor : fix trailing whitespaces Georgi Gerganov 2023-05-03 18:43:23 +03:00
  • 2290898428 scripts : platform independent script to verify sha256 checksums (#1203) KASR 2023-05-03 17:31:28 +02:00
  • b0c71c7b6d scripts : platform independent script to verify sha256 checksums (#1203) KASR 2023-05-03 17:31:28 +02:00
  • 46a50dd4bf examples : various prompt and example fixes (#1298) CRD716 2023-05-03 10:26:47 -05:00
  • a8a2efdc81 examples : various prompt and example fixes (#1298) CRD716 2023-05-03 10:26:47 -05:00
  • 5b067c42a2 llama : only copy used KV cache in get / set state (#1272) Evan Jones 2023-05-02 22:26:13 -04:00
  • e216aa0463 llama : only copy used KV cache in get / set state (#1272) Evan Jones 2023-05-02 22:26:13 -04:00
  • 0f15df3c9e Process escape sequences given in prompts (#1173) DannyDaemonic 2023-05-02 18:46:20 -07:00
  • 2485d7a4d3 Process escape sequences given in prompts (#1173) DannyDaemonic 2023-05-02 18:46:20 -07:00
  • 998d3bf39e Handle signals properly on Windows (#1123) DannyDaemonic 2023-05-02 18:01:57 -07:00
  • 13b0c68ed7 Handle signals properly on Windows (#1123) DannyDaemonic 2023-05-02 18:01:57 -07:00
  • 882fa33772 Call sh on build-info.sh (#1294) DannyDaemonic 2023-05-02 17:52:35 -07:00
  • 55bc5f0900 Call sh on build-info.sh (#1294) DannyDaemonic 2023-05-02 17:52:35 -07:00
  • d3c3b947ab fix build-info.h for git submodules (#1289) kuvaus 2023-05-03 03:43:43 +03:00
  • 9daff419f6 fix build-info.h for git submodules (#1289) kuvaus 2023-05-03 03:43:43 +03:00
  • e859f73be1 fix missing parameters in llama_init_from_gpt_params (#1293) slaren 2023-05-03 01:36:45 +02:00
  • bf4b22ffe4 fix missing parameters in llama_init_from_gpt_params (#1293) slaren 2023-05-03 01:36:45 +02:00
  • 66990fbcc9 examples : add llama_init_from_gpt_params() common function (#1290) Ron Evans 2023-05-02 22:39:51 +02:00
  • 67c77799e0 examples : add llama_init_from_gpt_params() common function (#1290) Ron Evans 2023-05-02 22:39:51 +02:00
  • 922abdc9c8 llama : fix compile warnings Georgi Gerganov 2023-05-02 23:09:08 +03:00
  • 0e6cbff1b7 llama : fix compile warnings Georgi Gerganov 2023-05-02 23:09:08 +03:00
  • 29b3bf28a8 ggml : fix 32-bit ARM Georgi Gerganov 2023-05-02 22:14:50 +03:00
  • 5d5817ca60 ggml : fix 32-bit ARM Georgi Gerganov 2023-05-02 22:14:50 +03:00
  • ae3d46ba30 examples : improve vertical alignment of a few variables (#1286) Ron Evans 2023-05-02 19:53:52 +02:00
  • 8c9be35ff9 examples : improve vertical alignment of a few variables (#1286) Ron Evans 2023-05-02 19:53:52 +02:00
  • 84e8f840b8 ggml : fix ppc64le build error and make cmake detect Power processors (#1284) Marvin Gießing 2023-05-02 18:42:16 +02:00
  • cc0bb7235c ggml : fix ppc64le build error and make cmake detect Power processors (#1284) Marvin Gießing 2023-05-02 18:42:16 +02:00
  • 19855ef65a llama : allow 0 as a seed number. (#1275) Robert Brisita 2023-05-02 12:23:44 -04:00
  • 2bb992f034 llama : allow 0 as a seed number. (#1275) Robert Brisita 2023-05-02 12:23:44 -04:00
  • 890730a4de main : switch input_noecho to input_echo to remove negation (#979) Ron Evans 2023-05-02 18:13:26 +02:00
  • e2cd506999 main : switch input_noecho to input_echo to remove negation (#979) Ron Evans 2023-05-02 18:13:26 +02:00
  • b828c402b3 ggml: add names to tensors (#1268) slaren 2023-05-02 16:03:00 +02:00
  • 2d099e5193 ggml: add names to tensors (#1268) slaren 2023-05-02 16:03:00 +02:00
  • 8f29a9ed07 Add git-based build information for better issue tracking (#1232) DannyDaemonic 2023-05-01 09:23:47 -07:00
  • f4cef87edf Add git-based build information for better issue tracking (#1232) DannyDaemonic 2023-05-01 09:23:47 -07:00
  • 589b8dfe52 cuBLAS: refactor and optimize f16 mat mul performance (#1259) slaren 2023-05-01 18:11:07 +02:00
  • 58b367c2d7 cuBLAS: refactor and optimize f16 mat mul performance (#1259) slaren 2023-05-01 18:11:07 +02:00
  • 772f8bc9bb llama : update stubs for systems without mmap and mlock (#1266) xloem 2023-05-01 08:58:51 -04:00
  • ea3a0ad6b6 llama : update stubs for systems without mmap and mlock (#1266) xloem 2023-05-01 08:58:51 -04:00
  • d9d626efba ggml : fix ggml_used_mem() (#1264) Kerfuffle 2023-05-01 05:56:07 -06:00
  • 2bdc09646d ggml : fix ggml_used_mem() (#1264) Kerfuffle 2023-05-01 05:56:07 -06:00
  • e1103bdf91 llama : fix session load / save (#1263) Georgi Gerganov 2023-05-01 14:54:59 +03:00
  • 70269cae37 llama : fix session load / save (#1263) Georgi Gerganov 2023-05-01 14:54:59 +03:00
  • 83109d07fd cuBLAS: fall back to pageable memory if pinned alloc fails (#1233) slaren 2023-05-01 13:32:22 +02:00
  • b925f1f1b0 cuBLAS: fall back to pageable memory if pinned alloc fails (#1233) slaren 2023-05-01 13:32:22 +02:00
  • b559cfc9a4 llama : let context be const when accessing const data (#1261) Alex Klinkhamer 2023-05-01 00:24:20 -07:00
  • 90b19bd6ee llama : let context be const when accessing const data (#1261) Alex Klinkhamer 2023-05-01 00:24:20 -07:00
  • 96c1b57c16 ggml : fix UB (int << 31) Georgi Gerganov 2023-04-30 22:28:51 +03:00
  • 7ff0dcd320 ggml : fix UB (int << 31) Georgi Gerganov 2023-04-30 22:28:51 +03:00
  • db80eecbb0 build: add armv{6,7,8} support to cmake (#1251) Pavol Rusnak 2023-04-30 20:48:38 +02:00
  • 6f79699286 build: add armv{6,7,8} support to cmake (#1251) Pavol Rusnak 2023-04-30 20:48:38 +02:00
  • 90c41e9a61 common : better default number of threads (#934) jon-chuang 2023-04-30 14:41:35 -04:00
  • a5d30b1f53 common : better default number of threads (#934) jon-chuang 2023-04-30 14:41:35 -04:00
  • ec68c5101d ggml : add CLBlast q5_0, q5_1, q8_0 dequant kernels (#1225) 0cc4m 2023-04-30 20:34:52 +02:00
  • 76a884920a ggml : add CLBlast q5_0, q5_1, q8_0 dequant kernels (#1225) 0cc4m 2023-04-30 20:34:52 +02:00
  • eac5ecc6d1 ggml : add Q5 WASM SIMD + GGML_FTYPE Georgi Gerganov 2023-04-30 19:07:00 +03:00
  • 6bc4400e67 ggml : add Q5 WASM SIMD + GGML_FTYPE Georgi Gerganov 2023-04-30 19:07:00 +03:00
  • 8ceb2442b7 Various fixes to mat_mul benchmark (#1253) Stephan Walter 2023-04-30 12:32:37 +00:00
  • f0d70f147d Various fixes to mat_mul benchmark (#1253) Stephan Walter 2023-04-30 12:32:37 +00:00
  • bbe6fce16a ggml : fix labels for GGML_OP_ALIBI Georgi Gerganov 2023-04-30 10:25:46 +03:00
  • 3e5aa8a1c4 ggml : fix labels for GGML_OP_ALIBI Georgi Gerganov 2023-04-30 10:25:46 +03:00
  • 986f07b45e ggml : fix 32-bit ARM NEON Georgi Gerganov 2023-04-29 21:34:23 +03:00
  • c3ca7a5f05 ggml : fix 32-bit ARM NEON Georgi Gerganov 2023-04-29 21:34:23 +03:00
  • 3d0b3fb66f ggml : use vzip instead of vuzp for consistency Georgi Gerganov 2023-04-29 21:12:56 +03:00
  • e8c051611a ggml : use vzip instead of vuzp for consistency Georgi Gerganov 2023-04-29 21:12:56 +03:00
  • 36bf79f4ba ggml : fix visibility and unused warnings Georgi Gerganov 2023-04-29 19:28:36 +03:00
  • 0b5a935099 ggml : fix visibility and unused warnings Georgi Gerganov 2023-04-29 19:28:36 +03:00
  • 79baa0ef36 ggml : fix #if for f32_f32 mul_mat (CLBlast) (#1229) Georgi Gerganov 2023-04-29 18:43:42 +03:00
  • ec728e44d7 ggml : fix #if for f32_f32 mul_mat (CLBlast) (#1229) Georgi Gerganov 2023-04-29 18:43:42 +03:00
  • e40aa5185e ggml : adjust mul_mat_f16 work memory (#1226) Georgi Gerganov 2023-04-29 18:43:28 +03:00
  • 214b6a3570 ggml : adjust mul_mat_f16 work memory (#1226) Georgi Gerganov 2023-04-29 18:43:28 +03:00
  • 91dc074524 build : fix reference to old llama_util.h Georgi Gerganov 2023-04-29 13:53:12 +03:00
  • 305eb5afd5 build : fix reference to old llama_util.h Georgi Gerganov 2023-04-29 13:53:12 +03:00
  • d483156c17 examples : fix save-load-state + rename llama-util.h Georgi Gerganov 2023-04-29 13:48:11 +03:00
  • 84ca9c2ecf examples : fix save-load-state + rename llama-util.h Georgi Gerganov 2023-04-29 13:48:11 +03:00
  • 1fb89c9913 common : change default parameters to pre-#1126 (#1223) Georgi Gerganov 2023-04-29 09:51:06 +03:00
  • 334637e43e common : change default parameters to pre-#1126 (#1223) Georgi Gerganov 2023-04-29 09:51:06 +03:00
  • d00422bd62 llama : new sampling algorithms (#1126) Ivan Stepanov 2023-04-29 08:34:41 +03:00
  • dd7eff57d8 llama : new sampling algorithms (#1126) Ivan Stepanov 2023-04-29 08:34:41 +03:00
  • 796c107b37 cuBLAS: use host pinned memory and dequantize while copying (#1207) slaren 2023-04-29 02:04:18 +02:00
  • 7fc50c051a cuBLAS: use host pinned memory and dequantize while copying (#1207) slaren 2023-04-29 02:04:18 +02:00
  • 19f0b58347 cuBLAS: non-contiguous tensor support (#1215) Henri Vasserman 2023-04-29 02:31:56 +03:00
  • b1ee8f59b4 cuBLAS: non-contiguous tensor support (#1215) Henri Vasserman 2023-04-29 02:31:56 +03:00
  • 4aeea780bc Remove Q4_3 which is no better than Q5 (#1218) Stephan Walter 2023-04-28 23:10:43 +00:00
  • 36d19a603b Remove Q4_3 which is no better than Q5 (#1218) Stephan Walter 2023-04-28 23:10:43 +00:00
  • 3a8a3891b3 readme : update hot topics Georgi Gerganov 2023-04-28 21:32:52 +03:00
  • 7f15c5c477 readme : update hot topics Georgi Gerganov 2023-04-28 21:32:52 +03:00
  • b4e4b6fde1 ggml : sync ggml (ggml_alibi) Georgi Gerganov 2023-04-28 20:37:43 +03:00
  • 55390bcaf2 ggml : sync ggml (ggml_alibi) Georgi Gerganov 2023-04-28 20:37:43 +03:00
  • c1343a8d6b examples : add Jeopardy example (#1168) CRD716 2023-04-28 11:13:33 -05:00
  • 5fba3c016b examples : add Jeopardy example (#1168) CRD716 2023-04-28 11:13:33 -05:00