Commit Graph

  • 65702a083e k-quants : fix build on armv7 (android only) (#2920) Jhen-Jie Hong 2023-09-02 20:23:45 +08:00
  • 21f3d1be86 k-quants : fix build on armv7 (android only) (#2920) Jhen-Jie Hong 2023-09-02 20:23:45 +08:00
  • beed103e3d server : avoid aniprompt in probabilities of final response (#2849) Jhen-Jie Hong 2023-09-02 08:31:46 +08:00
  • 571083f508 server : avoid aniprompt in probabilities of final response (#2849) Jhen-Jie Hong 2023-09-02 08:31:46 +08:00
  • 26205b02a4 cuda : vsubss4 for older versions of ROCm/clang (#2942) Engininja2 2023-09-01 15:33:19 -06:00
  • f04d002844 cuda : vsubss4 for older versions of ROCm/clang (#2942) Engininja2 2023-09-01 15:33:19 -06:00
  • 0e58794306 readme : quick start command fix (#2908) ZHAOKAI WANG 2023-09-01 22:06:44 +08:00
  • 69fdbb9abc readme : quick start command fix (#2908) ZHAOKAI WANG 2023-09-01 22:06:44 +08:00
  • 2ac8bf40d0 Allow quantize to only copy tensors, some other improvements (#2931) Kerfuffle 2023-09-01 08:02:48 -06:00
  • 5d6f19f16b Allow quantize to only copy tensors, some other improvements (#2931) Kerfuffle 2023-09-01 08:02:48 -06:00
  • 7166fdca0f llama2c : rename function Georgi Gerganov 2023-09-01 17:00:40 +03:00
  • 0d58936686 llama2c : rename function Georgi Gerganov 2023-09-01 17:00:40 +03:00
  • 6fd8d848b3 make : use unaligned vector moves on MinGW (#2945) Cebtenzzre 2023-09-01 09:53:14 -04:00
  • 6c9c23429b make : use unaligned vector moves on MinGW (#2945) Cebtenzzre 2023-09-01 09:53:14 -04:00
  • 03c5668102 minor : add const qualifiers (#2853) m3ndax 2023-09-01 15:47:27 +02:00
  • ee8654bcd0 minor : add const qualifiers (#2853) m3ndax 2023-09-01 15:47:27 +02:00
  • 80569510d8 docs : add java-llama.cpp to README.md (#2935) Konstantin Herud 2023-09-01 15:36:14 +02:00
  • 49bb9cbe0f docs : add java-llama.cpp to README.md (#2935) Konstantin Herud 2023-09-01 15:36:14 +02:00
  • a512948d93 build : fix most gcc and clang warnings (#2861) Cebtenzzre 2023-09-01 09:34:50 -04:00
  • ef15649972 build : fix most gcc and clang warnings (#2861) Cebtenzzre 2023-09-01 09:34:50 -04:00
  • 5b374b4cfe examples : add C grammar (#2357) Ben Siraphob 2023-09-01 09:32:14 -04:00
  • d8d6977f48 examples : add C grammar (#2357) Ben Siraphob 2023-09-01 09:32:14 -04:00
  • fcaf9592ce ggml : add RISC-V vector intrinsics support (#2929) Tameem 2023-09-01 18:27:40 +05:00
  • 5aec2cfaac ggml : add RISC-V vector intrinsics support (#2929) Tameem 2023-09-01 18:27:40 +05:00
  • 318671a698 metal : slight speed-up for add and mul kernels (#2917) Georgi Gerganov 2023-09-01 13:42:41 +03:00
  • 13268c5331 metal : slight speed-up for add and mul kernels (#2917) Georgi Gerganov 2023-09-01 13:42:41 +03:00
  • 2f55c84496 logs : fix mingw-like builds (fixes #2898) (#2911) staviq 2023-09-01 11:07:06 +02:00
  • 4dcd47d71d logs : fix mingw-like builds (fixes #2898) (#2911) staviq 2023-09-01 11:07:06 +02:00
  • a108b6ee7c llama2c : fix segfault and alloc-dealloc-mismatch (#2913) Cebtenzzre 2023-09-01 05:03:49 -04:00
  • 18705a30ef llama2c : fix segfault and alloc-dealloc-mismatch (#2913) Cebtenzzre 2023-09-01 05:03:49 -04:00
  • eb01667ea3 metal: somewhat faster f16 x f32 matrix multiply kernel (#2951) Kawrakow 2023-09-01 11:15:57 +03:00
  • e8d9158925 metal: somewhat faster f16 x f32 matrix multiply kernel (#2951) Kawrakow 2023-09-01 11:15:57 +03:00
  • 2d2470a0a4 convert : fix another python 3.8 issue (#2949) Cebtenzzre 2023-08-31 22:13:51 -04:00
  • bce1fef328 convert : fix another python 3.8 issue (#2949) Cebtenzzre 2023-08-31 22:13:51 -04:00
  • 25c94905a8 remove convert-llama-7b-pth-to-gguf.py and convert-llama-hf-to-gguf.py (#2906) slaren 2023-09-01 01:32:09 +02:00
  • 528134dd02 remove convert-llama-7b-pth-to-gguf.py and convert-llama-hf-to-gguf.py (#2906) slaren 2023-09-01 01:32:09 +02:00
  • 0b0642e86d scripts: Use local gguf package when running from repo (#2927) Kerfuffle 2023-08-31 16:49:24 -06:00
  • aeefac4ff7 scripts: Use local gguf package when running from repo (#2927) Kerfuffle 2023-08-31 16:49:24 -06:00
  • 16fb7078b6 @vxiiduu's fix for PrefetchVirtualMemory (#2930) DannyDaemonic 2023-08-31 04:21:45 -07:00
  • e8422de39e @vxiiduu's fix for PrefetchVirtualMemory (#2930) DannyDaemonic 2023-08-31 04:21:45 -07:00
  • 6ea4430638 convert : fix python 3.8 support, modernize type annotations (#2916) Cebtenzzre 2023-08-31 01:02:23 -04:00
  • 92d0b751a7 convert : fix python 3.8 support, modernize type annotations (#2916) Cebtenzzre 2023-08-31 01:02:23 -04:00
  • d638c0a4ed CUDA: mul_mat_q=true llama_context_params default (#2912) Johannes Gäßler 2023-08-30 21:46:19 +02:00
  • 8afe228000 CUDA: mul_mat_q=true llama_context_params default (#2912) Johannes Gäßler 2023-08-30 21:46:19 +02:00
  • 41ac8d7ab3 [Docker] fix tools.sh argument passing. (#2884) Henri Vasserman 2023-08-30 19:14:53 +03:00
  • 71d6975559 [Docker] fix tools.sh argument passing. (#2884) Henri Vasserman 2023-08-30 19:14:53 +03:00
  • c0b289f1bc convert.py : use dir name to name the llama Georgi Gerganov 2023-08-30 13:29:40 +03:00
  • b532a69b2f convert.py : use dir name to name the llama Georgi Gerganov 2023-08-30 13:29:40 +03:00
  • 77870051cb examples : fix underscore in beam-search + .gitignore (close #2900) Georgi Gerganov 2023-08-30 12:52:46 +03:00
  • c90d135eb4 examples : fix underscore in beam-search + .gitignore (close #2900) Georgi Gerganov 2023-08-30 12:52:46 +03:00
  • 1bccb29d03 gguf : add workflow for Pypi publishing (#2896) M. Yusuf Sarıgöz 2023-08-30 12:47:40 +03:00
  • 0d1c706181 gguf : add workflow for Pypi publishing (#2896) M. Yusuf Sarıgöz 2023-08-30 12:47:40 +03:00
  • 944a1ab5f1 make : add test and update CI (#2897) alonfaraj 2023-08-30 12:42:51 +03:00
  • 9509294420 make : add test and update CI (#2897) alonfaraj 2023-08-30 12:42:51 +03:00
  • b138430852 docs : add node-llama-cpp to README.md (#2885) Gilad S 2023-08-30 11:40:12 +03:00
  • 35092fb547 docs : add node-llama-cpp to README.md (#2885) Gilad S 2023-08-30 11:40:12 +03:00
  • 99ff88f7f9 convert : various script cleanups/fixes + merges and special token handling (#2842) Kerfuffle 2023-08-30 02:25:50 -06:00
  • dc07dc492e convert : various script cleanups/fixes + merges and special token handling (#2842) Kerfuffle 2023-08-30 02:25:50 -06:00
  • d422840506 llm.vim : stop generation at multiple linebreaks, bind to <F2> (#2879) chaihahaha 2023-08-30 14:50:55 +08:00
  • ad9ddcff6e llm.vim : stop generation at multiple linebreaks, bind to <F2> (#2879) chaihahaha 2023-08-30 14:50:55 +08:00
  • a2588b53e1 main : log file (#2748) staviq 2023-08-30 08:29:32 +02:00
  • 8341a25957 main : log file (#2748) staviq 2023-08-30 08:29:32 +02:00
  • f1fadcda14 tests : add a C compliance test (#2848) Cebtenzzre 2023-08-30 02:20:26 -04:00
  • 849408957c tests : add a C compliance test (#2848) Cebtenzzre 2023-08-30 02:20:26 -04:00
  • 35e64a9fe9 ggml : add view_src and view_offs to ggml_tensor for views (#2874) slaren 2023-08-29 23:24:42 +02:00
  • 06abf8eeba ggml : add view_src and view_offs to ggml_tensor for views (#2874) slaren 2023-08-29 23:24:42 +02:00
  • 0cfa148196 remove outdated references to -eps and -gqa from README (#2881) slaren 2023-08-29 23:17:34 +02:00
  • c03a243abf remove outdated references to -eps and -gqa from README (#2881) slaren 2023-08-29 23:17:34 +02:00
  • 2b2942e04e Tell users attmepting to run perplexity with too few tokens to use more (#2882) Kawrakow 2023-08-29 23:55:45 +03:00
  • fa3582f509 Tell users attmepting to run perplexity with too few tokens to use more (#2882) Kawrakow 2023-08-29 23:55:45 +03:00
  • 600d10f322 10X faster BPE tokenizer (#2876) Kawrakow 2023-08-29 23:55:03 +03:00
  • e37e69dcc3 10X faster BPE tokenizer (#2876) Kawrakow 2023-08-29 23:55:03 +03:00
  • 80d1d0bc3d py : fix "usage" messages (#2873) maddes8cht 2023-08-29 15:51:02 +02:00
  • 53885d7256 py : fix "usage" messages (#2873) maddes8cht 2023-08-29 15:51:02 +02:00
  • 301850d5cd convert.py : fix baichuan7B support (#2870) jameswu2014 2023-08-29 17:48:41 +08:00
  • bcce96ba4d convert.py : fix baichuan7B support (#2870) jameswu2014 2023-08-29 17:48:41 +08:00
  • 910d0f2660 readme : add react-native binding (#2869) Jhen-Jie Hong 2023-08-29 17:30:10 +08:00
  • 74e0caeb82 readme : add react-native binding (#2869) Jhen-Jie Hong 2023-08-29 17:30:10 +08:00
  • 633d1f781c make : fix clang tests build, add missing examples (#2859) Cebtenzzre 2023-08-29 04:42:41 -04:00
  • d4b5e16c32 make : fix clang tests build, add missing examples (#2859) Cebtenzzre 2023-08-29 04:42:41 -04:00
  • 4fd19931fa metal : add option to disable debug logs (close #2764) Georgi Gerganov 2023-08-29 11:33:46 +03:00
  • 3a007648f2 metal : add option to disable debug logs (close #2764) Georgi Gerganov 2023-08-29 11:33:46 +03:00
  • d51d0c510b scripts : add pipefail Georgi Gerganov 2023-08-29 10:50:30 +03:00
  • 611363ac79 scripts : add pipefail Georgi Gerganov 2023-08-29 10:50:30 +03:00
  • 660edd2ea0 added struct to llama_dump_timing_info_yaml's llama_context (#2857) Marcus Dunn 2023-08-28 23:33:27 -07:00
  • 95b6e5212f added struct to llama_dump_timing_info_yaml's llama_context (#2857) Marcus Dunn 2023-08-28 23:33:27 -07:00
  • 72b3a90abb train : mem usage and other improvements (#2439) xaedes 2023-08-28 21:51:47 +02:00
  • 44c117f41e train : mem usage and other improvements (#2439) xaedes 2023-08-28 21:51:47 +02:00
  • ba5ebe118c llama-bench : set locale to utf8 (#2832) slaren 2023-08-28 19:19:18 +02:00
  • 43033b7bb4 llama-bench : set locale to utf8 (#2832) slaren 2023-08-28 19:19:18 +02:00
  • 76c5d8c5a4 YAML result logging + preset script (#2657) Johannes Gäßler 2023-08-28 17:59:39 +02:00
  • 6b73ef1201 YAML result logging + preset script (#2657) Johannes Gäßler 2023-08-28 17:59:39 +02:00
  • 3812bd3ad1 make : fix tests build (#2855) alonfaraj 2023-08-28 18:38:35 +03:00
  • 75fafcbccc make : fix tests build (#2855) alonfaraj 2023-08-28 18:38:35 +03:00
  • bfc3b0b70d llama.cpp : fix wrong vsnprintf call in MS compiler (#2856) grahameth 2023-08-28 17:38:12 +02:00
  • be475f60af llama.cpp : fix wrong vsnprintf call in MS compiler (#2856) grahameth 2023-08-28 17:38:12 +02:00
  • 3a3b7dc79f ggml : tiny ggml_vec_dot_q4_K_q8_K AVX2 improvement (#2819) Ronny Brendel 2023-08-28 14:51:08 +02:00
  • 3af6b86301 ggml : tiny ggml_vec_dot_q4_K_q8_K AVX2 improvement (#2819) Ronny Brendel 2023-08-28 14:51:08 +02:00
  • 6b3c1a820c ggml : sync (mem align to header + conv_transpose_2d fixes + ggml_alloc) (#2852) Georgi Gerganov 2023-08-28 14:24:53 +03:00
  • 35feac6560 ggml : sync (mem align to header + conv_transpose_2d fixes + ggml_alloc) (#2852) Georgi Gerganov 2023-08-28 14:24:53 +03:00