Kawrakow
1a4cfbcc53
Merge mainline - Aug 12 2024 ( #17 )
...
* Merge mainline
* Fix after merge
* Remove CI check
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com >
2024-08-12 15:14:32 +02:00
Kawrakow
0ceeb11721
Merge mainline llama.cpp ( #3 )
...
* Merging mainline - WIP
* Merging mainline - WIP
AVX2 and CUDA appear to work.
CUDA performance seems slightly (~1-2%) lower as it is so often
the case with llama.cpp/ggml after some "improvements" have been made.
* Merging mainline - fix Metal
* Remove check
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com >
2024-07-27 07:55:01 +02:00
Georgi Gerganov
a1a23d5f3e
ggml : sync
2024-06-18 09:50:45 +03:00
Georgi Gerganov
89012dad24
sync : ggml
2024-05-29 14:29:52 +03:00
Georgi Gerganov
ef6181c079
sync : ggml
2024-05-15 13:23:41 +03:00
Georgi Gerganov
b0390e32cf
sync : ggml
...
ggml-ci
2024-05-14 19:08:09 +03:00
Georgi Gerganov
51855c8a4d
metal : fix warnings (skipme) ( #0 )
2024-05-11 21:38:13 +03:00
Georgi Gerganov
582ea6f97b
sync : ggml
2024-05-11 21:35:05 +03:00
Georgi Gerganov
fa6762f4a1
sync : ggml
...
ggml-ci
2024-05-11 15:38:34 +03:00
Georgi Gerganov
61e36e1532
sync : ggml
2024-04-09 20:29:06 +03:00
Georgi Gerganov
2c80dc319b
sync : ggml
2024-04-07 17:05:51 +03:00
Georgi Gerganov
0bcde06402
sync : ggml
2024-04-06 18:27:46 +03:00
Georgi Gerganov
2c0917ae95
sync : ggml
2024-03-10 20:10:46 +02:00
Georgi Gerganov
96ea2cb08d
sync : ggml
...
ggml-ci
2024-03-04 20:54:23 +02:00
Georgi Gerganov
80b5fed6d0
sync : ggml
2024-03-04 10:40:04 +02:00
Georgi Gerganov
49255c8c2e
sync : ggml
2024-02-28 11:17:32 +02:00
Georgi Gerganov
dfb0d63843
sync : ggml
2024-02-22 23:21:05 +02:00
Georgi Gerganov
f85c73f1d0
sync : ggml
2024-02-21 16:52:52 +02:00
Georgi Gerganov
0b5d2708a7
sync : ggml ( #5633 )
...
* ggml : fix conv_2d batch mode (ggml/737)
Co-authored-by: bssrdf <bssrdf@gmail.com >
* ggml : compute forward no longer pass src tensors (ggml/729)
* sync : ggml
ggml-ci
---------
Co-authored-by: bssrdf <merlintiger@hotmail.com >
Co-authored-by: bssrdf <bssrdf@gmail.com >
2024-02-21 16:17:10 +02:00
Georgi Gerganov
3a1f76bfc6
sync : ggml
...
ggml-ci
2024-02-19 15:09:43 +02:00
Georgi Gerganov
0ca4e0c14c
sync : ggml ( #5452 )
...
* ggml-alloc : v3 (ggml/727)
* ggml-alloc v3
ggml-ci
* fix ci
ggml-ci
* whisper : check for backend buffer allocation failures
* whisper : avoid leaks when initialization fails
* cleanup
ggml-ci
* style fixes
ggml-ci
* sync : ggml
* update llama.cpp, clip.cpp, export-lora.cpp
* update finetune.cpp, train-text-from-scratch.cpp
ggml-ci
* ggml-backend : reduce alignment to 32 to match gguf and fix mmap
---------
Co-authored-by: slaren <slarengh@gmail.com >
2024-02-12 09:16:06 +02:00
Georgi Gerganov
14d2486167
sync : ggml
2024-02-10 09:30:36 +02:00
Georgi Gerganov
bb6467bd41
sync : ggml ( #0 )
2024-01-30 16:21:57 +02:00
Georgi Gerganov
c1aa8de15f
sync : ggml
2024-01-28 19:48:05 +02:00
Georgi Gerganov
32e84d74f1
sync : ggml
2024-01-27 17:00:24 +02:00
Georgi Gerganov
af72b6c82d
sync : ggml
2024-01-17 20:54:50 +02:00
Georgi Gerganov
97e013bd09
scripts : sync-ggml-am.sh option to skip commits
2024-01-14 11:08:41 +02:00
Georgi Gerganov
2e51f37f77
sync : ggml
2024-01-14 00:14:46 +02:00
Georgi Gerganov
211567ae1a
sync : ggml
2024-01-12 22:02:43 +02:00
Georgi Gerganov
d3d92fe41c
sync : ggml
2024-01-11 09:39:08 +02:00
Georgi Gerganov
7e27e37f26
metal : switch back to default.metallib (ggml/681)
...
ggml-ci
2024-01-05 18:02:06 +02:00
Georgi Gerganov
f2001ff46d
cuda : simplify expression
...
Co-authored-by: slaren <slarengh@gmail.com >
2024-01-03 14:38:38 +02:00
Georgi Gerganov
4ebea0bdce
sync : ggml
...
ggml-ci
2024-01-03 14:38:38 +02:00
Georgi Gerganov
2753b503bc
scripts : print list of sync commits
2023-12-29 15:12:35 +02:00
Georgi Gerganov
ff7ec2ba2c
sync : ggml
2023-12-29 14:56:41 +02:00
Georgi Gerganov
95dad80615
scripts : add sync-ggml-am.sh
2023-12-27 11:44:22 +02:00