ik_llama.cpp

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-04-22 23:49:23 +00:00

Author	SHA1	Message	Date
Thomas Klausner	d3ed019b74	Add NetBSD support. (#90)	2023-03-13 18:40:54 +02:00
Georgi Gerganov	c1eebc2a25	Use vdotq_s32 to improve performance (#67) * 10% performance boost on ARM * Back to original change	2023-03-13 18:36:44 +02:00
Georgi Gerganov	49a8c7675b	Revert "10% performance boost on ARM" This reverts commit `113a9e83eb`. There are some reports for illegal instruction. Moved this stuff to vdotq_s32 branch until resolve	2023-03-13 01:28:08 +02:00
Georgi Gerganov	c47fa0ea5e	Check for vdotq_s32 availability	2023-03-13 01:21:03 +02:00
Georgi Gerganov	c00675331e	Ammend to previous commit - forgot to update non-QRDMX branch	2023-03-13 01:05:24 +02:00
Georgi Gerganov	f48b7628ea	10% performance boost on ARM	2023-03-13 00:56:10 +02:00
Sebastián A	fde84afbed	Windows fixes (#31) * Apply fixes suggested to build on windows Issue: https://github.com/ggerganov/llama.cpp/issues/22 * Remove unsupported VLAs * MSVC: Remove features that are only available on MSVC C++20. * Fix zero initialization of the other fields. * Change the use of vector for stack allocations.	2023-03-12 22:15:00 +02:00
Georgi Gerganov	cc0f26bef3	Add AVX2 support for x86 architectures thanks to @Const-me !	2023-03-11 18:04:25 +02:00
Georgi Gerganov	a2799521b9	Support all LLaMA models + change Q4_0 quantization storage	2023-03-11 11:28:30 +02:00
Georgi Gerganov	4b5b86d6ee	Initial release	2023-03-10 20:56:40 +02:00