ik_llama.cpp

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-02-09 07:50:10 +00:00

Author	SHA1	Message	Date
vodkaslime	bc91cdbd87	readme : fix (#4135 ) * fix: readme * chore: resolve comments * chore: resolve comments	2023-11-30 23:49:21 +02:00
Dawid Wysocki	8f1e6fbde7	readme : fix typo (#4253 ) llama.cpp uses GitHub Actions, not Gitlab Actions.	2023-11-30 23:43:32 +02:00
Peter Sugihara	d119cde4a5	readme : add FreeChat (#4248 )	2023-11-29 09:16:34 +02:00
Kasumi	2cf38d14b2	readme : add Amica to UI list (#4230 )	2023-11-27 19:39:42 +02:00
Georgi Gerganov	6f7d280455	readme : update hot topics	2023-11-26 20:42:51 +02:00
Georgi Gerganov	e5d642885c	readme : update hot topics	2023-11-25 12:02:13 +02:00
Aaryaman Vasishta	92eb4cdab4	readme : use PATH for Windows ROCm (#4195 ) * Update README.md to use PATH for Windows ROCm * Update README.md * Update README.md	2023-11-24 09:52:39 +02:00
Georgi Gerganov	a8e65a6b4c	readme : update hot topics	2023-11-23 13:51:22 +02:00
Aaryaman Vasishta	94da394760	readme : update ROCm Windows instructions (#4122 ) * Update README.md * Update README.md Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com> --------- Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com>	2023-11-20 17:02:46 +02:00
Galunid	d200fc170a	stablelm : StableLM support (#3586 ) * Add support for stablelm-3b-4e1t * Supports GPU offloading of (n-1) layers	2023-11-14 11:17:12 +01:00
Georgi Gerganov	5940637098	readme : update hot topics	2023-11-13 14:18:08 +02:00
Richard Kiss	a05fccf374	Fix some documentation typos/grammar mistakes (#4032 ) * typos * Update examples/parallel/README.md Co-authored-by: Kerfuffle <44031344+KerfuffleV2@users.noreply.github.com> --------- Co-authored-by: Kerfuffle <44031344+KerfuffleV2@users.noreply.github.com>	2023-11-11 23:04:58 -07:00
Georgi Gerganov	534bbd5c14	readme : add notice about #3912	2023-11-02 20:44:12 +02:00
Ian Scrivener	21a26a6dea	readme : remove unsupported node.js library (#3703 ) - https://github.com/Atome-FE/llama-node is quite out of date - doesn't support recent/current llama.cpp functionality	2023-10-22 21:16:43 +03:00
Georgi Gerganov	ede7949722	sampling : refactor init to use llama_sampling_params (#3696 ) * sampling : refactor init to use llama_sampling_params * llama : combine repetition, frequency and presence penalties in 1 call * examples : remove embd-input and gptneox-wip * sampling : rename penalty params + reduce size of "prev" vector * sampling : add llama_sampling_print helper * sampling : hide prev behind API and apply #3661 ggml-ci	2023-10-20 21:07:23 +03:00
Georgi Gerganov	f9bbb76017	readme : update hot topics	2023-10-18 21:44:43 +03:00
BarfingLemurs	2404ccf7ab	readme : update hot-topics & models, detail windows release in usage (#3615 ) * Update README.md * Update README.md * Update README.md * move "Running on Windows" section below "Prepare data and run" --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2023-10-17 21:13:21 +03:00
ldwang	e49cde7ded	readme : add Aquila2 links (#3610 ) Signed-off-by: ldwang <ftgreat@gmail.com> Co-authored-by: ldwang <ftgreat@gmail.com>	2023-10-17 18:52:33 +03:00
Ian Scrivener	3ee11e89e1	typo : it is `--n-gpu-layers` not `--gpu-layers` (#3592 ) fixed a typo in the MacOS Metal run doco	2023-10-12 14:10:50 +03:00
Galunid	a637869df6	Add MPT model to supported models in README.md (#3574 )	2023-10-10 19:02:49 -04:00
Xingchen Song(宋星辰)	8994c485e9	readme : add bloom (#3570 )	2023-10-10 19:28:50 +03:00
BarfingLemurs	3226b5d74b	readme : update models, cuda + ppl instructions (#3510 )	2023-10-06 22:13:36 +03:00
Georgi Gerganov	1ded9d4793	readme : add project status link	2023-10-04 16:50:44 +03:00
slaren	a18aa627fa	llama.cpp : add documentation about rope_freq_base and scale values (#3401 ) * llama.cpp : add documentation about rope_freq_base and scale values * add notice to hot topics	2023-09-29 18:42:32 +02:00
BarfingLemurs	6706639c45	readme : update hot topics + model links (#3399 )	2023-09-29 15:50:35 +03:00
Andrew Duffy	93527803e3	readme : add link to grammars app (#3388 ) * Add link to grammars app per @ggernagov suggestion Adding a sentence in the Grammars section of README to point to grammar app, per https://github.com/ggerganov/llama.cpp/discussions/2494#discussioncomment-7138211 * Update README.md	2023-09-29 14:15:57 +03:00
Pierre Alexandre SCHEMBRI	6580c05d1c	readme : add Mistral AI release 0.1 (#3362 )	2023-09-28 15:13:37 +03:00
BarfingLemurs	9d92d67428	readme : add some recent perplexity and bpw measurements to READMES, link for k-quants (#3340 ) * Update README.md * Update README.md * Update README.md with k-quants bpw measurements	2023-09-27 18:30:36 +03:00
2f38b454	be8fb3dc9b	docs: Fix typo CLBlast_DIR var. (#3330 )	2023-09-25 20:24:52 +02:00
Lee Drake	1e8ebda8ce	Update README.md (#3289 ) * Update README.md * Update README.md Co-authored-by: slaren <slarengh@gmail.com> --------- Co-authored-by: slaren <slarengh@gmail.com>	2023-09-21 21:00:24 +02:00
Georgi Gerganov	7eca40bf4b	readme : update hot topics	2023-09-20 20:48:22 +03:00
Johannes Gäßler	94a0ea6e76	CUDA: enable peer access between devices (#2470 )	2023-09-17 16:37:53 +02:00
dylan	61cead9a5b	docker : add gpu image CI builds (#3103 ) Enables the GPU enabled container images to be built and pushed alongside the CPU containers. Co-authored-by: canardleteer <eris.has.a.dad+github@gmail.com>	2023-09-14 19:47:00 +03:00
Ikko Eltociear Ashimine	8db00f111b	readme : fix typo (#3043 ) * readme : fix typo acceleation -> acceleration * Update README.md --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2023-09-08 19:04:32 +03:00
Georgi Gerganov	e0997d46fe	readme : update hot tpoics	2023-09-08 18:18:04 +03:00
Yui	b897d9e7a6	Update deprecated GGML TheBloke links to GGUF (#3079 )	2023-09-08 12:32:55 +02:00
Georgi Gerganov	8e49675a7b	build : on Mac OS enable Metal by default (#2901 ) * build : on Mac OS enable Metal by default * make : try to fix build on Linux * make : move targets back to the top * make : fix target clean * llama : enable GPU inference by default with Metal * llama : fix vocab_only logic when GPU is enabled * common : better `n_gpu_layers` assignment * readme : update Metal instructions * make : fix merge conflict remnants * gitignore : metal	2023-09-04 22:26:24 +03:00
Ido S	a8b85ea614	docs : add `catai` to `README.md` (#2967 )	2023-09-03 08:50:51 +03:00
bandoti	626da973c4	readme : update clblast instructions (#2903 ) * Update Windows CLBlast instructions * Update Windows CLBlast instructions * Remove trailing whitespace	2023-09-02 15:53:18 +03:00
Konstantin Herud	80569510d8	docs : add java-llama.cpp to README.md (#2935 )	2023-09-01 16:36:14 +03:00
Gilad S	b138430852	docs : add `node-llama-cpp` to `README.md` (#2885 )	2023-08-30 11:40:12 +03:00
slaren	0cfa148196	remove outdated references to -eps and -gqa from README (#2881 )	2023-08-29 23:17:34 +02:00
Jhen-Jie Hong	910d0f2660	readme : add react-native binding (#2869 )	2023-08-29 12:30:10 +03:00
Georgi Gerganov	a40c1d87ff	readme : fix headings	2023-08-27 15:52:34 +03:00
Georgi Gerganov	5a7aaa5f74	readme : update hot topics	2023-08-27 14:44:35 +03:00
Henri Vasserman	984b7495ed	ROCm Port (#1087 ) * use hipblas based on cublas * Update Makefile for the Cuda kernels * Expand arch list and make it overrideable * Fix multi GPU on multiple amd architectures with rocblas_initialize() (#5) * add hipBLAS to README * new build arg LLAMA_CUDA_MMQ_Y * fix half2 decomposition * Add intrinsics polyfills for AMD * AMD assembly optimized __dp4a * Allow overriding CC_TURING * use "ROCm" instead of "CUDA" * ignore all build dirs * Add Dockerfiles * fix llama-bench * fix -nommq help for non CUDA/HIP --------- Co-authored-by: YellowRoseCx <80486540+YellowRoseCx@users.noreply.github.com> Co-authored-by: ardfork <134447697+ardfork@users.noreply.github.com> Co-authored-by: funnbot <22226942+funnbot@users.noreply.github.com> Co-authored-by: Engininja2 <139037756+Engininja2@users.noreply.github.com> Co-authored-by: Kerfuffle <44031344+KerfuffleV2@users.noreply.github.com> Co-authored-by: jammm <2500920+jammm@users.noreply.github.com> Co-authored-by: jdecourval <7315817+jdecourval@users.noreply.github.com>	2023-08-25 12:09:42 +03:00
Georgi Gerganov	fc84c48240	readme : fix link	2023-08-23 23:44:19 +03:00
Georgi Gerganov	1fac3b2c0b	minor : fix trailing whitespace	2023-08-23 23:43:00 +03:00
Georgi Gerganov	eb5bf4480c	readme : update hot topics	2023-08-23 23:41:16 +03:00
Evan Jones	943bf8930c	docs : add grammar docs (#2701 ) * docs : add grammar docs * tweaks to grammar guide * rework GBNF example to be a commented grammar	2023-08-22 21:01:57 -04:00

1 2 3 4 5

214 Commits