ik_llama.cpp

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-01-26 17:20:01 +00:00

Files

Georgi Gerganov c40b5b3d59 Introduce C-style API (#370 )

* Major refactoring - introduce C-style API

* Clean up

* Add <cassert>

* Add <iterator>

* Add <algorithm> ....

* Fix timing reporting and accumulation

* Measure eval time only for single-token calls

* Change llama_tokenize return meaning

2023-03-22 07:32:36 +02:00

ggml-vocab.bin

Introduce C-style API (#370 )

2023-03-22 07:32:36 +02:00