ik_llama.cpp/examples/speculative
Georgi Gerganov a88f9a8ca8 speculative : PoC for speeding-up inference via speculative sampling (#2926)
* speculative : initial example

* speculative : print encoding speed

* speculative : add --draft CLI arg
2023-09-03 15:12:08 +03:00