mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-04-30 19:31:48 +00:00
Update sweep bench (deprecating .jsonl support) (#289)
* Update sweep bench (deprecating .jsonl support)
* Fix README.md
@@ -7,6 +7,7 @@ in each ubatch-sized window. Only a single token sequence is used.
The benchmark steps are:

for each ubatch-sized window in context:

    1. generate ubatch/4 tokens (not the whole window to save some time)
    2. measure generation performance
    3. remove generated tokens from KV cache
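
The steps listed above can be sketched as a toy loop. This is only an illustration under stated assumptions, not the benchmark's actual code: the KV cache is modeled as a plain Python list, `sweep` and `decode_one` are hypothetical names, and the line that advances the context to the next window is an assumption, since the excerpt does not show how the context is filled.

```python
import time


def sweep(n_ctx, n_ubatch, decode_one):
    """Toy model of the listed steps; all names here are hypothetical."""
    kv_cache = []  # stand-in for the real KV cache: one entry per token
    results = []
    for _start in range(0, n_ctx, n_ubatch):
        n_gen = n_ubatch // 4   # only a quarter of the window, to save time
        base = len(kv_cache)    # cache size before generation starts

        # 1. generate ubatch/4 tokens
        t0 = time.perf_counter()
        for _ in range(n_gen):
            kv_cache.append(decode_one(kv_cache))
        elapsed = time.perf_counter() - t0

        # 2. measure generation performance at this context depth
        results.append({"n_kv": base, "n_gen": n_gen, "t_gen_s": elapsed})

        # 3. remove generated tokens from the KV cache
        del kv_cache[base:]

        # Assumption (not shown in the excerpt): move on to the next window
        # by filling the cache with placeholder tokens for this window.
        kv_cache.extend([0] * n_ubatch)
    return results


# Usage with a dummy decoder that just returns the current cache length.
results = sweep(n_ctx=2048, n_ubatch=512, decode_one=lambda cache: len(cache))
```

Each entry in `results` pairs a context depth (`n_kv`) with the time spent generating at that depth, which is the kind of per-window measurement the steps describe.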