mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-03-07 20:40:02 +00:00
Fix README.md
This commit is contained in:
@@ -7,6 +7,7 @@ in each ubatch-sized window. Only a single token sequence is used.
|
||||
The benchmark steps are:
|
||||
|
||||
for each ubatch-sized window in context:
|
||||
|
||||
1. generate ubatch/4 tokens (not the whole window to save some time)
|
||||
2. measure generation performance
|
||||
3. remove generated tokens from KV cache
|
||||
|
||||
Reference in New Issue
Block a user