Fix README.md

Saood Karim
2025-03-25 09:15:19 -05:00
parent 2fd035b43f
commit 7a6f681daf

@@ -7,6 +7,7 @@ in each ubatch-sized window. Only a single token sequence is used.
The benchmark steps are:
for each ubatch-sized window in context:
1. generate ubatch/4 tokens (not the whole window to save some time)
2. measure generation performance
3. remove generated tokens from KV cache