mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-02-26 16:14:10 +00:00
Slightly (~1%) better by keeping qkv accumulators on the stack and only storing the final result into the qkv tensor.
Slightly (~1%) better by keeping qkv accumulators on the stack and only storing the final result into the qkv tensor.