ik_llama.cpp/src
Kawrakow cb30f8e057 Merge Q and K into a single tensor (#892)
* Merge Q and K into a single tensor

* Make V mul mat follow QK mul mat

so they can be fused, which gives slightly better TG performance.

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
2025-11-05 10:54:36 +02:00