Files
ik_llama.cpp/ggml
Iwan Kawrakow f8a7dadbb7 FlashMLA: it now works with iqk
I had forgotten to divide the Q stride by sizeof(float) and
that's why, very cobfusingly, it was working for TG but not for PP.
2025-03-03 07:47:08 +02:00
..
2024-07-27 07:55:01 +02:00
2025-03-03 07:47:08 +02:00
2024-07-27 07:55:01 +02:00