Mirror of https://github.com/ikawrakow/ik_llama.cpp.git, synced 2026-05-11 08:30:19 +00:00
Most likely not all MLA variants work. I no longer remember why I added the q8_0 cpy that transposes the tensor, but if it is really needed, it is now missing. q6_0 is also missing.