Files
ik_llama.cpp/ggml
Iwan Kawrakow 486dc0b3ee CUDA graphs - seems to be working
Likely not all MLA variants are working.
I no longer remember why I added the q8_0 cpy that
transposes the tensor, but if really needed, this is now
missing. Also missing is q6_0.
2025-08-14 12:00:42 +03:00
..
2024-07-27 07:55:01 +02:00
2025-08-14 12:00:42 +03:00
2024-07-27 07:55:01 +02:00