Files
ik_llama.cpp/ggml/src
Kawrakow 63d0389e18 WIP split mode attn
Works for LlaMA models, but not for GLM-4.5.
Doesn't seem to improve performance, so I guess no point in trying to
fix it.
2025-12-01 09:34:14 +00:00
..
2025-11-30 18:05:13 +00:00
2025-11-24 06:55:14 +01:00
2024-07-27 07:55:01 +02:00
2024-07-27 07:55:01 +02:00
2025-11-24 06:55:14 +01:00
2024-07-27 07:55:01 +02:00
2025-12-01 09:34:14 +00:00
2025-08-09 08:40:18 +03:00
2025-08-09 08:40:18 +03:00
2025-08-09 08:40:18 +03:00
2025-08-27 08:03:47 +03:00
2025-11-30 18:05:13 +00:00