ik_llama.cpp

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-05-11 00:20:19 +00:00

Files

Kawrakow 63d0389e18 WIP split mode attn

Works for LlaMA models, but not for GLM-4.5.
Doesn't seem to improve performance, so I guess no point in trying to
fix it.

2025-12-01 09:34:14 +00:00

2024-07-27 07:55:01 +02:00

WIP

2025-11-30 18:05:12 +00:00

WIP split mode attn

2025-12-01 09:34:14 +00:00

.gitignore

2024-07-27 07:55:01 +02:00

CMakeLists.txt

2025-11-11 10:35:48 +02:00