Files
ik_llama.cpp/ggml
Commit a2f5614529 by Kawrakow, 2025-12-09 10:09:04 +00:00

Try to split offloaded MoE up/gate up

It becomes much slower, despite the graph splits looking OK. Not sure where it bottlenecks.