Files
ik_llama.cpp/llama.cpp
Paul Tsochantaris c7fa729d3a llama : do not cap thread count when MoE on CPU (#5419)
* Not capping thread count when MoE inference is running on CPU

* Whitespace
2024-02-09 12:48:06 +02:00

460 KiB