ik_llama.cpp/llama.cpp at 8b531d3640d6770c2ed5ad790eb55f844e3b452a

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-03-12 15:00:11 +00:00

Files

Paul Tsochantaris c7fa729d3a llama : do not cap thread count when MoE on CPU (#5419 )

* Not capping thread count when MoE inference is running on CPU

* Whitespace

2024-02-09 12:48:06 +02:00

View Raw