Files
ik_llama.cpp/src
Kawrakow f65fefa36c Slightly faster TG for split mode "graph" (#1057)
* Rearrange graph nodes

So that we can do graph portions that are the same on 2 or more
GPUs at the same time.

* Separate graph compute implementation for split mode graph

* This is better

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
2025-12-12 07:54:37 +01:00
..
2025-11-30 18:45:38 +01:00
2025-11-30 18:45:38 +01:00
2025-10-30 10:49:48 +02:00
2025-11-30 18:45:38 +01:00
2025-06-19 10:24:53 +03:00
2024-07-27 07:55:01 +02:00
2024-07-27 07:55:01 +02:00