ik_llama.cpp/src
Kawrakow 996e77047a Avoid ggml_get_rows if not necessary (#1160)
* Copy reduce result to other GPUs if necessary

* Avoid ggml_get_rows for TG

* For the output ops use the result of the split that ran on the main GPU

* More models
2026-01-20 15:38:21 +02:00