ik_llama.cpp/ggml
Kawrakow 6a5c180be9 Fix bf16 additions on CUDA arch < Ampere (#1164)
* Fix bf16 additions on CUDA arch < Ampere

* Prevent using NCCL if graph reduce type is bf16 and arch < AMPERE
2026-01-19 12:27:52 +02:00