ik_llama.cpp

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-02-07 06:50:09 +00:00

Files

Kawrakow 6a5c180be9 Fix bf16 additions on CUDA arch < Ampere (#1164 )

* Fix bf16 additions on CUDA arch < Ampere

* Prevent using NCCL if graph reduce type is bf16 and arch < AMPERE

2026-01-19 12:27:52 +02:00

2024-07-27 07:55:01 +02:00

2026-01-10 08:01:22 +02:00

2026-01-19 12:27:52 +02:00

.gitignore

2024-07-27 07:55:01 +02:00

CMakeLists.txt

2026-01-07 18:33:17 +02:00