Files
ik_llama.cpp/ggml
Iwan Kawrakow 4b9350b353 Adding bf16_r8
Small performance gain compared to bf16 - 258 t/s vs 234 t/s.
I guess, this is still sub-obtimal.
2024-12-14 17:28:29 +02:00
..
2024-07-27 07:55:01 +02:00
2024-12-14 15:20:05 +02:00
2024-12-14 17:28:29 +02:00
2024-07-27 07:55:01 +02:00
2024-10-04 14:43:26 +03:00