ik_llama.cpp/sgemm.cpp at 64da6f7a971eda1030f0f641ba6b43dca6d0dcc6

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-03-05 19:40:19 +00:00

Files

Kawrakow e0b52e14a6 iqk_mul_mat: fp16 implementation for AVX2

This simple implementation beats jart's tiniBLAS by a
small margin (143 t/s vs 137 t/s for PP-512, TG is
4.75 t/s, so exactly the same as ggml).

2024-06-22 12:02:50 +03:00

View Raw