ik_llama.cpp pull request 144: Slightly faster IQ4_K_R4 on AVX2_Zen4

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-04-30 11:21:56 +00:00

We get PP-512(LLaMA-3.1-8B) = 251 t/s on a Ryzen-7950X and 249 t/s on a Ryzen-5975WX, up from 232 t/s and 227 t/s, respectively.
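
For a sense of scale, the relative gain can be worked out from the figures quoted above. This is only a small arithmetic sketch: it assumes the "232/227" baseline pairs with the CPUs in the order they are mentioned (232 t/s for the Ryzen-7950X, 227 t/s for the Ryzen-5975WX), and the names `results` and `relative_gain` are illustrative, not part of ik_llama.cpp.

```python
# Throughput in tokens per second for PP-512 on LLaMA-3.1-8B, as quoted above.
# Assumption: the baseline figures pair with the CPUs in the order listed.
results = {
    "Ryzen-7950X": (232.0, 251.0),   # (before, after)
    "Ryzen-5975WX": (227.0, 249.0),  # (before, after)
}


def relative_gain(before: float, after: float) -> float:
    """Return the throughput improvement as a percentage of the baseline."""
    return (after - before) / before * 100.0


gains = {cpu: relative_gain(before, after) for cpu, (before, after) in results.items()}

for cpu, gain in gains.items():
    print(f"{cpu}: {gain:.1f}% faster")
```

Under that pairing, the improvement comes to roughly 8% on the Ryzen-7950X and just under 10% on the Ryzen-5975WX, which is consistent with the "slightly faster" in the title.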