ik_llama.cpp

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-03-13 07:20:15 +00:00

Files

Kawrakow b8402290ef Much faster prompt processing for IQ1_S and IQ1_M on ARM_NEON (#553 )

* iq1_s

66.3 t/s -> 168.8 t/s.

* iq1_m

19 t/s -> 163 t/s.

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>

2025-06-24 14:21:37 +02:00

2024-07-27 07:55:01 +02:00

2025-06-08 17:27:00 +03:00

2025-06-24 14:21:37 +02:00

.gitignore

2024-07-27 07:55:01 +02:00

CMakeLists.txt

2025-06-12 19:25:11 +03:00