ik_llama.cpp/github-data/pull_requests/10 - iq4_k_ speedup quantization by a factor of _2.md at e22b2d124635d7f9403b8ee4644e472a29a9b332 - ik_llama.cpp

ikawrakow/ik_llama.cpp

Fork 0

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-04-30 11:21:56 +00:00

Files

Thomas eaa2510a28 Add GitHub data: filename sanitization (#640 )

2025-07-23 13:31:53 +02:00

768 B

Raw Blame History

🔀 #10 - iq4_k: speedup quantization by a factor of ~2

Author	`ikawrakow`
State	❌ Closed
Created	2024-08-03
Updated	2024-08-03

Description

It is interesting to observe that clang produces code that is ~6X faster than the GCC result on a simple benchmark that measures the speed of the best_index_iq4n function (which is the bottleneck during IQ4_K quantization). But when this is used in practice in quantize_row_iq4_k_impl_bs16, the clang executable is actually worse than the GCC executable. Either way, both compilers need a hand, so this PR gives it to them. This gives us a ~2X speedup in the IQ4_K quantization.

768 B Raw Blame History

🔀 #10 - iq4_k: speedup quantization by a factor of ~2

Description

768 B

Raw Blame History