ik_llama.cpp/github-data/pull_requests/137 - Fix AVX2 implementation of iq4_nl_r4.md at d44b2fa4ab984594aee8f136c044caefe7c2e2af - ik_llama.cpp

ikawrakow/ik_llama.cpp


mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-04-24 00:19:19 +00:00


Thomas eaa2510a28 Add GitHub data: filename sanitization (#640 )

2025-07-23 13:31:53 +02:00


🐛 #137 - Fix AVX2 implementation of iq4_nl_r4

| | |
| :--- | :--- |
| **Author** | `ikawrakow` |
| **State** | ❌ Closed |
| **Created** | 2024-12-11 |
| **Updated** | 2024-12-11 |

Description

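The implementation was using `_mm256_maddubs_epi16`, which multiplies unsigned 8-bit values by signed 8-bit values and adds adjacent pairs of products with signed 16-bit saturation. With the unsigned version of the non-linear `IQ4_NL` quants lookup table, these pair sums can overflow the 16-bit range and get saturated, which gives wrong results. This PR fixes it without a noticeable performance loss.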
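To make the overflow concrete, here is a minimal scalar sketch of what a single 16-bit lane of `_mm256_maddubs_epi16` computes, next to the exact widened result. The helper names `maddubs_lane` and `exact_lane` are hypothetical, and the operand value 241 is an assumption (it presumes the unsigned table is the signed one shifted by +128); this is not the code from the PR, and the PR's actual fix may differ.

```c
#include <stdint.h>

/* Scalar model of one 16-bit output lane of _mm256_maddubs_epi16:
 * unsigned 8-bit a[i] times signed 8-bit b[i], adjacent products added,
 * result saturated to the signed 16-bit range.
 * (Hypothetical helper for illustration only.) */
static int16_t maddubs_lane(uint8_t a0, int8_t b0, uint8_t a1, int8_t b1) {
    int32_t sum = (int32_t)a0 * b0 + (int32_t)a1 * b1;
    if (sum > INT16_MAX) sum = INT16_MAX;   /* positive saturation */
    if (sum < INT16_MIN) sum = INT16_MIN;   /* negative saturation */
    return (int16_t)sum;
}

/* The same pair sum kept in 32 bits, so nothing is lost.
 * (Hypothetical helper for illustration only.) */
static int32_t exact_lane(uint8_t a0, int8_t b0, uint8_t a1, int8_t b1) {
    return (int32_t)a0 * b0 + (int32_t)a1 * b1;
}
```

With two large unsigned table entries (assumed here to be 241) against activations of 127, `exact_lane(241, 127, 241, 127)` is 61214, while `maddubs_lane` with the same arguments clamps to 32767. Small operands such as `(1, 1, 13, -2)` give the same result, -25, from both helpers, which is why the error only shows up for some inputs.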
