Files
ik_llama.cpp/github-data/pull_requests/115 - MMQ for Q6_0.md
2025-07-23 13:31:53 +02:00

770 B

🔀 #115 - MMQ for Q6_0

Author ikawrakow
State Closed
Created 2024-11-20
Updated 2024-11-21

Description

Add MMQ kernel for Q6_0.

@Nexesenex


💬 Conversation

👤 Nexesenex commented the 2024-11-20 at 19:42:01:

Tested successfully on IK_LLame, PPL is 0.1% above Q6_K on a pure quant of Sheared Llama 2.7b. Thanks IK. I'll play with the Qwen models in the next days.


👤 Nexesenex commented the 2024-11-20 at 19:42:56:

Tested successfully on IK_LLama, PPL is 0.1% above Q6_K on a pure quant of Sheared Llama 2.7b. Thanks IK. I'll play with the Qwen models in the next days.