mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-01-26 17:20:01 +00:00
Use bf16 instead of fp16 block scales for q8_1 (#292)
* WIP - not working * q8_0 without bells and wistles works * It works for q8_0 * Use bf16 instead of f16,int16 * q4_0_r8 * q5_0_r4 * q6_0_r4 * Also q4_1 and q5_1 * q8_0_r8 on avx2 --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
This commit is contained in:
@@ -396,8 +396,9 @@ extern "C" {
|
||||
//
|
||||
GGML_TYPE_I2_S = 36,
|
||||
//
|
||||
GGML_TYPE_Q8_0_X4 = 98,
|
||||
GGML_TYPE_Q8_1_X4 = 99,
|
||||
GGML_TYPE_Q8_0_X4 = 97,
|
||||
GGML_TYPE_Q8_1_X4 = 98,
|
||||
GGML_TYPE_Q8_2_X4 = 99,
|
||||
GGML_TYPE_Q6_0 = 133,
|
||||
GGML_TYPE_IQ1_BN = 134,
|
||||
GGML_TYPE_IQ2_BN = 135,
|
||||
|
||||
Reference in New Issue
Block a user