Custom quantization rules with regular expressions (#244)

* Custom quantization rules with regular expressions

* Add the --custom-q option to the help

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
This commit is contained in:
Kawrakow
2025-03-07 08:54:09 +02:00
committed by GitHub
parent a3f6ee27cc
commit f8fb8ec9aa
3 changed files with 51 additions and 0 deletions

View File

@@ -418,6 +418,7 @@ extern "C" {
bool ignore_imatrix_rules; // If set to true, the built-in rules for refusing to quantize into certain quants without imatrix are ignored
void * imatrix; // pointer to importance matrix data
void * kv_overrides; // pointer to vector containing overrides
void * custom_quants; // pointer to vector containing custom quantization rules
} llama_model_quantize_params;
// grammar types