exllamav3/eval at 34d2f1f5fa069b842b8791de8c8db499b3884579 - exllamav3 - Public git mirror

turboderp-org/exllamav3

mirror of https://github.com/turboderp-org/exllamav3.git synced 2026-04-20 14:29:51 +00:00

turboderp 34d2f1f5fa Add prequant_test script

2025-05-30 19:42:49 +02:00

Add simple long-context evaluation script

2025-05-17 16:58:12 +02:00

compare_q.py: Fix some logic for KLD test

2025-05-18 21:55:26 +02:00

compare_q_exllamav2.py

compare_q.py: Account for unquantized weights in blocksparse EXL2 layers

2025-05-14 23:55:25 +02:00

compare_q_exllamav3.py

compare_q.py: Add KLD test and some other tweaks

2025-05-16 16:13:26 +02:00

compare_q_llamacpp.py

compare_q.py: Fix llama.cpp bpw measurement for MoE models

2025-05-18 00:19:59 +02:00

compare_q_transformers.py

compare_q.py: Add more GPTQ layer types

2025-05-18 00:19:19 +02:00

compare_q.py

compare_q.py: Fix some logic for KLD test

2025-05-18 21:55:26 +02:00

humaneval.py

HumanEval: Move BOS token to individual prompt template, don't prepend by default when tokenizing

2025-05-11 23:02:07 +02:00

longctx.py

Add simple long-context evaluation script

2025-05-17 16:58:12 +02:00

model_diff.py

model_diff.py: Add device argument

2025-05-30 19:42:49 +02:00

ppl.py

Reduce VRAM overhead in ppl test

2025-04-17 22:22:47 +02:00

prequant_test.py

Add prequant_test script

2025-05-30 19:42:49 +02:00