turboderp-org/exllamav3
Mirror of https://github.com/turboderp-org/exllamav3.git (synced 2026-05-11 16:30:12 +00:00)
Branch: master
exllamav3/science
turboderp, 1f220f6e50: GEMM/MGEMM: Add autotuning with disk cache, remove static tuning table (2026-04-29 15:55:54 +02:00)
codebook_eval.py | GEMM: Lock MCG multiplier to 0xCBAC1FED and MUL1 to 0x83DCD12D. Make MCG the default codebook for new models. | 2025-10-12 22:09:01 +02:00
gumbel_eval.py | Initial commit | 2025-04-06 14:42:49 +02:00
kv_quant_exp.py | Add cache quantization | 2025-04-22 21:52:33 +02:00
qgemm_benchmark.py | GEMM: Lock MCG multiplier to 0xCBAC1FED and MUL1 to 0x83DCD12D. Make MCG the default codebook for new models. | 2025-10-12 22:09:01 +02:00