This website requires JavaScript.
Explore
Help
Register
Sign In
turboderp-org
/
exllamav3
Watch
1
Star
0
Fork
0
You've already forked exllamav3
mirror of
https://github.com/turboderp-org/exllamav3.git
synced
2026-03-15 00:07:24 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
5738bc62e52d379cab8065fcf72bd7d99070835e
exllamav3
/
science
History
turboderp
0f2da5d6a7
GEMM: Lock MCG multiplier to 0xCBAC1FED and MUL1 to 0x83DCD12D. Make MCG the default codebook for new models.
2025-10-12 22:09:01 +02:00
..
codebook_eval.py
GEMM: Lock MCG multiplier to 0xCBAC1FED and MUL1 to 0x83DCD12D. Make MCG the default codebook for new models.
2025-10-12 22:09:01 +02:00
gumbel_eval.py
Initial commit
2025-04-06 14:42:49 +02:00
kv_quant_exp.py
Add cache quantization
2025-04-22 21:52:33 +02:00
qgemm_benchmark.py
GEMM: Lock MCG multiplier to 0xCBAC1FED and MUL1 to 0x83DCD12D. Make MCG the default codebook for new models.
2025-10-12 22:09:01 +02:00
qgemm_pretune.py
Rework GEMM kernel tuning
2025-10-05 01:30:20 +02:00