turboderp-org/exllamav2
Mirror of https://github.com/turboderp-org/exllamav2.git (synced 2026-05-11 16:30:25 +00:00)
3c80d41234987b3aa92ad4594c5267bb7f770b21
exllamav2/examples
turboderp  3c80d41234  Add 4-bit GPTQ support  2023-09-05 14:03:51 +02:00
chat.py       Add 4-bit GPTQ support  2023-09-05 14:03:51 +02:00
inference.py  Reworking attention, allow for batched inference with independent cache per sequence  2023-09-03 15:56:38 +02:00
streaming.py  Tidying up  2023-09-02 16:40:57 +02:00