turboderp-org/exllamav2
Mirror of https://github.com/turboderp-org/exllamav2.git, synced 2026-05-11 16:30:25 +00:00
Commit: 6d576b3e562cd4fd9e0c77cb23adedc5ce94cf3b
Path: exllamav2/examples
Latest commit: 6d576b3e56 by turboderp, 2023-09-03 15:56:38 +02:00: Reworking attention, allow for batched inference with independent cache per sequence
chat.py: Initial commit (2023-08-30 11:05:23 +02:00)
inference.py: Reworking attention, allow for batched inference with independent cache per sequence (2023-09-03 15:56:38 +02:00)
streaming.py: Tidying up (2023-09-02 16:40:57 +02:00)