theroyallab/tabbyAPI
mirror of https://github.com/theroyallab/tabbyAPI.git, synced 2026-03-15 00:07:28 +00:00
304df16543c199f4e6f60d52cb4a5b835542b1ae
tabbyAPI/backends/exllamav2
Brian  2e491472d1  Merge pull request #254 from lucyknada/main ... add draft_gpu_split option for spec decoding  2025-02-11 16:48:03 -05:00
grammar.py  Grammar: Cache the engine vocabulary                    2024-12-05 21:36:37 -08:00
model.py    Model: Adjust draft_gpu_split and add to config         2025-02-08 16:09:46 -05:00
utils.py    Dependencies: Update torch, exllamav2, and flash-attn   2025-02-09 01:27:48 -05:00
version.py  Dependencies: Update torch, exllamav2, and flash-attn   2025-02-09 01:27:48 -05:00
vision.py   Dependencies: Fix OpenAPI generation                    2024-11-22 17:59:20 -05:00