tabbyAPI/requirements-colab.txt
Commit 67507105d0 by DocShotgun: Update colab, expose additional args
* Exposed draft model args for speculative decoding (see the config sketch below)
* Exposed the int8 cache, dummy models, and the option to disable flash attention
* Resolved CUDA 11.8 dependency issue
2023-12-04 22:20:46 -08:00
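The bullets above describe options surfaced in the colab notebook rather than changes to this requirements file itself. As a rough illustration only, the sketch below writes a tabbyAPI-style YAML config covering those options; every key name here is an assumption made for the example, not confirmed against the tabbyAPI source.

    # Hypothetical config sketch; all key names below are assumptions made
    # for illustration and should be checked against tabbyAPI's config.yml.
    import yaml  # PyYAML, already listed in this requirements file

    config = {
        "model": {
            "model_name": "MyModel-exl2",             # assumed key
            "no_flash_attention": False,              # assumed key: skip FA2
            "cache_mode": "int8",                     # assumed key: int8 KV cache
            "use_dummy_models": False,                # assumed key
            "draft": {                                # assumed block: speculative decoding
                "draft_model_name": "MyDraft-exl2",   # assumed key
            },
        },
    }

    with open("config.yml", "w") as f:
        yaml.safe_dump(config, f, sort_keys=False)

Generating the config from the notebook keeps all the exposed toggles in one place; since the keys above are placeholders, map them onto the real config before use.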

fastapi
pydantic >= 1, < 2
PyYAML
progress
uvicorn
# Wheels
# Linux FA2 from https://github.com/Dao-AILab/flash-attention/releases
https://github.com/Dao-AILab/flash-attention/releases/download/v2.3.6/flash_attn-2.3.6+cu118torch2.1cxx11abiFALSE-cp310-cp310-linux_x86_64.whl; platform_system == "Linux" and platform_machine == "x86_64" and python_version == "3.10"
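
The direct URL above pins a prebuilt flash-attn 2.3.6 wheel for CUDA 11.8 (cu118), torch 2.1, and CPython 3.10 on x86_64 Linux, which matches the Colab runtime and avoids compiling flash-attention from source; this pin is presumably what resolves the CUDA 11.8 dependency issue mentioned in the commit. The clause after the semicolon is a PEP 508 environment marker, so pip fetches the wheel only on matching platforms. A minimal sketch of how such a marker is evaluated, using the packaging library (not a dependency of this file):

    # Evaluate the PEP 508 environment marker from the wheel line above.
    # pip performs an equivalent check before deciding to install the URL.
    from packaging.markers import Marker

    marker = Marker(
        'platform_system == "Linux" and '
        'platform_machine == "x86_64" and '
        'python_version == "3.10"'
    )
    print(marker.evaluate())  # True only on Linux x86_64 with Python 3.10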