tabbyAPI/common/transformers_utils.py at 76ffc7c4588c6ab5b3092c1284b9f527b103bd96

mirror of https://github.com/theroyallab/tabbyAPI.git synced 2026-03-14 15:57:27 +00:00

Files

kingbri 2096c9bad2 Model: Default max_seq_len to 4096

A common problem in TabbyAPI is that users who want to get up and
running with a model always had issues with max_seq_len causing OOMs.
This is because model devs set max context values in the millions which
requires a lot of VRAM.

To idiot-proof first time setup, make the fallback default 4096 so
users can run their models. If a user still wants to use the model's
max_seq_len, set it to -1.

Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>

2025-06-13 14:57:24 -04:00

5.2 KiB

Raw Blame History

View Raw

5.2 KiB Raw Blame History

5.2 KiB

Raw Blame History