Files
tabbyAPI/backends/exllamav2
kingbri 09a4c79847 Model: Auto-scale max_tokens by default
If max_tokens is None, it automatically scales to fill up the context.
This does not mean the generation will fill up that context since
EOS stops also exist.

Originally suggested by #86

Signed-off-by: kingbri <bdashore3@proton.me>
2024-03-18 22:54:59 -04:00
..
2024-03-08 01:00:48 -05:00
2024-03-08 01:00:48 -05:00