Files
tabbyAPI/backends/exllamav2
kingbri d759a15559 Model: Fix chunk size handling
Wrong class attribute name used for max_attention_size and fixes
declaration of the draft model's chunk_size.

Also expose the parameter to the end user in both config and model
load.

Signed-off-by: kingbri <bdashore3@proton.me>
2024-04-07 18:39:19 -04:00
..
2024-04-07 18:39:19 -04:00
2024-04-07 18:00:56 -04:00