tabbyAPI/common
kingbri beb6d8faa5 Model: Adjust draft_gpu_split and add to config
The previous code overrode the existing gpu split and device idx
values. This now sets an independent draft_gpu_split value and
adjusts the gpu_devices check only if the draft_gpu_split array
is larger than the gpu_split array.

Draft gpu split is not Tensor Parallel, and defaults to gpu_split_auto
if a split is not provided.

Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-02-08 16:09:46 -05:00
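
The behavior described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the actual tabbyAPI code: `SplitPlan`, `plan_splits`, and the field names are hypothetical, and the assumption here is that the device check covers one GPU index per split entry.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SplitPlan:
    """Hypothetical container for the resolved split settings."""
    gpu_split: List[float]
    draft_gpu_split: List[float]
    draft_gpu_split_auto: bool
    devices_to_check: List[int]


def plan_splits(
    gpu_split: Optional[List[float]],
    draft_gpu_split: Optional[List[float]] = None,
) -> SplitPlan:
    # Keep the two splits independent: the draft split never
    # overwrites the main model's split.
    main = list(gpu_split or [])
    draft = list(draft_gpu_split or [])

    # Assumption: with no explicit draft split, the draft model
    # falls back to automatic splitting.
    draft_auto = len(draft) == 0

    # The device check covers the main split by default and is only
    # widened when the draft split names more GPUs.
    device_count = len(main)
    if len(draft) > len(main):
        device_count = len(draft)

    return SplitPlan(
        gpu_split=main,
        draft_gpu_split=draft,
        draft_gpu_split_auto=draft_auto,
        devices_to_check=list(range(device_count)),
    )


if __name__ == "__main__":
    # Draft split spans three GPUs, main split spans two.
    print(plan_splits([20, 24], [4, 4, 8]))
```

In this sketch the main `gpu_split` is returned unchanged in every case, which mirrors the stated goal of no longer overriding the existing split and device index values.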