71 Commits

| Author | SHA1 | Message | Date |
|---|---|---|---|
| turboderp | 021b027728 | Qwen3.5: Enable MRoPE, update multimodal example | 2026-03-02 05:27:33 +01:00 |
| turboderp | c8c2e6178c | chat.py: Catbench shortcut | 2026-03-01 17:57:55 +01:00 |
| turboderp | 18b2a23d8a | chat.py: Fix error message | 2026-03-01 15:10:22 +01:00 |
| turboderp | 75ee2c78c3 | Add Qwen2_5_VLForConditionalGeneration, refactor HCXVisionV2VisionModel as subclass of Qwen2_5VLVisionModel | 2026-01-19 22:48:49 +01:00 |
| turboderp | 288a98f5e3 | Refactor sampler args for examples | 2026-01-11 12:33:27 +01:00 |
| turboderp | a17d1a4334 | Add HCXVisionV2ForCausalLM architecture | 2026-01-06 16:01:54 +01:00 |
| turboderp | 6e75e7b151 | chat.py: Fix for models with eos_token_id=null | 2026-01-04 02:02:10 +01:00 |
| turboderp | e8b77bba4a | chat.py: Fix prompt tokens/s display | 2025-12-25 23:18:50 +01:00 |
| turboderp | 80907797a5 | chat.py: Add debug mode | 2025-12-25 23:18:25 +01:00 |
| turboderp | d8be5d638f | chat.py: Read all stop conditions from config.json | 2025-12-10 00:53:45 +01:00 |
| turboderp | ba657d399d | chat.py: Add Ministral template | 2025-12-03 18:23:34 +01:00 |
| turboderp | ef8fd43d1c | Cleanup unused imports | 2025-11-16 14:25:46 +01:00 |
| turboderp | c47c17bae2 | Add Glm4V-MoE architecture | 2025-11-13 16:56:29 +01:00 |
| turboderp | 7c6b6c473f | chat.py: Allow breaking stream with esc | 2025-11-13 12:55:32 +01:00 |
| turboderp | 0bd0bfa17d | chat.py: Token dump feature | 2025-11-13 12:54:34 +01:00 |
| turboderp | c3886f0841 | chat.py: Don't crash on wrong stop conditions | 2025-11-13 12:53:23 +01:00 |
| turboderp | 1dff8123a6 | chat.py: Better SVG extract | 2025-11-13 12:52:46 +01:00 |
| turboderp | 792bd9dc75 | Glm4V: Update examples | 2025-11-13 12:52:46 +01:00 |
| turboderp | fcce0e6985 | Fix example Mistral template | 2025-11-10 01:34:26 +01:00 |
| turboderp | 8190932910 | Update multimodal example | 2025-11-09 13:34:11 +01:00 |
| turboderp | 20cbe13cab | Update multimodal example | 2025-11-09 01:42:30 +01:00 |
| turboderp | fab5da05ce | chat.py: Better SVG extraction | 2025-11-02 23:54:22 +01:00 |
| turboderp | 08e88cf918 | chat.py: Change default ChatML sysprompt | 2025-11-01 15:59:31 +01:00 |
| turboderp | a358e845c1 | chat.py: Add (SVG) save option | 2025-11-01 14:51:06 +01:00 |
| turboderp | ef077083f9 | chat.py: Fix template description | 2025-11-01 14:50:51 +01:00 |
| turboderp | 8a044a4cf4 | chat.py: Add MiniMax template | 2025-10-31 11:18:35 +01:00 |
| turboderp | c7f0b694b4 | Update example | 2025-09-19 00:48:12 +02:00 |
| turboderp | 59e3304da1 | chat.py: Add Apertus template | 2025-09-04 00:38:51 +02:00 |
| turboderp | 40fd9d3857 | chat.py: Add Seed-OSS template | 2025-08-24 21:40:26 +02:00 |
| turboderp | d9ac8d2fd0 | Fix regression #71 | 2025-08-16 23:39:14 +02:00 |
| turboderp | c1b469c0ea | chat.py: Add single-prompt-and-exit option | 2025-08-12 18:26:16 +02:00 |
| turboderp | 921f0c4155 | chat.py: Add Ctrl-C hook for cleaner multiprocess exit | 2025-07-29 22:01:45 +02:00 |
| turboderp | 1be81eed68 | chat.py: Add Exaone template | 2025-07-15 20:34:30 +02:00 |
| turboderp | 08dde73e66 | Add Formatron support and improved logit masking | 2025-07-11 21:29:40 +02:00 |
| turboderp | e12a1a5d0b | chat.py: Add some more commands | 2025-07-10 18:32:27 +02:00 |
| turboderp | 823fd8db85 | chat.py: Add Command-A template | 2025-07-09 21:24:06 +02:00 |
| turboderp | 4f0414a1e9 | chat.py: Add SmolLM3 template | 2025-07-09 03:24:36 +02:00 |
| turboderp | 4d0e5d3d5d | chat.py: Add Ernie template | 2025-07-09 03:00:28 +02:00 |
| turboderp | 6341b119ef | Loader: Add tensor override script | 2025-07-08 18:58:43 +02:00 |
| turboderp | b2b0980972 | chat.py: Fix t/s display | 2025-07-07 12:55:03 +02:00 |
| turboderp | 0cdb499549 | chat.py: Add Dots template | 2025-06-14 15:13:13 +02:00 |
| turboderp | 72bdf5a39c | Fix example | 2025-06-12 04:14:52 +02:00 |
| turboderp | bcaa636f66 | Add Transformers integration (still needs some work) | 2025-06-07 15:19:44 +02:00 |
| turboderp | 7ea9559837 | Add image search example | 2025-06-06 16:32:25 +02:00 |
| turboderp | ea4ef8dcbb | Chat.py: Add stats | 2025-06-06 03:43:41 +02:00 |
| turboderp | 98964cbdae | chat.py: Allow listing supported modes without specifying valid mode | 2025-06-05 02:26:00 +02:00 |
| turboderp | 9a24e0ce52 | Update multimodal example | 2025-06-03 03:10:21 +02:00 |
| turboderp | 0e27484613 | Fix example | 2025-06-02 20:10:27 +02:00 |
| turboderp | c264054b94 | Add multimodal example | 2025-06-02 02:27:17 +02:00 |
| turboderp | 460e201cc3 | Add batched translation example | 2025-05-25 15:03:19 +02:00 |