Commit Graph

1094 Commits

Author SHA1 Message Date
无脸男
7823bac5ef balance serve 2025-09-05 15:04:54 +08:00
无脸男
83b1ff07ab ktransformers 2025-09-05 14:53:11 +08:00
无脸男
a76dedae8f transformers 2025-09-05 14:37:27 +08:00
无脸男
cead5654ff sampler 2025-09-05 14:30:59 +08:00
无脸男
0914117fe1 config 2025-09-05 14:28:59 +08:00
无脸男
0e25e7dfca forward batch 2025-09-05 14:28:23 +08:00
无脸男
b851b8e300 query manager 2025-09-05 14:25:01 +08:00
无脸男
6f0b09b946 sched_rpc 2025-09-05 14:22:40 +08:00
无脸男
01467718f8 settings 2025-09-05 14:19:57 +08:00
无脸男
b55ed730d4 config 2025-09-05 14:15:31 +08:00
Azure
4ccbdb23ae Merge pull request #1493 from Azure-Tang/main
Support kimi-k2-0905
2025-09-05 11:56:39 +08:00
无脸男
8e6d857c1a main 2025-09-05 11:54:43 +08:00
Azure-Tang
b6d36bffbb update kimi-k2-0905 2025-09-05 03:52:43 +00:00
无脸男
0cebe3a714 args 2025-09-05 11:49:30 +08:00
无脸男
baa66ff6d9 create_interface 2025-09-05 11:45:33 +08:00
无脸男
7895da6913 [wip]model runner 2025-09-05 11:34:23 +08:00
无脸男
5e2ef8d8fb adaptor 2025-09-03 17:19:58 +08:00
RICHARDNAN
a0229b220c 同步0.2.4npu的scripts 2025-09-03 15:40:28 +08:00
无脸男
07b369cb95 adaptor 2025-09-03 10:34:31 +08:00
Jianwei Dong
ee2ede0412 Merge pull request #1466 from kvcache-ai/update-readme-djw
update smallthinker and glm4 readme
2025-07-31 11:15:28 +08:00
djw
5771990a07 update smallthinker and glm4 readme 2025-07-31 03:14:49 +00:00
Jianwei Dong
757add1a39 Merge pull request #1456 from kvcache-ai/support-smt-glm4
Support SmallThinker and GLM4-MoE
2025-07-27 17:20:00 +08:00
qiyuxinlin
1334ddc833 update readme 2025-07-25 17:02:36 +00:00
qiyuxinlin
9e1560bb82 GLM4 and SmallThinker 2025-07-25 16:56:36 +00:00
djw
c7307aa0ae support smt and glm4 2025-07-25 16:24:38 +00:00
djw
17246bf84f support smt and glm4 2025-07-25 15:03:27 +00:00
djw
48bc6185b5 support smt and qlm4 2025-07-25 12:48:51 +00:00
qiyuxinlin
712ad1fa3c smallthinker right 2025-07-25 12:46:14 +00:00
Qiu Chengyu
f8719ee7b9 Add use_silu in MOEConfig in python and hard-determine smallthinker 2025-07-25 11:22:31 +00:00
Qiu Chengyu
cb808979fa Add use_silu in MOEConfig on cpu 2025-07-25 10:57:01 +00:00
qiyuxinlin
71c1d4eed7 smallthink run 2025-07-24 15:08:29 +00:00
djw
590fcb41cd support smt and glm4 2025-07-24 12:31:01 +00:00
djw
613f0b7c37 support smt and glm4 2025-07-24 09:39:19 +00:00
djw
b66d96db97 support smt and glm4 2025-07-24 08:40:58 +00:00
wang jiahao
1677e90092 Merge pull request #1439 from kvcache-ai/qiyuxinlin-patch-3
Update balance_serve.py
2025-07-12 13:14:54 +08:00
wang jiahao
a2e95e467a Update balance_serve.py 2025-07-12 13:14:35 +08:00
UnicornChan
dc59af6167 Merge pull request #1438 from kvcache-ai/update-readme
Update Kimi-K2 Readme
2025-07-12 12:52:52 +08:00
chenxl
b5024f62a4 Update Kimi-K2 Readme 2025-07-12 12:51:00 +08:00
Atream
4fb367542b Merge pull request #1437 from kvcache-ai/Atream-patch-5
Update Kimi-K2.md
2025-07-12 12:44:52 +08:00
Atream
34d2829f24 Update Kimi-K2.md 2025-07-12 12:44:41 +08:00
Atream
df19681ec4 Merge pull request #1436 from kvcache-ai/Atream-patch-4
Update Kimi-K2.md
2025-07-12 11:58:25 +08:00
Atream
90245d8a6b Update Kimi-K2.md 2025-07-12 11:57:51 +08:00
Atream
8e2c67d655 Merge pull request #1435 from kvcache-ai/Atream-patch-3
Update Kimi-K2.md
2025-07-12 11:48:08 +08:00
Atream
378e4fc035 Update Kimi-K2.md 2025-07-12 11:47:42 +08:00
Atream
5d4a644456 Merge pull request #1434 from kvcache-ai/Atream-patch-2
Update Kimi-K2.md
2025-07-11 23:26:35 +08:00
Atream
b4ed8b6ded Update Kimi-K2.md 2025-07-11 23:26:18 +08:00
UnicornChan
83c8e7928e Merge pull request #1432 from kvcache-ai/UnicornChan-patch-1
Update Kimi-K2.md
2025-07-11 19:32:42 +08:00
UnicornChan
7800a413a2 Update Kimi-K2.md 2025-07-11 19:31:58 +08:00
Atream
2303889709 Merge pull request #1431 from kvcache-ai/support-kimi-k2
Support kimi k2
2025-07-11 09:36:01 +08:00
Atream
cf79c93fae Update README.md 2025-07-11 09:35:12 +08:00