无脸男
|
7823bac5ef
|
balance serve
|
2025-09-05 15:04:54 +08:00 |
|
无脸男
|
83b1ff07ab
|
ktransformers
|
2025-09-05 14:53:11 +08:00 |
|
无脸男
|
a76dedae8f
|
transformers
|
2025-09-05 14:37:27 +08:00 |
|
无脸男
|
cead5654ff
|
sampler
|
2025-09-05 14:30:59 +08:00 |
|
无脸男
|
0914117fe1
|
config
|
2025-09-05 14:28:59 +08:00 |
|
无脸男
|
0e25e7dfca
|
forward batch
|
2025-09-05 14:28:23 +08:00 |
|
无脸男
|
b851b8e300
|
query manager
|
2025-09-05 14:25:01 +08:00 |
|
无脸男
|
6f0b09b946
|
sched_rpc
|
2025-09-05 14:22:40 +08:00 |
|
无脸男
|
01467718f8
|
settings
|
2025-09-05 14:19:57 +08:00 |
|
无脸男
|
b55ed730d4
|
config
|
2025-09-05 14:15:31 +08:00 |
|
Azure
|
4ccbdb23ae
|
Merge pull request #1493 from Azure-Tang/main
Support kimi-k2-0905
|
2025-09-05 11:56:39 +08:00 |
|
无脸男
|
8e6d857c1a
|
main
|
2025-09-05 11:54:43 +08:00 |
|
Azure-Tang
|
b6d36bffbb
|
update kimi-k2-0905
|
2025-09-05 03:52:43 +00:00 |
|
无脸男
|
0cebe3a714
|
args
|
2025-09-05 11:49:30 +08:00 |
|
无脸男
|
baa66ff6d9
|
create_interface
|
2025-09-05 11:45:33 +08:00 |
|
无脸男
|
7895da6913
|
[wip]model runner
|
2025-09-05 11:34:23 +08:00 |
|
无脸男
|
5e2ef8d8fb
|
adaptor
|
2025-09-03 17:19:58 +08:00 |
|
RICHARDNAN
|
a0229b220c
|
同步0.2.4npu的scripts
|
2025-09-03 15:40:28 +08:00 |
|
无脸男
|
07b369cb95
|
adaptor
|
2025-09-03 10:34:31 +08:00 |
|
Jianwei Dong
|
ee2ede0412
|
Merge pull request #1466 from kvcache-ai/update-readme-djw
update smallthinker and glm4 readme
|
2025-07-31 11:15:28 +08:00 |
|
djw
|
5771990a07
|
update smallthinker and glm4 readme
|
2025-07-31 03:14:49 +00:00 |
|
Jianwei Dong
|
757add1a39
|
Merge pull request #1456 from kvcache-ai/support-smt-glm4
Support SmallThinker and GLM4-MoE
|
2025-07-27 17:20:00 +08:00 |
|
qiyuxinlin
|
1334ddc833
|
update readme
|
2025-07-25 17:02:36 +00:00 |
|
qiyuxinlin
|
9e1560bb82
|
GLM4 and SmallThinker
|
2025-07-25 16:56:36 +00:00 |
|
djw
|
c7307aa0ae
|
support smt and glm4
|
2025-07-25 16:24:38 +00:00 |
|
djw
|
17246bf84f
|
support smt and glm4
|
2025-07-25 15:03:27 +00:00 |
|
djw
|
48bc6185b5
|
support smt and qlm4
|
2025-07-25 12:48:51 +00:00 |
|
qiyuxinlin
|
712ad1fa3c
|
smallthinker right
|
2025-07-25 12:46:14 +00:00 |
|
Qiu Chengyu
|
f8719ee7b9
|
Add use_silu in MOEConfig in python and hard-determine smallthinker
|
2025-07-25 11:22:31 +00:00 |
|
Qiu Chengyu
|
cb808979fa
|
Add use_silu in MOEConfig on cpu
|
2025-07-25 10:57:01 +00:00 |
|
qiyuxinlin
|
71c1d4eed7
|
smallthink run
|
2025-07-24 15:08:29 +00:00 |
|
djw
|
590fcb41cd
|
support smt and glm4
|
2025-07-24 12:31:01 +00:00 |
|
djw
|
613f0b7c37
|
support smt and glm4
|
2025-07-24 09:39:19 +00:00 |
|
djw
|
b66d96db97
|
support smt and glm4
|
2025-07-24 08:40:58 +00:00 |
|
wang jiahao
|
1677e90092
|
Merge pull request #1439 from kvcache-ai/qiyuxinlin-patch-3
Update balance_serve.py
|
2025-07-12 13:14:54 +08:00 |
|
wang jiahao
|
a2e95e467a
|
Update balance_serve.py
|
2025-07-12 13:14:35 +08:00 |
|
UnicornChan
|
dc59af6167
|
Merge pull request #1438 from kvcache-ai/update-readme
Update Kimi-K2 Readme
|
2025-07-12 12:52:52 +08:00 |
|
chenxl
|
b5024f62a4
|
Update Kimi-K2 Readme
|
2025-07-12 12:51:00 +08:00 |
|
Atream
|
4fb367542b
|
Merge pull request #1437 from kvcache-ai/Atream-patch-5
Update Kimi-K2.md
|
2025-07-12 12:44:52 +08:00 |
|
Atream
|
34d2829f24
|
Update Kimi-K2.md
|
2025-07-12 12:44:41 +08:00 |
|
Atream
|
df19681ec4
|
Merge pull request #1436 from kvcache-ai/Atream-patch-4
Update Kimi-K2.md
|
2025-07-12 11:58:25 +08:00 |
|
Atream
|
90245d8a6b
|
Update Kimi-K2.md
|
2025-07-12 11:57:51 +08:00 |
|
Atream
|
8e2c67d655
|
Merge pull request #1435 from kvcache-ai/Atream-patch-3
Update Kimi-K2.md
|
2025-07-12 11:48:08 +08:00 |
|
Atream
|
378e4fc035
|
Update Kimi-K2.md
|
2025-07-12 11:47:42 +08:00 |
|
Atream
|
5d4a644456
|
Merge pull request #1434 from kvcache-ai/Atream-patch-2
Update Kimi-K2.md
|
2025-07-11 23:26:35 +08:00 |
|
Atream
|
b4ed8b6ded
|
Update Kimi-K2.md
|
2025-07-11 23:26:18 +08:00 |
|
UnicornChan
|
83c8e7928e
|
Merge pull request #1432 from kvcache-ai/UnicornChan-patch-1
Update Kimi-K2.md
|
2025-07-11 19:32:42 +08:00 |
|
UnicornChan
|
7800a413a2
|
Update Kimi-K2.md
|
2025-07-11 19:31:58 +08:00 |
|
Atream
|
2303889709
|
Merge pull request #1431 from kvcache-ai/support-kimi-k2
Support kimi k2
|
2025-07-11 09:36:01 +08:00 |
|
Atream
|
cf79c93fae
|
Update README.md
|
2025-07-11 09:35:12 +08:00 |
|