Jianwei Dong
|
8ff3966173
|
Merge pull request #1505 from kvcache-ai/support-qwen3next
fix qwen3next bug
|
2025-09-16 21:29:33 +08:00 |
|
djw
|
d9c75cb5aa
|
Merge branch 'support-qwen3next' of https://github.com/kvcache-ai/ktransformers into support-qwen3next
|
2025-09-16 13:26:37 +00:00 |
|
djw
|
0437660e62
|
fix bug
|
2025-09-16 13:21:58 +00:00 |
|
Jianwei Dong
|
880daa7fde
|
Merge pull request #1500 from kvcache-ai/support-qwen3next
Support qwen3next
|
2025-09-12 21:59:47 +08:00 |
|
Jianwei Dong
|
d4b3fe2427
|
Merge branch 'main' into support-qwen3next
|
2025-09-12 21:59:32 +08:00 |
|
Azure
|
41d84d54a0
|
Merge pull request #1494 from kvcache-ai/Azure-Tang-patch-1
Update date for Kimi-K2-0905 support
|
2025-09-11 23:49:43 +08:00 |
|
djw
|
a44b710649
|
support qwen3 next
|
2025-09-11 11:55:09 +00:00 |
|
djw
|
1bc9213e7b
|
support qwen3 next
|
2025-09-11 09:57:42 +00:00 |
|
djw
|
32ed7c0687
|
support qwen3 next
|
2025-09-11 09:56:21 +00:00 |
|
cen121212
|
1dbbf3be9f
|
Merge pull request #10 from RICHARDNAN/br_czn_main-9-1
revert install.sh
|
2025-09-11 15:41:51 +08:00 |
|
RICHARDNAN
|
faf86e2ff5
|
Update install.sh
|
2025-09-11 15:26:02 +08:00 |
|
cen121212
|
d7005e0785
|
Merge pull request #9 from cen121212/main-9-1-chengshaoxu
merge NPU csrc to GPU: part 6
|
2025-09-11 14:42:47 +08:00 |
|
Shaoxu Cheng
|
899e5c492c
|
merge NPU csrc to GPU: part 6
|
2025-09-11 14:40:34 +08:00 |
|
cen121212
|
4fd8fbe675
|
Merge pull request #5 from cen121212/br_whq_cmakelist_into_main
Merge CMakeLists.txt in for_arm
|
2025-09-11 11:12:49 +08:00 |
|
wanghanqingLYT
|
758e1790e0
|
Merge branch 'main-9-1' into br_whq_cmakelist_into_main
|
2025-09-11 10:22:22 +08:00 |
|
cen121212
|
aa11b35540
|
Merge pull request #2 from cen121212/br_whq_v0.2.4_into_main
merge arm branch for sgemm.cpp, tinyblas_cpu_sgemm.inc and iqk_mul_ma…
|
2025-09-11 09:37:50 +08:00 |
|
cen121212
|
cea135e56b
|
Merge pull request #3 from WithHades/main-9-1
Main 9 1
|
2025-09-11 09:36:24 +08:00 |
|
cen121212
|
a582b2ad7d
|
Merge pull request #4 from RICHARDNAN/br_czn_main-9-1
merge install.sh install_for_npu.sh setup.py
|
2025-09-11 09:36:03 +08:00 |
|
cen121212
|
0c9b3504dd
|
Merge pull request #7 from cen121212/main-9-1-chengshaoxu
Merge npu csrc part to ktransformers
|
2025-09-11 09:34:56 +08:00 |
|
cen121212
|
27a9da62ba
|
Merge pull request #8 from cen121212/main-9-1-luochen
适配npu----models/operators文件夹
|
2025-09-11 09:34:41 +08:00 |
|
djw
|
3550b03795
|
support qwen3 next
|
2025-09-10 18:55:33 +00:00 |
|
wanghanqingLYT
|
f2aec8032f
|
npu enabling for deepseekv3 model and expert.py
|
2025-09-09 15:35:42 +08:00 |
|
cen121212
|
a9a9a95b0b
|
适配npu-models/operators文件夹4
|
2025-09-08 19:43:06 +08:00 |
|
cen121212
|
3aee5caa77
|
适配npu-models/operators文件夹3
|
2025-09-08 19:42:41 +08:00 |
|
Shaoxu Cheng
|
dcabb3ca6e
|
merge NPU csrc to GPU: part 5
|
2025-09-08 19:42:33 +08:00 |
|
cen121212
|
ef2665a362
|
适配npu-models/operators文件夹2
|
2025-09-08 19:42:14 +08:00 |
|
Shaoxu Cheng
|
00f536622d
|
merge NPU csrc to GPU: part 4
|
2025-09-08 19:42:14 +08:00 |
|
Shaoxu Cheng
|
1d94264992
|
merge NPU csrc to GPU: part 3
|
2025-09-08 19:41:13 +08:00 |
|
Shaoxu Cheng
|
dea06aa77f
|
merge NPU csrc to GPU: part 2
|
2025-09-08 19:40:53 +08:00 |
|
cen121212
|
e0318c0fc3
|
适配npu-models/operators文件夹
|
2025-09-08 19:40:36 +08:00 |
|
Shaoxu Cheng
|
3125616ca2
|
merge NPU csrc to GPU: part 1
|
2025-09-08 19:40:17 +08:00 |
|
无脸男
|
f2a3ba0697
|
ktransformers
|
2025-09-08 17:49:18 +08:00 |
|
无脸男
|
cecc37841d
|
balance serve
|
2025-09-08 17:46:26 +08:00 |
|
无脸男
|
35369ed6e3
|
completions
|
2025-09-08 17:30:49 +08:00 |
|
wanghanqingLYT
|
ed566b5f23
|
Merge CMakeLists.txt in for_arm
|
2025-09-08 17:24:53 +08:00 |
|
RICHARDNAN
|
c89959fe1d
|
Update setup.py
|
2025-09-08 17:18:57 +08:00 |
|
无脸男
|
a344f2b5d4
|
utils
|
2025-09-08 17:18:07 +08:00 |
|
无脸男
|
76301e8e6e
|
utils
|
2025-09-08 17:07:38 +08:00 |
|
RICHARDNAN
|
3e700fd536
|
Update merge_safetensor_gguf.py
|
2025-09-08 15:42:06 +08:00 |
|
RICHARDNAN
|
125558851e
|
Delete serve_test.sh
|
2025-09-08 15:38:54 +08:00 |
|
RICHARDNAN
|
e0c8258b2a
|
Delete scripts directory
|
2025-09-08 15:38:30 +08:00 |
|
RICHARDNAN
|
82b9bbfa49
|
Update install_for_npu.sh
|
2025-09-08 15:38:05 +08:00 |
|
RICHARDNAN
|
ba9e964fcd
|
Update merge_safetensor_gguf.py
|
2025-09-08 15:34:30 +08:00 |
|
无脸男
|
3d8ff57f78
|
custom loader
|
2025-09-08 14:56:04 +08:00 |
|
无脸男
|
68eadd3bdc
|
custom gguf loader
|
2025-09-08 14:51:25 +08:00 |
|
无脸男
|
9299c25e43
|
optimize.py
|
2025-09-08 14:48:54 +08:00 |
|
无脸男
|
d0432ed5c4
|
yaml
|
2025-09-08 14:46:33 +08:00 |
|
wanghanqingLYT
|
e7e2c2bd70
|
merge arm branch for sgemm.cpp, tinyblas_cpu_sgemm.inc and iqk_mul_mat.inc
|
2025-09-08 14:11:17 +08:00 |
|
Atream
|
72cd2a5af7
|
Merge pull request #1495 from kvcache-ai/Atream-patch-6
Update GGUF format link in Kimi-K2 documentation
|
2025-09-05 20:19:50 +08:00 |
|
Atream
|
64b3b30ba3
|
Update GGUF format link in Kimi-K2 documentation
|
2025-09-05 20:19:37 +08:00 |
|