Commit Graph

1098 Commits

Author SHA1 Message Date
Jianwei Dong
8ff3966173 Merge pull request #1505 from kvcache-ai/support-qwen3next
fix qwen3next bug
2025-09-16 21:29:33 +08:00
djw
d9c75cb5aa Merge branch 'support-qwen3next' of https://github.com/kvcache-ai/ktransformers into support-qwen3next 2025-09-16 13:26:37 +00:00
djw
0437660e62 fix bug 2025-09-16 13:21:58 +00:00
Jianwei Dong
880daa7fde Merge pull request #1500 from kvcache-ai/support-qwen3next
Support qwen3next
2025-09-12 21:59:47 +08:00
Jianwei Dong
d4b3fe2427 Merge branch 'main' into support-qwen3next 2025-09-12 21:59:32 +08:00
Azure
41d84d54a0 Merge pull request #1494 from kvcache-ai/Azure-Tang-patch-1
Update date for Kimi-K2-0905 support
2025-09-11 23:49:43 +08:00
djw
a44b710649 support qwen3 next 2025-09-11 11:55:09 +00:00
djw
1bc9213e7b support qwen3 next 2025-09-11 09:57:42 +00:00
djw
32ed7c0687 support qwen3 next 2025-09-11 09:56:21 +00:00
cen121212
1dbbf3be9f Merge pull request #10 from RICHARDNAN/br_czn_main-9-1
revert install.sh
2025-09-11 15:41:51 +08:00
RICHARDNAN
faf86e2ff5 Update install.sh 2025-09-11 15:26:02 +08:00
cen121212
d7005e0785 Merge pull request #9 from cen121212/main-9-1-chengshaoxu
merge NPU csrc to GPU: part 6
2025-09-11 14:42:47 +08:00
Shaoxu Cheng
899e5c492c merge NPU csrc to GPU: part 6 2025-09-11 14:40:34 +08:00
cen121212
4fd8fbe675 Merge pull request #5 from cen121212/br_whq_cmakelist_into_main
Merge CMakeLists.txt in for_arm
2025-09-11 11:12:49 +08:00
wanghanqingLYT
758e1790e0 Merge branch 'main-9-1' into br_whq_cmakelist_into_main 2025-09-11 10:22:22 +08:00
cen121212
aa11b35540 Merge pull request #2 from cen121212/br_whq_v0.2.4_into_main
merge arm branch for sgemm.cpp, tinyblas_cpu_sgemm.inc and iqk_mul_ma…
2025-09-11 09:37:50 +08:00
cen121212
cea135e56b Merge pull request #3 from WithHades/main-9-1
Main 9 1
2025-09-11 09:36:24 +08:00
cen121212
a582b2ad7d Merge pull request #4 from RICHARDNAN/br_czn_main-9-1
merge install.sh install_for_npu.sh setup.py
2025-09-11 09:36:03 +08:00
cen121212
0c9b3504dd Merge pull request #7 from cen121212/main-9-1-chengshaoxu
Merge npu csrc part to ktransformers
2025-09-11 09:34:56 +08:00
cen121212
27a9da62ba Merge pull request #8 from cen121212/main-9-1-luochen
适配npu----models/operators文件夹
2025-09-11 09:34:41 +08:00
djw
3550b03795 support qwen3 next 2025-09-10 18:55:33 +00:00
wanghanqingLYT
f2aec8032f npu enabling for deepseekv3 model and expert.py 2025-09-09 15:35:42 +08:00
cen121212
a9a9a95b0b 适配npu-models/operators文件夹4 2025-09-08 19:43:06 +08:00
cen121212
3aee5caa77 适配npu-models/operators文件夹3 2025-09-08 19:42:41 +08:00
Shaoxu Cheng
dcabb3ca6e merge NPU csrc to GPU: part 5 2025-09-08 19:42:33 +08:00
cen121212
ef2665a362 适配npu-models/operators文件夹2 2025-09-08 19:42:14 +08:00
Shaoxu Cheng
00f536622d merge NPU csrc to GPU: part 4 2025-09-08 19:42:14 +08:00
Shaoxu Cheng
1d94264992 merge NPU csrc to GPU: part 3 2025-09-08 19:41:13 +08:00
Shaoxu Cheng
dea06aa77f merge NPU csrc to GPU: part 2 2025-09-08 19:40:53 +08:00
cen121212
e0318c0fc3 适配npu-models/operators文件夹 2025-09-08 19:40:36 +08:00
Shaoxu Cheng
3125616ca2 merge NPU csrc to GPU: part 1 2025-09-08 19:40:17 +08:00
无脸男
f2a3ba0697 ktransformers 2025-09-08 17:49:18 +08:00
无脸男
cecc37841d balance serve 2025-09-08 17:46:26 +08:00
无脸男
35369ed6e3 completions 2025-09-08 17:30:49 +08:00
wanghanqingLYT
ed566b5f23 Merge CMakeLists.txt in for_arm 2025-09-08 17:24:53 +08:00
RICHARDNAN
c89959fe1d Update setup.py 2025-09-08 17:18:57 +08:00
无脸男
a344f2b5d4 utils 2025-09-08 17:18:07 +08:00
无脸男
76301e8e6e utils 2025-09-08 17:07:38 +08:00
RICHARDNAN
3e700fd536 Update merge_safetensor_gguf.py 2025-09-08 15:42:06 +08:00
RICHARDNAN
125558851e Delete serve_test.sh 2025-09-08 15:38:54 +08:00
RICHARDNAN
e0c8258b2a Delete scripts directory 2025-09-08 15:38:30 +08:00
RICHARDNAN
82b9bbfa49 Update install_for_npu.sh 2025-09-08 15:38:05 +08:00
RICHARDNAN
ba9e964fcd Update merge_safetensor_gguf.py 2025-09-08 15:34:30 +08:00
无脸男
3d8ff57f78 custom loader 2025-09-08 14:56:04 +08:00
无脸男
68eadd3bdc custom gguf loader 2025-09-08 14:51:25 +08:00
无脸男
9299c25e43 optimize.py 2025-09-08 14:48:54 +08:00
无脸男
d0432ed5c4 yaml 2025-09-08 14:46:33 +08:00
wanghanqingLYT
e7e2c2bd70 merge arm branch for sgemm.cpp, tinyblas_cpu_sgemm.inc and iqk_mul_mat.inc 2025-09-08 14:11:17 +08:00
Atream
72cd2a5af7 Merge pull request #1495 from kvcache-ai/Atream-patch-6
Update GGUF format link in Kimi-K2 documentation
2025-09-05 20:19:50 +08:00
Atream
64b3b30ba3 Update GGUF format link in Kimi-K2 documentation 2025-09-05 20:19:37 +08:00