Peilin Li
e637fedc65
[docs]: Add Full introduction of KT (#1636)
2025-11-29 15:46:55 +08:00
Jiaqi Liao
46af8fcab5
[doc] fix kt parameters (#1629)
2025-11-19 16:41:57 +08:00
Peilin Li
171578a7ec
[refactor]: Change named 'KT-SFT' to 'kt-sft' (#1626)
* Change named 'KT-SFT' to 'kt-sft'
* [docs]: update kt-sft name
---------
Co-authored-by: ZiWei Yuan <yzwliam@126.com>
2025-11-17 11:48:42 +08:00
ZiWei Yuan
ab8ad0a110
[docs]: update web doc (#1625)
2025-11-16 14:40:22 +08:00
ZiWei Yuan
be6db6f46b
[docs]: improve structure for kt-kernel (#1624)
* [docs]: improve structure for kt-kernel
* Update doc/en/kt-kernel/README.md
2025-11-16 13:21:41 +08:00
ZiWei Yuan
133eea037c
[docs]: improve docs structure (#1623)
2025-11-16 12:40:59 +08:00
ZiWei Yuan
c32fefb1cd
[doc]: update web doc and kt-kernel doc (#1609)
* [doc]: update web doc and kt-kernel doc
* [doc](book.toml): add book.toml for rust book compile
2025-11-13 20:44:13 +08:00
Peilin Li
148a030026
upload hands-on tutorial with KTransformers-FT, especially in customize your KT-FT+LLaMA-Factory (#1597)
* Add files via upload
* upload hands-on tutorial for KTransformers-FT
2025-11-11 20:54:41 +08:00
Wenzhang-Chen
62b7b28a16
fix typo (#1452)
2025-11-10 16:08:04 +08:00
Atream
b67cc4095d
Change attention backend to 'flashinfer' in launch command
Updated the launch command to include 'flashinfer' as the attention backend.
2025-11-08 20:56:09 +08:00
Peilin Li
f4fe137023
Merge pull request #1572 from JimmyPeilinLi/main
fix: remove py310 as guide
2025-11-08 16:57:10 +08:00
JimmyPeilinLi
1c08a4f0fb
fix: remove py310 as guide
2025-11-08 08:54:32 +00:00
Atream
0651dbda04
Simplify launch command by removing unused option
Removed the unused '--attention-backend triton' option from the launch command.
2025-11-08 16:54:18 +08:00
Atream
d6ee384fe2
Fix download link for Kimi-K2-Thinking weights
Updated the download link for AMX INT4 quantized weights.
2025-11-06 19:07:15 +08:00
Atream
d419024bb4
Add KTransformers SGLang inference documentation
Add documentation for KTransformers SGLang inference deployment, including installation steps, model download links, server launch instructions, and performance benchmarks.
2025-11-06 17:53:58 +08:00
Peilin Li
803e645bc1
Update SFT Installation Guide for KimiK2
Added installation instructions and usage examples for KimiK2.
2025-11-06 17:34:21 +08:00
Peilin Li
d7ec838d5a
installation guide for KT+SFT(LoRA) in KimiK2 model
2025-11-06 17:27:42 +08:00
ZiWei Yuan
8192cc4166
Merge pull request #1551 from kvcache-ai/JimmyPeilinLi-patch-1
Revise GPU/CPU memory footprint information
2025-11-05 12:23:28 +08:00
Peilin Li
6721f8765d
Revise GPU/CPU memory footprint information
Updated memory footprint details for DeepSeek models.
2025-11-05 12:11:19 +08:00
Peilin Li
4f9940700e
Update installation instructions
2025-11-04 23:06:05 +08:00
KMSorSMS
0c15da437f
[feat](cmake & doc): fix bug with cmake arch detect & update doc for sft
2025-11-04 08:46:26 +00:00
JimmyPeilinLi
7b6ccc3f57
add the docs and update README for KSFT
2025-11-04 05:51:48 +00:00
djw
0437660e62
fix bug
2025-09-16 13:21:58 +00:00
djw
a44b710649
support qwen3 next
2025-09-11 11:55:09 +00:00
Atream
64b3b30ba3
Update GGUF format link in Kimi-K2 documentation
2025-09-05 20:19:37 +08:00
Azure-Tang
b6d36bffbb
update kimi-k2-0905
2025-09-05 03:52:43 +00:00
djw
5771990a07
update smallthinker and glm4 readme
2025-07-31 03:14:49 +00:00
djw
c7307aa0ae
support smt and glm4
2025-07-25 16:24:38 +00:00
djw
17246bf84f
support smt and glm4
2025-07-25 15:03:27 +00:00
chenxl
b5024f62a4
Update Kimi-K2 Readme
2025-07-12 12:51:00 +08:00
Atream
34d2829f24
Update Kimi-K2.md
2025-07-12 12:44:41 +08:00
Atream
90245d8a6b
Update Kimi-K2.md
2025-07-12 11:57:51 +08:00
Atream
378e4fc035
Update Kimi-K2.md
2025-07-12 11:47:42 +08:00
Atream
b4ed8b6ded
Update Kimi-K2.md
2025-07-11 23:26:18 +08:00
UnicornChan
7800a413a2
Update Kimi-K2.md
2025-07-11 19:31:58 +08:00
Atream
b4ac21454b
Create Kimi-K2.md
2025-07-11 09:31:47 +08:00
ouqingliang
90cff820cf
update kvc disk path config.
2025-06-30 15:09:35 +00:00
ErvinXie
5a73aaf652
Update prefix_cache.md
2025-06-30 15:04:37 +08:00
ouqingliang
cc822df65d
add prefix cache documentation
2025-06-28 07:13:33 +00:00
Shaojun Liu
404ad39a04
docs: add Dockerfile.xpu and GPU driver setup instructions
- Add Dockerfile.xpu for oneAPI-based container
- Create Docker_xpu.md with usage instructions
- Update xpu.md to include Docker guide
2025-05-28 13:55:35 +08:00
rnwang04
adc0906967
add XPU support for qwen3moe local chat
2025-05-22 21:01:41 +08:00
wang jiahao
32f3d7befb
Merge pull request #1307 from kvcache-ai/hyc
add xpu parameters to install.sh
2025-05-17 15:25:33 +08:00
rnwang04
a56aa45186
fix ipex-llm version to 2.3.0rc1
2025-05-16 12:22:08 +08:00
Shaoyuan CHEN
5d194c5db0
Fix typos
2025-05-15 22:15:55 +08:00
Atream
7faa776659
Merge pull request #1277 from Coekjan/patch-1
Fix typo about `GLIBCXX_3.4.32`
2025-05-15 01:58:00 -06:00
Alisehen
055680e26c
add flashinfer to cuda device
2025-05-15 07:03:45 +00:00
Alisehen
f3be33a313
add xpu parameters to install.sh
2025-05-15 06:39:02 +00:00
Aubrey Li
72f6d93ffd
xpu.md: add device discovery tips
2025-05-15 14:12:26 +08:00
rnwang04
2f6e14a54b
fix md typo, fix code style, and update setup value error message
2025-05-15 10:14:39 +00:00
qiyuxinlin
d35d61f6a1
update readme
2025-05-14 13:15:18 +00:00