ktransformers

mirror of https://github.com/kvcache-ai/ktransformers.git synced 2026-04-20 14:29:22 +00:00

Author	SHA1	Message	Date
Peilin Li	e637fedc65	[docs]: Add Full introduction of KT (#1636 )	2025-11-29 15:46:55 +08:00
Jiaqi Liao	46af8fcab5	[doc] fix kt parameters (#1629 )	2025-11-19 16:41:57 +08:00
Peilin Li	171578a7ec	[refactor]: Change named 'KT-SFT' to 'kt-sft' (#1626 ) * Change named 'KT-SFT' to 'kt-sft' * [docs]: update kt-sft name --------- Co-authored-by: ZiWei Yuan <yzwliam@126.com>	2025-11-17 11:48:42 +08:00
ZiWei Yuan	ab8ad0a110	[docs]: update web doc (#1625 )	2025-11-16 14:40:22 +08:00
ZiWei Yuan	be6db6f46b	[docs]: improve structure for kt-kernel (#1624 ) * [docs]: improve structure for kt-kernel * Update doc/en/kt-kernel/README.md	2025-11-16 13:21:41 +08:00
ZiWei Yuan	133eea037c	[docs]: improve docs structure (#1623 )	2025-11-16 12:40:59 +08:00
ZiWei Yuan	c32fefb1cd	[doc]: update web doc and kt-kernel doc (#1609 ) * [doc]: update web doc and kt-kernel doc * [doc](book.toml): add book.toml for rust book compile	2025-11-13 20:44:13 +08:00
Peilin Li	148a030026	upload hands-on tutorial with KTransformers-FT, especially in customize your KT-FT+LLaMA-Factory (#1597 ) * Add files via upload * upload hands-on tutorial for KTransformers-FT	2025-11-11 20:54:41 +08:00
Wenzhang-Chen	62b7b28a16	fix typo (#1452 )	2025-11-10 16:08:04 +08:00
Atream	b67cc4095d	Change attention backend to 'flashinfer' in launch command Updated the launch command to include 'flashinfer' as the attention backend.	2025-11-08 20:56:09 +08:00
Peilin Li	f4fe137023	Merge pull request #1572 from JimmyPeilinLi/main fix: remove py310 as guide	2025-11-08 16:57:10 +08:00
JimmyPeilinLi	1c08a4f0fb	fix: remove py310 as guide	2025-11-08 08:54:32 +00:00
Atream	0651dbda04	Simplify launch command by removing unused option Removed the unused '--attention-backend triton' option from the launch command.	2025-11-08 16:54:18 +08:00
Atream	d6ee384fe2	Fix download link for Kimi-K2-Thinking weights Updated the download link for AMX INT4 quantized weights.	2025-11-06 19:07:15 +08:00
Atream	d419024bb4	Add KTransformers SGLang inference documentation Add documentation for KTransformers SGLang inference deployment, including installation steps, model download links, server launch instructions, and performance benchmarks.	2025-11-06 17:53:58 +08:00
Peilin Li	803e645bc1	Update SFT Installation Guide for KimiK2 Added installation instructions and usage examples for KimiK2.	2025-11-06 17:34:21 +08:00
Peilin Li	d7ec838d5a	installation guide for KT+SFT(LoRA) in KimiK2 model	2025-11-06 17:27:42 +08:00
ZiWei Yuan	8192cc4166	Merge pull request #1551 from kvcache-ai/JimmyPeilinLi-patch-1 Revise GPU/CPU memory footprint information	2025-11-05 12:23:28 +08:00
Peilin Li	6721f8765d	Revise GPU/CPU memory footprint information Updated memory footprint details for DeepSeek models.	2025-11-05 12:11:19 +08:00
Peilin Li	4f9940700e	Update installation instructions	2025-11-04 23:06:05 +08:00
KMSorSMS	0c15da437f	[feat](cmake & doc): fix bug with cmake arch detect & update doc for sft	2025-11-04 08:46:26 +00:00
JimmyPeilinLi	7b6ccc3f57	add the docs and update README for KSFT	2025-11-04 05:51:48 +00:00
djw	0437660e62	fix bug	2025-09-16 13:21:58 +00:00
djw	a44b710649	support qwen3 next	2025-09-11 11:55:09 +00:00
Atream	64b3b30ba3	Update GGUF format link in Kimi-K2 documentation	2025-09-05 20:19:37 +08:00
Azure-Tang	b6d36bffbb	update kimi-k2-0905	2025-09-05 03:52:43 +00:00
djw	5771990a07	update smallthinker and glm4 readme	2025-07-31 03:14:49 +00:00
djw	c7307aa0ae	support smt and glm4	2025-07-25 16:24:38 +00:00
djw	17246bf84f	support smt and glm4	2025-07-25 15:03:27 +00:00
chenxl	b5024f62a4	Update Kimi-K2 Readme	2025-07-12 12:51:00 +08:00
Atream	34d2829f24	Update Kimi-K2.md	2025-07-12 12:44:41 +08:00
Atream	90245d8a6b	Update Kimi-K2.md	2025-07-12 11:57:51 +08:00
Atream	378e4fc035	Update Kimi-K2.md	2025-07-12 11:47:42 +08:00
Atream	b4ed8b6ded	Update Kimi-K2.md	2025-07-11 23:26:18 +08:00
UnicornChan	7800a413a2	Update Kimi-K2.md	2025-07-11 19:31:58 +08:00
Atream	b4ac21454b	Create Kimi-K2.md	2025-07-11 09:31:47 +08:00
ouqingliang	90cff820cf	update kvc disk path config.	2025-06-30 15:09:35 +00:00
ErvinXie	5a73aaf652	Update prefix_cache.md	2025-06-30 15:04:37 +08:00
ouqingliang	cc822df65d	add prefix cache documentation	2025-06-28 07:13:33 +00:00
Shaojun Liu	404ad39a04	docs: add Dockerfile.xpu and GPU driver setup instructions - Add Dockerfile.xpu for oneAPI-based container - Create Docker_xpu.md with usage instructions - Update xpu.md to include Docker guide	2025-05-28 13:55:35 +08:00
rnwang04	adc0906967	add XPU support for qwen3moe local chat	2025-05-22 21:01:41 +08:00
wang jiahao	32f3d7befb	Merge pull request #1307 from kvcache-ai/hyc add xpu parameters to install.sh	2025-05-17 15:25:33 +08:00
rnwang04	a56aa45186	fix ipex-llm version to 2.3.0rc1	2025-05-16 12:22:08 +08:00
Shaoyuan CHEN	5d194c5db0	Fix typos	2025-05-15 22:15:55 +08:00
Atream	7faa776659	Merge pull request #1277 from Coekjan/patch-1 Fix typo about `GLIBCXX_3.4.32`	2025-05-15 01:58:00 -06:00
Alisehen	055680e26c	add flashinfer to cuda device	2025-05-15 07:03:45 +00:00
Alisehen	f3be33a313	add xpu parameters to install.sh	2025-05-15 06:39:02 +00:00
Aubrey Li	72f6d93ffd	xpu.md: add device discovery tips	2025-05-15 14:12:26 +08:00
rnwang04	2f6e14a54b	fix md typo, fix code style, and update setup value error message	2025-05-15 10:14:39 +00:00
qiyuxinlin	d35d61f6a1	update readme	2025-05-14 13:15:18 +00:00

1 2 3 4

170 Commits