26 Commits

Author SHA1 Message Date
mrhaoxx
7a9daf0cd4 [feat](kt-kernel): support avx2 only inference for bf16 fp8 and gptq int4 (#1892)
* feat: support avx2 bf16 fp8 inference

* feat: support avx2 gptq int4 inference

* fix: numeric issues in fp8 dequant

* Tutorial avx2 (#1900)

* fix: prevent injecting -DLLAMA_AVX512=ON on AVX2-only machines

* docs: add AVX2 tutorial for running KTransformers on AVX2-only CPUs

* Tutorial avx2 (#1901)

* fix: prevent injecting -DLLAMA_AVX512=ON on AVX2-only machines

* docs: add AVX2 tutorial for running KTransformers on AVX2-only CPUs

* docs: update README.md

---------

Co-authored-by: Benjamin F <159887351+yyj6666667@users.noreply.github.com>
2026-03-27 14:45:02 +08:00
ZiWei Yuan
b096b01fbc [docs]: add kt-cli doc and update corresponding website (#1768) 2025-12-29 23:06:22 +08:00
mrhaoxx
e7d277d163 [docs]: refine README for dpo updates (#1740)
* [docs]: refine dpo tutorial

* [docs]: refine README for dpo updates

* Update doc/en/DPO_tutorial.md

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

* [docs]: update website doc & refine location

---------

Co-authored-by: ErvinXie <ervinxie@foxmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: ZiWei Yuan <yzwliam@126.com>
2025-12-24 11:20:08 +08:00
Peilin Li
171578a7ec [refactor]: Change named 'KT-SFT' to 'kt-sft' (#1626)
* Change named 'KT-SFT' to 'kt-sft'

* [docs]: update kt-sft name

---------

Co-authored-by: ZiWei Yuan <yzwliam@126.com>
2025-11-17 11:48:42 +08:00
ZiWei Yuan
be6db6f46b [docs]: improve structure for kt-kernel (#1624)
* [docs]: improve structure for kt-kernel

* Update doc/en/kt-kernel/README.md
2025-11-16 13:21:41 +08:00
ZiWei Yuan
133eea037c [docs]: improve docs structure (#1623) 2025-11-16 12:40:59 +08:00
ZiWei Yuan
c2d2edbeef [docs]: update the web docs structure (#1622) 2025-11-16 12:09:44 +08:00
ZiWei Yuan
c32fefb1cd [doc]: update web doc and kt-kernel doc (#1609)
* [doc]: update web doc and kt-kernel doc

* [doc](book.toml): add book.toml for rust book compile
2025-11-13 20:44:13 +08:00
KMSorSMS
0c15da437f [feat](cmake & doc): fix bug with cmake arch detect & update doc for sft 2025-11-04 08:46:26 +00:00
ZiWei Yuan
a0ce48ee21 Update SUMMARY.md 2025-04-03 12:00:34 +08:00
Azure-Tang
3a5330b215 Merge branch 'main' into work-concurrent 2025-04-01 06:48:19 +00:00
Atream
25cee5810e add balance-serve, support concurrence 2025-03-31 22:55:32 +08:00
liam
4748a912e2 📝 fix typo ktransformer->ktransformers 2025-03-17 17:54:00 +08:00
Azure-Tang
e5b001d76f Update readme; Format code; Add example yaml. 2025-03-14 14:25:52 -04:00
liam
05339ad0ef 📝 update benchmark.md 2025-02-25 23:57:58 +08:00
Azure
36fbeee341 Update doc 2025-02-25 08:21:18 +00:00
Azure
4dc5518e4d update fp8 kernel tutorial 2025-02-24 15:37:01 +00:00
wang jiahao
76554dd674 Update SUMMARY.md 2025-02-16 14:30:22 +08:00
liam
69b00753c2 📝 fix typo 2025-02-15 21:44:40 +08:00
Azure
ef89b1520b * Reorganize documentation/README
* Consolidate the installation section, as it's currently too cluttered
    * Move the Multi-GPU section to the top-level structure
    * Add a **detailed** tutorial on registering extra GPU memory with Marlin
2025-02-14 19:58:26 +00:00
Azure
4f4ed36442 Revert "[update] Reorganize documentation/README" 2025-02-15 03:43:48 +08:00
Azure
483182fc3a fix typo and detail 2025-02-14 19:40:15 +00:00
Azure
823b25eec9 Reorganize documentation/README 2025-02-14 19:08:17 +00:00
liam
8d5ebe49ab 📝 fix some debug output and update doc 2025-02-13 17:25:12 +08:00
liam
ad2c52d72a 📝 update doc 2025-02-13 17:16:27 +08:00
liam
c74453d8ca 📝 add doc support and fix bug in qwen2 2025-02-13 16:37:43 +08:00