mrhaoxx
7a9daf0cd4
[feat](kt-kernel): support avx2 only inference for bf16 fp8 and gptq int4 ( #1892 )
...
* feat: support avx2 bf16 fp8 inference
* feat: support avx2 gptq int4 inference
* fix: numeric issues in fp8 dequant
* Tutorial avx2 (#1900 )
* fix: prevent injecting -DLLAMA_AVX512=ON on AVX2-only machines
* docs: add AVX2 tutorial for running KTransformers on AVX2-only CPUs
* Tutorial avx2 (#1901 )
* fix: prevent injecting -DLLAMA_AVX512=ON on AVX2-only machines
* docs: add AVX2 tutorial for running KTransformers on AVX2-only CPUs
* docs: update README.md
---------
Co-authored-by: Benjamin F <159887351+yyj6666667@users.noreply.github.com >
2026-03-27 14:45:02 +08:00
ZiWei Yuan
b096b01fbc
[docs]: add kt-cli doc and update corresponding website ( #1768 )
2025-12-29 23:06:22 +08:00
mrhaoxx
e7d277d163
[docs]: refine README for dpo updates ( #1740 )
...
* [docs]: refine dpo tutorial
* [docs]: refine README for dpo updates
* Update doc/en/DPO_tutorial.md
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
* [docs]: update website doc & refine location
---------
Co-authored-by: ErvinXie <ervinxie@foxmail.com >
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: ZiWei Yuan <yzwliam@126.com >
2025-12-24 11:20:08 +08:00
Peilin Li
171578a7ec
[refactor]: Change named 'KT-SFT' to 'kt-sft' ( #1626 )
...
* Change named 'KT-SFT' to 'kt-sft'
* [docs]: update kt-sft name
---------
Co-authored-by: ZiWei Yuan <yzwliam@126.com >
2025-11-17 11:48:42 +08:00
ZiWei Yuan
be6db6f46b
[docs]: improve structure for kt-kernel ( #1624 )
...
* [docs]: improve structure for kt-kernel
* Update doc/en/kt-kernel/README.md
2025-11-16 13:21:41 +08:00
ZiWei Yuan
133eea037c
[docs]: improve docs structure ( #1623 )
2025-11-16 12:40:59 +08:00
ZiWei Yuan
c2d2edbeef
[docs]: update the web docs structure ( #1622 )
2025-11-16 12:09:44 +08:00
ZiWei Yuan
c32fefb1cd
[doc]: update web doc and kt-kernel doc ( #1609 )
...
* [doc]: update web doc and kt-kernel doc
* [doc](book.toml): add book.toml for rust book compile
2025-11-13 20:44:13 +08:00
KMSorSMS
0c15da437f
[feat](cmake & doc): fix bug with cmake arch detect & update doc for sft
2025-11-04 08:46:26 +00:00
ZiWei Yuan
a0ce48ee21
Update SUMMARY.md
2025-04-03 12:00:34 +08:00
Azure-Tang
3a5330b215
Merge branch 'main' into work-concurrent
2025-04-01 06:48:19 +00:00
Atream
25cee5810e
add balance-serve, support concurrence
2025-03-31 22:55:32 +08:00
liam
4748a912e2
📝 fix typo ktransformer->ktransformers
2025-03-17 17:54:00 +08:00
Azure-Tang
e5b001d76f
Update readme; Format code; Add example yaml.
2025-03-14 14:25:52 -04:00
liam
05339ad0ef
📝 update benchmark.md
2025-02-25 23:57:58 +08:00
Azure
36fbeee341
Update doc
2025-02-25 08:21:18 +00:00
Azure
4dc5518e4d
update fp8 kernel tutorial
2025-02-24 15:37:01 +00:00
wang jiahao
76554dd674
Update SUMMARY.md
2025-02-16 14:30:22 +08:00
liam
69b00753c2
📝 fix typo
2025-02-15 21:44:40 +08:00
Azure
ef89b1520b
* Reorganize documentation/README
...
* Consolidate the installation section, as it's currently too cluttered
* Move the Multi-GPU section to the top-level structure
* Add a **detailed** tutorial on registering extra GPU memory with Marlin
2025-02-14 19:58:26 +00:00
Azure
4f4ed36442
Revert "[update] Reorganize documentation/README"
2025-02-15 03:43:48 +08:00
Azure
483182fc3a
fix typo and detail
2025-02-14 19:40:15 +00:00
Azure
823b25eec9
Reorganize documentation/README
2025-02-14 19:08:17 +00:00
liam
8d5ebe49ab
📝 ⚡ fix some debug output and update doc
2025-02-13 17:25:12 +08:00
liam
ad2c52d72a
📝 update doc
2025-02-13 17:16:27 +08:00
liam
c74453d8ca
📝 add doc support and fix bug in qwen2
2025-02-13 16:37:43 +08:00