SCDESPERTATE
|
b0f827d2a9
|
[chore](cuda): explicitly use ele_per_blk var for better readability (#1784)
|
2026-01-23 11:11:08 +08:00 |
|
mrhaoxx
|
b27de4068b
|
[fix]: fix exp_avx512 for act_fn (#1797)
|
2026-01-20 11:07:22 +08:00 |
|
mrhaoxx
|
e7d277d163
|
[docs]: refine README for dpo updates (#1740)
* [docs]: refine dpo tutorial
* [docs]: refine README for dpo updates
* Update doc/en/DPO_tutorial.md
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
* [docs]: update website doc & refine location
---------
Co-authored-by: ErvinXie <ervinxie@foxmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: ZiWei Yuan <yzwliam@126.com>
|
2025-12-24 11:20:08 +08:00 |
|
Jianwei Dong
|
1f79f6da92
|
[feat](kt-kernel): Add automatic deployment workflow (#1719)
|
2025-12-16 15:20:06 +08:00 |
|
mrhaoxx
|
f992de55da
|
[fix](kt-sft): fix peft adaptations for RL tasks (#1674)
|
2025-12-09 14:28:51 +08:00 |
|
Peilin Li
|
7ee80bbc3d
|
[docs]: Update README with Python 3.12 and dependency changes (#1634)
Updated Python version in installation instructions and adjusted KTransformers and flash-attention wheel filenames accordingly.
|
2025-11-29 15:46:05 +08:00 |
|
Jiaqi Liao
|
e69c67713f
|
[refactor] fix third_party issue (#1632)
* [refactor]: relocate third_party directory
* [fix]: fix custom_flashinfer for kt-sft
|
2025-11-20 13:55:55 +08:00 |
|
Peilin Li
|
171578a7ec
|
[refactor]: Change named 'KT-SFT' to 'kt-sft' (#1626)
* Change named 'KT-SFT' to 'kt-sft'
* [docs]: update kt-sft name
---------
Co-authored-by: ZiWei Yuan <yzwliam@126.com>
|
2025-11-17 11:48:42 +08:00 |
|