[docs]: refine README for dpo updates (#1740)

* [docs]: refine dpo tutorial

* [docs]: refine README for dpo updates

* Update doc/en/DPO_tutorial.md

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

* [docs]: update website doc & refine location

---------

Co-authored-by: ErvinXie <ervinxie@foxmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: ZiWei Yuan <yzwliam@126.com>
This commit is contained in:
mrhaoxx
2025-12-24 11:20:08 +08:00
committed by GitHub
parent dee1e211d5
commit e7d277d163
4 changed files with 6 additions and 3 deletions

View File

@@ -7,8 +7,9 @@
# Tutorial
- [kt-sft part](en/SFT/README.md)
- [kt-sft developer tech notes](en/SFT/KTransformers-Fine-Tuning_Developer-Technical-Notes.md)
- [Injection Tutorial](en/SFT/injection_tutorial.md)
- [kt-sft developer tech notes](en/SFT/KTransformers-Fine-Tuning_Developer-Technical-Notes.md)
- [DPO tutorial](en/SFT/DPO_tutorial.md)
<!-- - [Multi-GPU Tutorial](en/multi-gpu-tutorial.md) -->
<!-- - [Use FP8 GPU Kernel](en/fp8_kernel.md) -->
<!-- - [Use AMD GPU](en/ROCm.md) -->

View File

@@ -61,7 +61,7 @@ pip install custom_flashinfer/
## Prepare Models
We uses `deepseek-ai/DeepSeek-V2-Lite` as an example here. You can replace it with other models such as Kimi K2.
We use `deepseek-ai/DeepSeek-V2-Lite` as an example here. You can replace it with other models such as Kimi K2.
## How to start