update readme

This commit is contained in:
qiyuxinlin
2025-04-28 22:34:47 +00:00
parent e7763a4b59
commit 89823ccb1f
2 changed files with 3 additions and 6 deletions

View File

@@ -25,12 +25,11 @@ Our vision for KTransformers is to serve as a flexible platform for experimentin
* **Apr 29, 2025**: Support AMX-Int8 and AMX-BF16([Tutorial](./doc/en/AMX.md)). Support Qwen3MoE
<p align="center">
📹 <a href="https://github.com/user-attachments/assets/fafe8aec-4e22-49a8-8553-59fb5c6b00a2/202504290023-4.mov">
Qwen3MoE+AMX
</a>
<video src="[202504290023-4.mov](https://github.com/user-attachments/assets/fafe8aec-4e22-49a8-8553-59fb5c6b00a2)" controls width="640"></video>
</p>
* **Apr 9, 2025**: Experimental support for LLaMA 4 models ([Tutorial](./doc/en/llama4.md)).
* **Apr 2, 2025**: Support Multi-concurrency. ([Tutorial](./doc/en/balance-serve.md)).

View File

@@ -10,9 +10,7 @@ Consumer-grade CPU (Core i9-14900KF + dual-channel DDR4-4000 MT/s) + RTX 4090
The results are as follows:
<p align="center">
📹 <a href="https://github.com/user-attachments/assets/fafe8aec-4e22-49a8-8553-59fb5c6b00a2/202504290023-4.mov">
Qwen3MoE+AMX
</a>
<video src="[202504290023-4.mov](https://github.com/user-attachments/assets/fafe8aec-4e22-49a8-8553-59fb5c6b00a2)" controls width="640"></video>
</p>