mirror of
https://github.com/kvcache-ai/ktransformers.git
synced 2026-04-20 06:18:59 +00:00
update readme
This commit is contained in:
@@ -25,12 +25,11 @@ Our vision for KTransformers is to serve as a flexible platform for experimentin
|
||||
* **Apr 29, 2025**: Support AMX-Int8 and AMX-BF16([Tutorial](./doc/en/AMX.md)). Support Qwen3MoE
|
||||
|
||||
<p align="center">
|
||||
📹 <a href="https://github.com/user-attachments/assets/fafe8aec-4e22-49a8-8553-59fb5c6b00a2/202504290023-4.mov">
|
||||
Qwen3MoE+AMX
|
||||
</a>
|
||||
<video src="[202504290023-4.mov](https://github.com/user-attachments/assets/fafe8aec-4e22-49a8-8553-59fb5c6b00a2)" controls width="640"></video>
|
||||
</p>
|
||||
|
||||
|
||||
|
||||
* **Apr 9, 2025**: Experimental support for LLaMA 4 models ([Tutorial](./doc/en/llama4.md)).
|
||||
* **Apr 2, 2025**: Support Multi-concurrency. ([Tutorial](./doc/en/balance-serve.md)).
|
||||
|
||||
|
||||
@@ -10,9 +10,7 @@ Consumer-grade CPU (Core i9-14900KF + dual-channel DDR4-4000 MT/s) + RTX 4090
|
||||
The results are as follows:
|
||||
|
||||
<p align="center">
|
||||
📹 <a href="https://github.com/user-attachments/assets/fafe8aec-4e22-49a8-8553-59fb5c6b00a2/202504290023-4.mov">
|
||||
Qwen3MoE+AMX
|
||||
</a>
|
||||
<video src="[202504290023-4.mov](https://github.com/user-attachments/assets/fafe8aec-4e22-49a8-8553-59fb5c6b00a2)" controls width="640"></video>
|
||||
</p>
|
||||
|
||||
|
||||
|
||||
Reference in New Issue
Block a user