mirror of
https://github.com/kvcache-ai/ktransformers.git
synced 2026-04-27 17:51:45 +00:00
Update Kimi-K2.md
@@ -5,7 +5,7 @@
 ### Overview
 We are very pleased to announce that Ktransformers now supports Kimi-K2.
 
-On a single-socket CPU with one consumer-grade GPU, running the Q4_K_M model yields roughly 10 TPS and requires about 600 GB of VRAM.
+On a single-socket CPU with one consumer-grade GPU, running the Q4_K_M model yields roughly 10 TPS and requires about 600 GB of DRAM.
 With a dual-socket CPU and sufficient system memory, enabling NUMA optimizations increases performance to about 14 TPS.
 
 ### Model & Resource Links
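The VRAM-to-DRAM correction in this hunk can be sanity-checked with a back-of-the-envelope estimate. The sketch below is illustrative only and is not part of the commit: it assumes roughly 1 trillion total parameters for Kimi-K2 and an average of about 4.8 bits per weight for Q4_K_M quantization. Both figures are assumptions, and the helper name `estimate_weight_memory_gb` is hypothetical.

```python
def estimate_weight_memory_gb(total_params: float, bits_per_weight: float) -> float:
    """Return the approximate weight footprint in decimal gigabytes."""
    bytes_total = total_params * bits_per_weight / 8  # 8 bits per byte
    return bytes_total / 1e9


# Assumptions, not stated in the commit:
ASSUMED_TOTAL_PARAMS = 1.0e12    # ~1T total parameters (assumed)
ASSUMED_BITS_PER_WEIGHT = 4.8    # rough Q4_K_M average (assumed)

estimate_gb = estimate_weight_memory_gb(ASSUMED_TOTAL_PARAMS, ASSUMED_BITS_PER_WEIGHT)
print(f"Estimated weight footprint: {estimate_gb:.0f} GB")
```

Under these assumptions the weights alone come to about 600 GB, far more than a single consumer-grade GPU can hold. That is consistent with the figure referring to system memory (DRAM) rather than VRAM.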