Files
ik_llama.cpp/github-data/issues/26 - Feature Request_ Improve CPU processing speed for large contexts.md
2025-07-23 13:31:53 +02:00

974 B

#26 - Feature Request: Improve CPU processing speed for large contexts

Author ikawrakow
State Open
Created 2024-08-22

Description

Prerequisites

  • I am running the latest code. Mention the version if possible as well.
  • I carefully followed the README.md.
  • I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
  • I reviewed the Discussions, and have a new and useful enhancement to share.

Feature Description

Recent open source / open weight models provide long context window, and hence it would be useful to improve CPU processing speed for large prompts.

Motivation

See #25

Possible Implementation

See #25