ik_llama.cpp/452 - Falcon H1 Support.md at 993cb00a347fc77632b73126f614092d659727de - ik_llama.cpp

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-02-19 20:54:36 +00:00

Falcon H1 is officially supported via TII's fork of llama.cpp: https://github.com/tiiuae/llama.cpp-Falcon-H1

Support for ik_llama.cpp's tighter quantization schemes would be nice :). Maybe something in this fork can shrink the Mamba2 context cache as well?

👤 ikawrakow commented the 2025-05-24 at 07:04:24:

Have you thought about submitting a feature request to the llama.cpp-Falcon-H1 authors?

👤 Downtown-Case commented the 2025-06-02 at 18:19:21:

Seems their implementation needs more time in the oven anyway.

👤 Downtown-Case commented the 2025-06-27 at 14:31:42:

Closing this