ik_llama.cpp/ggml at af91231f937353eb396489eb95ff155d9b79e06e - ik_llama.cpp - Public git mirror

ikawrakow/ik_llama.cpp

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-03-08 04:50:13 +00:00

Files

History

Iwan Kawrakow af91231f93 FlashMLA: allow for f16 and bf16 cache in addition to q8_0

2025-03-02 14:22:35 +02:00

..

Merge mainline llama.cpp (#3 )

2024-07-27 07:55:01 +02:00

SER - Smart Expert Reduction (#239 )

2025-03-02 13:47:38 +02:00

FlashMLA: allow for f16 and bf16 cache in addition to q8_0

2025-03-02 14:22:35 +02:00

.gitignore

Merge mainline llama.cpp (#3 )

2024-07-27 07:55:01 +02:00

CMakeLists.txt

FA: Add option to build all FA kernels (#197 )

2025-02-09 18:59:33 +02:00