ikawrakow/ik_llama.cpp
Mirror of https://github.com/ikawrakow/ik_llama.cpp.git (synced 2026-04-27 09:53:40 +00:00)
Branch: ik/fattn_bf16
Path: ik_llama.cpp/ggml
Latest commit: 3e7d5c180c by Iwan Kawrakow (2025-01-15 18:09:07 +02:00): On Zen4 it is also better to not use large Q steps for fp16 K-cache
Name            Last commit                                                           Date
cmake           Merge mainline llama.cpp (#3)                                         2024-07-27 07:55:01 +02:00
include         Fix q8_0 KV cache when not using FA - WIP (AVX2)                      2025-01-15 12:13:08 +02:00
src             On Zen4 it is also better to not use large Q steps for fp16 K-cache  2025-01-15 18:09:07 +02:00
.gitignore      Merge mainline llama.cpp (#3)                                         2024-07-27 07:55:01 +02:00
CMakeLists.txt  Move to c++17 projectwide (#80)                                       2024-10-04 14:43:26 +03:00
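The "Move to c++17 projectwide (#80)" entry for CMakeLists.txt refers to raising the project's required C++ standard. As a minimal, hypothetical sketch (not the actual contents of this repository's CMakeLists.txt), a project-wide C++17 requirement in CMake typically looks like:

    # Hypothetical sketch of a project-wide C++17 requirement in CMake;
    # the real CMakeLists.txt in this repository may differ.
    cmake_minimum_required(VERSION 3.14)
    project(ik_llama_example CXX)

    set(CMAKE_CXX_STANDARD 17)          # compile all targets as C++17
    set(CMAKE_CXX_STANDARD_REQUIRED ON) # fail configuration if C++17 is unavailable
    set(CMAKE_CXX_EXTENSIONS OFF)       # prefer -std=c++17 over -std=gnu++17

Setting these variables before any add_library/add_executable calls applies the standard to every target in the build rather than per target.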