ikawrakow/ik_llama.cpp
mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-03-12 06:50:08 +00:00
91b20d4becdbc9eb6ec377c1f532ddd30614a230
ik_llama.cpp/ggml
Iwan Kawrakow 91b20d4bec Use bperm trick for iq3_k gemv -> ~3% faster
2025-08-21 16:02:00 +03:00
Name | Last commit | Date
cmake | Merge mainline llama.cpp (#3) | 2024-07-27 07:55:01 +02:00
include | Revert "Better CPU prompt processing performance for SWA models (#696)" (#701) | 2025-08-17 15:44:02 +03:00
src | Use bperm trick for iq3_k gemv -> ~3% faster | 2025-08-21 16:02:00 +03:00
.gitignore | Merge mainline llama.cpp (#3) | 2024-07-27 07:55:01 +02:00
CMakeLists.txt | Enable CUDA graphs for MoE models + GPT-OSS support (#689) | 2025-08-15 09:18:07 +03:00