ikawrakow/ik_llama.cpp
Mirror of https://github.com/ikawrakow/ik_llama.cpp.git, synced 2026-02-25 07:34:10 +00:00
a4ffe2e69e86c97b0d854ce2bafcac71483e3f71
ik_llama.cpp/ggml
Iwan Kawrakow  a4ffe2e69e  2025-02-19 10:03:15 +02:00
q8_KV: AVX2 gemm/gemv
We get 254 t/s for L3-8B vs 194 t/s for q8_0 without rtr.
..
cmake           Merge mainline llama.cpp (#3)                   2024-07-27 07:55:01 +02:00
include         Adding q8_KV - Basics + AVX2 gemm/gemv          2025-02-19 10:03:15 +02:00
src             q8_KV: AVX2 gemm/gemv                           2025-02-19 10:03:15 +02:00
.gitignore      Merge mainline llama.cpp (#3)                   2024-07-27 07:55:01 +02:00
CMakeLists.txt  FA: Add option to build all FA kernels (#197)   2025-02-09 18:59:33 +02:00