This website requires JavaScript.
Explore
Help
Register
Sign In
ikawrakow
/
ik_llama.cpp
Watch
1
Star
0
Fork
0
You've already forked ik_llama.cpp
mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced
2026-03-05 11:30:09 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
ik/bench_gp
Add File
New File
Upload File
Apply Patch
ik_llama.cpp
/
ggml
History
Iwan Kawrakow
23e90dc325
Make q4_0_r4 work with tensor row sizes that are not a multiple of 128
...
... on Zen4. Also fix q8_0 K-cache for head sizes that are not multiple of 128.
2025-01-29 09:55:10 +02:00
..
cmake
Merge mainline llama.cpp (
#3
)
2024-07-27 07:55:01 +02:00
include
CPU Flash Attention improvements (
#172
)
2025-01-15 18:19:22 +02:00
src
Make q4_0_r4 work with tensor row sizes that are not a multiple of 128
2025-01-29 09:55:10 +02:00
.gitignore
Merge mainline llama.cpp (
#3
)
2024-07-27 07:55:01 +02:00
CMakeLists.txt
Move to c++17 projectwide (
#80
)
2024-10-04 14:43:26 +03:00