ikawrakow/ik_llama.cpp
Mirror of https://github.com/ikawrakow/ik_llama.cpp.git (synced 2026-02-26 08:04:09 +00:00)
ik_llama.cpp/ggml @ 8178075f8417822fcfe78bf8758a9cf43cc31239
Latest commit: 8178075f84 by Iwan Kawrakow, 2024-08-06 12:08:22 +02:00

iq2_tn: small NEON improvement

For TriLM-3.9B we now get PP-512 = 206.6 t/s and TG-128 = 76.4 t/s.
Name            Last commit                                       Date
cmake           Merge mainline llama.cpp (#3)                     2024-07-27 07:55:01 +02:00
include         iq2_tn: TriLM specific 2.0625 bpw quantization    2024-08-05 14:22:05 +03:00
src             iq2_tn: small NEON improvement                    2024-08-06 12:08:22 +02:00
.gitignore      Merge mainline llama.cpp (#3)                     2024-07-27 07:55:01 +02:00
CMakeLists.txt  Merge mainline llama.cpp (#3)                     2024-07-27 07:55:01 +02:00