ik_llama.cpp

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-04-21 15:09:40 +00:00

Files

Iwan Kawrakow 79a57b1554 GGML_UNARY_OP_SWIGLU: CUDA implementation

I observe a ~12% speedup for PP-512 (Phi-3.5-mini).

2024-09-28 10:31:59 +03:00

.gitignore        2024-07-27 07:55:01 +02:00
CMakeLists.txt    2024-08-12 15:14:32 +02:00