Files
ik_llama.cpp/ggml
Iwan Kawrakow 79a57b1554 GGML_UNARY_OP_SWIGLU: CUDA implementation
I observe ~12% speedup for PP-512(Phi-3.5-mini).
2024-09-28 10:31:59 +03:00
..
2024-07-27 07:55:01 +02:00
2024-09-28 10:11:15 +03:00
2024-07-27 07:55:01 +02:00