Files
ik_llama.cpp/github-data/pull_requests/111 - Use fused mul - unary op also for MoE models.md
2025-07-23 13:31:53 +02:00

335 B

🔀 #111 - Use fused mul - unary op also for MoE models

Author ikawrakow
State Closed
Created 2024-10-26
Updated 2024-10-26

Description

This gives us a ~1% speedup for MoE models on CUDA and Metal.